Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3715 |
Symbol | |
ID | 4024231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4146931 |
End bp | 4149261 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637963919 |
Product | carbon-monoxide dehydrogenase |
Protein accession | YP_570837 |
Protein GI | 91978178 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR02416] carbon-monoxide dehydrogenase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.264219 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATTC TCCCCGGCTC CATGCGGTTC GGTGCGGGTC AGCCCGTCAA GCGTCTCGAG GATCAGCGCC TGCTCACCGG GCACGGCCTC TATCTCGACG ACAAGCCCGC CGACGGCGCG CTGTGGCTGG TGGTGCTGCG CTCGCCTCAT GCGCACGCGA AGATCGTTGC GATCGACGGT GAGGCGGCGC GGGCGATGCC GGGCGTCGAG TCCGTGCTGA CCGGAGCCGA TCTGGTCGCC GACGCGGTCG GCACGATCCC GACTCTGCCG ATCTTCAAGC GGCCGGACGG CTCGCCAATG ACGCTGCCGC CACGGCGTCT GCTCGCGCAT GAGATCGTCC GTTTCGTCGG CGAGCCCGTC GCCGCCGTCA TCGCGTCGTC GCAAGCCGCC GCGCAAGCGG CCGCCGAGGC TGTCGTCGTC GAGTACGAAG AGCTTCCCGC CGTGACCGAT CCGACCGCGG CAATCCAGCC CGGCGCGCCG GTCGTGTACG ACACCGCTCC CGACAACATC GTCGCGGCGA TGAGCTATGG CGATGCCGCC AAGGTCGATG AGGCCTTCGC CAAAGCCGCG CACACGGTCT CGCTCGATAT CGTCAGCCAG CGGCTGATCC CTTCCGCGAT GGAGCCGCGC GCGACCATCG CCGAGATCGA GAAGAAGACC GGCCGGCTGA TCCTGCACGT GCAGTCGCAG ACGCCGGCGA CGACGCGCGA CACGCTCGCC GACGCCATCC TGAAGCGGCC GAAGGACAGC ATTCAGGTTC TGGTCGGCGA CATCGGCGGC GGTTTCGGCC AGAAGACCGG CCTCTATCCG GAGGATGGTC TCGTCGCCTA CGCGGCGGTC AAGCTCAACC GCAAGGTGCG ATGGCGCGGC GACCGGACCG ACGAATTCGT CGGCGGCACC CATGGCCGCG ACCTGACCTC GACGGCGTCG ATTGCGCTCG ACGCCAAGGG CCGCGTGCTG GCCTATCGCG TGTCGTCGAT CGGCGGCACC GGCGCCTATC TCGCTGGCGC CGGCGTGATT ATTCCGCTGG TGCTCGGCCC GTTCGTGCAG ACCGGCGTCT ACGATCTGCC GCTGGTGCAT TTCGACATCA AGGCGGTGTT GACCCACACC GCGCCGGTCG GAGCCTATCG CGGGGCCGGT CGCCCCGAGG CGGTGTACAT CATCGAGCGG CTGATGGACG CCGCTGCGCG ACAGCTCGGC ATGGACCCGC GCGCGATCCG CAAGGTCAAT TACATCAAGC CGTCGCAGCT GCCTTACACC AACGCGGTCG GGCAGGTGTA CGATAGCGGC GCCTTCGCCC ATATGATGCA GCGCGCCGCC GAACTGTCCG ACTGGGTCGG CTTCAAGGCG CGCAAGAAGG AAGCCGCGAA GAAGGGCCTG CTCTACGGCC GCGGCGTCAC AAGCTACATC GAATGGACCG GCGGCCGCGC GCATACCGAA AAGGTGAGCC TGCACGCCAC GGCGGAAGGC CGCATCGTGC TGCATTCCGG CACGCAGGCG ATGGGGCAGG GGCTCGAGAC CACCTACACC CAGATGATCG CGCAGGCGCT CGACATTCCG ATGGATCAGA TCGACGTGGT GCAGGGCAAC ACCGATCTTG CGCAGGGTTT CGGCAGCGTC GGCTCGCGCT CGCTGTTCGT CGGCGGCACT GCGGTCGCGG TGTCGACCGT CGACATGATC GCCAAGGCGC GCGAGAAGGC GGCGAACATC CTCGAAGCCT CGGTCGAGGA CATCGAGTAT TCCGGCGGCA CGCTGACGAT CGCCGGCACC GATCGCAAGA TCAGCCTGTT CGAGATCGCC GCCAAGGAGA ACGGCGCCAA GCTCAGCGTC GACAGCACCG GCGAAGTGGA CGGCCCGAGC TGGCCGAACG GCGCGCATAT CTGCGAGGTC GAGGTCGATC CAGAGACGGG TGTGTCGCGC GTGGTGCGCT ACACCACGGT CGATGACGTC GGCAATGCGG TCAATCCGAT GTTGGTGGCC GGCCAGATCC ACGGTGGCGT CGCGCAGGGG GTTGGACAAG CGCTGTACGA AGGCGCGTCC TACAATGACG ACGGCCAGTT GGTGACCGCG AGCTATCAGG ACTACTGCAT CCCGCGGGCC GACAATCTGC CGCCGATCTC GGTGACGCTC GATCCTTCGG CGCCGTGCCG GACCAATCCG CTCGGCGCCA AGGGCTGCGG CGAATCCGGC GCGATCGGCG GTCCCCCTTG CGTGGTCCAT GGCGTGCTCG ATGCGCTGGC ACCGCTCGGC GTCACCGCGC TGAACACGCC GCTGACGCCG GAAAAGGTGT GGCGCGCAAT TCAGGAAGCC AAAGCTGCGC AGGCGGCCTG A
|
Protein sequence | MNILPGSMRF GAGQPVKRLE DQRLLTGHGL YLDDKPADGA LWLVVLRSPH AHAKIVAIDG EAARAMPGVE SVLTGADLVA DAVGTIPTLP IFKRPDGSPM TLPPRRLLAH EIVRFVGEPV AAVIASSQAA AQAAAEAVVV EYEELPAVTD PTAAIQPGAP VVYDTAPDNI VAAMSYGDAA KVDEAFAKAA HTVSLDIVSQ RLIPSAMEPR ATIAEIEKKT GRLILHVQSQ TPATTRDTLA DAILKRPKDS IQVLVGDIGG GFGQKTGLYP EDGLVAYAAV KLNRKVRWRG DRTDEFVGGT HGRDLTSTAS IALDAKGRVL AYRVSSIGGT GAYLAGAGVI IPLVLGPFVQ TGVYDLPLVH FDIKAVLTHT APVGAYRGAG RPEAVYIIER LMDAAARQLG MDPRAIRKVN YIKPSQLPYT NAVGQVYDSG AFAHMMQRAA ELSDWVGFKA RKKEAAKKGL LYGRGVTSYI EWTGGRAHTE KVSLHATAEG RIVLHSGTQA MGQGLETTYT QMIAQALDIP MDQIDVVQGN TDLAQGFGSV GSRSLFVGGT AVAVSTVDMI AKAREKAANI LEASVEDIEY SGGTLTIAGT DRKISLFEIA AKENGAKLSV DSTGEVDGPS WPNGAHICEV EVDPETGVSR VVRYTTVDDV GNAVNPMLVA GQIHGGVAQG VGQALYEGAS YNDDGQLVTA SYQDYCIPRA DNLPPISVTL DPSAPCRTNP LGAKGCGESG AIGGPPCVVH GVLDALAPLG VTALNTPLTP EKVWRAIQEA KAAQAA
|
| |