Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3957 |
Symbol | |
ID | 3911764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4515405 |
End bp | 4517735 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637885861 |
Product | carbon-monoxide dehydrogenase |
Protein accession | YP_487561 |
Protein GI | 86751065 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR02416] carbon-monoxide dehydrogenase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.92859 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATTC TTCCCGGCTC CATGCGGTTC GGCGCGGGGC AGCCCGTCAA GCGTCTCGAG GATCAGCGGC TCGTCACCGG ACACGGGCAC TATCTCGACG ACAAGCCCGC CGACGGCGCG TTGTGGCTGG TGGTGCTGCG CTCACCACAC GCGCACGCCA AGATCGTCTC GATCGATGCC GAGGCGGCGC GCGCGATGCC GGGAGTCGAA AGCGTTCTGA CCGGCGCGGA CCTCGTCGCC GACGAGATCG GCACGATCCC GACCCTGCCG ATCTTCAAGC GGCCGGACGG TTCGCCGATG CTGCTGCCGC CGCGCCGGCT CTTGGCGCAC GAGATCGTCC GCTTCGTCGG CGAGCCGGTC GCCGCGGTGA TCGCGGCGTC GCAGGCCGCG GCGCAGGCTG CGGCCGAGGC GGTCGTCGTC GAGTATGAAG AATTGCCGGC GGTGACCGAT CCGGTCGCGG CGATCCAGCC CGGCGCGCCG GTGGTGGTCG AGACCGCGCC CGACAACATC GTCGCGGCGA TGAGCTATGG CGATGCCGCC AAGGTCGATG AGGCTTTCGC CAGCGCCGCG CACACCGTGT CGCTCGACAT CGTCAGCCAG CGCCTGATCC CCTCGGCGAT GGAGCCGCGC GCCACTATCG CGGAAATCGA GAAGAAGACC GGCCGGCTGA TCCTGCACGT GCAGTCGCAG ACGCCGGCGC AGACCCGCGA CGCGCTCGCC GACGCGATCC TGAAGCGGCC GAAGGAGTCG ATCCAGGTGC TGGTCGGCGA CATCGGCGGC GGTTTCGGCC AGAAGACCGG CGTCTATCCC GAGGACGCGC TGGTGGCCTA TGCGGCGGTG AAGCTCAACA AGAAGATCCG CTGGCGCGGG GACCGCACCG ACGAATTCGT CGGTGGCACC CATGGCCGCG ACCTGACCTC GACCGCGTCG ATCGCGCTCG ACGCCAAGGG CCGCGTGCTG GCCTATCGGG TGTCGTCGAT CGGCGGCACC GGCGCGTATC TCGCCGGCGC CGGCGTGATC ATTCCGCTGG TGCTCGGCCC GTTCGTGCAG ACCGGCGTCT ATGATCTGCC GCTGGTGCAT TTCGACATCA AGGCGGTGAT GACCCACACC GCGCCCGTCG GCGCCTATCG CGGCGCAGGC CGCCCGGAAG CCGTGTACAT CATCGAGCGC CTGATGGACG CCGCGGCGCG CCAGCTGAAC ATGGACCCGC GCGCGATCCG CAAGGTCAAC TACATCAAGC CGACGCAACT GCCCTACACC AACGCGGTCG GGCAGGTGTA CGATTCGGGC GCCTTCGCGC ATCTGATGCA GCGCGCGACC GAGCTGTCCG ACTGGGACGG CTTCAAGGCG CGCAAGAAGG AAGCGCAGAA GAAGGGCCTG CTCTACGGCC GCGGCGTCAC CAGCTACATC GAATGGACCG GCGGCCGCGC CCACACCGAG AAGGTCAGCC TGCACGCCAC CGCGGAAGGC CGCATCGTGC TGCATTCCGG CACGCAGGCG ATGGGGCAGG GGCTGGAGAC CACCTACTCG CAGATGATCG CGCAGGCGCT CGACATCCCG ATCGAGAGCA TCGACGTCGT GCAGGGCAAC ACCGATCTGG CGCAGGGCTT CGGCAGCGTC GGCTCGCGCT CGCTGTTCGT CGGCGGCACC GCGGTCGCGG TGTCGACCGT CGATATGATC GCCAAAGCGC GCGAGAAGGC CGCGAACATT CTCGAAGCCT CGATCGAGGA CATCGAGTAT TCCGGCGGCA TGCTGACGAT CGCCGGCACC GATCGCAAGA TCAGCCTGTT CGAAATCGCC GCCAAGGAAA AAGGTACCAA GCTCAGCGTC GATTCGACCG GCGAAGTCGA CGGTCCGAGC TGGCCGAACG GCGCGCATAT CTGCGAGGTC GAGGTCGATC CCGAAACCGG CGTCAGCCGT GTGGTGCGCT ACACCACGGT CGACGACGTC GGCAATGCGG TCAATCCGAT GCTGGTCGCG GGGCAGATCC ATGGCGGCGT CGCGCAGGGC GTCGGCCAGG CGCTGTACGA AGGCGCGGCC TATAACGACG ACGGCCAGCT GCTGACCGCG AGCTATCAGG ACTACTGCAT CCCGCGCGCC GACAATCTGC CGCCGATCAA CGTCACGCTC GATCCGTCGG CGCCGTGCCG GACCAATCCG CTCGGCGCCA AGGGCTGCGG CGAATCCGGC GCGATCGGTG GGCCGCCCTG CGTCGTCCAC GGCGTGCTCG ACGCGCTGGC GCCGCTCGGC GTCACCACGC TGAACACGCC GCTGACCCCG GAAAAGGTGT GGCGGGCGAT CCAGGACGCC AAGGCCGCGC AGGCGGCCTG A
|
Protein sequence | MNILPGSMRF GAGQPVKRLE DQRLVTGHGH YLDDKPADGA LWLVVLRSPH AHAKIVSIDA EAARAMPGVE SVLTGADLVA DEIGTIPTLP IFKRPDGSPM LLPPRRLLAH EIVRFVGEPV AAVIAASQAA AQAAAEAVVV EYEELPAVTD PVAAIQPGAP VVVETAPDNI VAAMSYGDAA KVDEAFASAA HTVSLDIVSQ RLIPSAMEPR ATIAEIEKKT GRLILHVQSQ TPAQTRDALA DAILKRPKES IQVLVGDIGG GFGQKTGVYP EDALVAYAAV KLNKKIRWRG DRTDEFVGGT HGRDLTSTAS IALDAKGRVL AYRVSSIGGT GAYLAGAGVI IPLVLGPFVQ TGVYDLPLVH FDIKAVMTHT APVGAYRGAG RPEAVYIIER LMDAAARQLN MDPRAIRKVN YIKPTQLPYT NAVGQVYDSG AFAHLMQRAT ELSDWDGFKA RKKEAQKKGL LYGRGVTSYI EWTGGRAHTE KVSLHATAEG RIVLHSGTQA MGQGLETTYS QMIAQALDIP IESIDVVQGN TDLAQGFGSV GSRSLFVGGT AVAVSTVDMI AKAREKAANI LEASIEDIEY SGGMLTIAGT DRKISLFEIA AKEKGTKLSV DSTGEVDGPS WPNGAHICEV EVDPETGVSR VVRYTTVDDV GNAVNPMLVA GQIHGGVAQG VGQALYEGAA YNDDGQLLTA SYQDYCIPRA DNLPPINVTL DPSAPCRTNP LGAKGCGESG AIGGPPCVVH GVLDALAPLG VTTLNTPLTP EKVWRAIQDA KAAQAA
|
| |