Gene Cpha266_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0149 
Symbol 
ID4568790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp168834 
End bp170876 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content54% 
IMG OID639764751 
Productcarbon-monoxide dehydrogenase (acceptor) 
Protein accessionYP_910643 
Protein GI119355999 
COG category[C] Energy production and conversion 
COG ID[COG1151] 6Fe-6S prismane cluster-containing protein 
TIGRFAM ID[TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCTTA AAGAGAGGAT CAAGCTTGTT CACCGGCTTA ACTACAGCAA GGAGGAGGTG 
ATAAAGCATA CGGCCAATAA AGCCGTAGCC GAGATGGTTG AACATATGGA CAAGGAGGCG
ATCAGCAATA CCTTTGATCG CTTTGCGCAG CAGCATCCGC AATGCGGCTA CGGACTTACC
GGCGCGTGCT GCGCCTTCTG CTCTTACGGG CCCTGCCGGG TAACGGAAAA AACGCTCTAT
TCGGTTTGCG GCAAGGATGT CGATCTCATC GTGGCAGGCA ATGCCCTGCG ACGTCTTGCC
TCCGGTATGG CAGCGCATGG CGCACATGCG AGGGAGGTAT TTATCGCCCT CAAAGCGGCG
GCGGAAGGCA GCGCTCCCAT CCCGATCAAA TGCCCGGAAA AAGGGGTCGC CGTAGCGCGG
GCGCTCGGCA TTGAAACGGA GGGAAAAACC ATTGAGGCCA TCTGTGGAGA GATTGCCGAT
ATTTTCATTG ACGATCTGCA GCGCTCGCTT CCCAAAAGGC ACGAGACGCT CCATGCGCTT
GCGCCGAAAG AGCGCGCGGA GCTGTGGGAA AGGCTCGGCA TCATACCAAT CAGTGCCTAT
CACGAGTGTT TCGAGGTCAA TAATCTTACC AGTCACGGCA CCGATTCCGA TTTTGAAAGC
CATATGCAGG CTTTTCTTCG AACGGTTCTT GCCTATGCCA TCACCACCGT TACCAGCACG
TCGCTTGCCA CCGATATTGT CTACGGGCTT CCCCGGCGTT CAAAGCTCAA TGTCAATCTC
GGCAGCATTG TTCCTGACGG CTGCGTCAAC ATCGGCATCA ACGGGCACGC ACCGATGGTC
GCCTTTGCTA TCTGCGATAT TGTCGGCACG CCGAAAATCA TGGAAAAGGT AAAGAGAGCC
GGTGCGGATA CTATCAGGCT GTACGGCATG TGTTGTACCG GCGGAGAGTT TATTGAACGC
GACCTCAACA TCCCTCTTGT CGCCATGGCC TCTTCGGCTG AGATGGCGGT TGCTACAGGC
GCATTTGATG CCATCGTTGT TGATCAGCAG GATGTGCTGC CGGGAATGAT GCACGTAGCC
CGTCAGTTTC ATACCAGGGT CATCACCACC TCGCCTTCCG GACGGAAAGA GGGGGCGATT
GTGCTTGAAC TTGACTACTA CCTGAAGAAT CTCGATAAAA TCTATGAGCT TGCCGAAGAG
ATTCTCGATA TCGCCATTGA TAATTATCGC AACAGGGAGA GTAAAAAAGT TCATGTGCCC
AATATCAGGG CAAAGGTTGA GCTTGGGTTC AGCGTTGAAG AGGTCATGAA GCTCTTTAAC
GGTTCGATGC CCGATAAAAA AATACATGGT CTCGCAGCGC TTCTCAAAGC CGGAAAAATC
CGGGGTATCG TGAACTTCGG ATCATGCGGC AACGTTCGCG GCGCCGTGTT CGAGAGGAAC
CAGATCATTA TTGCCAAACA GCTCATAAAA AATGATGTGC TGGTCACTGC CCATGGCTGC
TCAGGAATGG GGCTCCTCTT TGCCGGCCTT GCTCACCCCG ACGCCTCCGT GTTATGCGGC
ACAGGGCTGC GGGAGGTAGT GCAGGCCAAG GATATTCCCC CGGTTCTGCA TGTCGGCGCC
TGTACTGACA GCACCAGAGC CAGTCAGATC ATGGCTTATA CGGCCAATGC CGCCGCACAG
CCCAATCCTG CCATGCCGTT TGCCATGGTT GCAGCCGATC CGGCAGCGGA AAAAACCATG
GGCGCGAGGT ACGCCTTTGT TCTTAACGGC ATCGAAACCT ACTCCTGCGT GCAGGACAAC
ACGCTGGCTT CGGATCGATT TATCGATTAC GTCAGCAACA GGCTGCGGAC GATCGTCGGC
GCGGCAATGA ACTGGAACCC CGATCCCTAT CGTACCAGCG AAGATATTCT CTGCATGCTT
GACGAGAAAC GAGCCGCTCT TGGATGGCCG GTTCGTGACT ATACGATAGG GACAAAAGAG
GAGATAGAGG ACAAGATTCC CGATACCGTT GAAATCGGTC GCTCGATCTG TACGATCATC
TGA
 
Protein sequence
MSLKERIKLV HRLNYSKEEV IKHTANKAVA EMVEHMDKEA ISNTFDRFAQ QHPQCGYGLT 
GACCAFCSYG PCRVTEKTLY SVCGKDVDLI VAGNALRRLA SGMAAHGAHA REVFIALKAA
AEGSAPIPIK CPEKGVAVAR ALGIETEGKT IEAICGEIAD IFIDDLQRSL PKRHETLHAL
APKERAELWE RLGIIPISAY HECFEVNNLT SHGTDSDFES HMQAFLRTVL AYAITTVTST
SLATDIVYGL PRRSKLNVNL GSIVPDGCVN IGINGHAPMV AFAICDIVGT PKIMEKVKRA
GADTIRLYGM CCTGGEFIER DLNIPLVAMA SSAEMAVATG AFDAIVVDQQ DVLPGMMHVA
RQFHTRVITT SPSGRKEGAI VLELDYYLKN LDKIYELAEE ILDIAIDNYR NRESKKVHVP
NIRAKVELGF SVEEVMKLFN GSMPDKKIHG LAALLKAGKI RGIVNFGSCG NVRGAVFERN
QIIIAKQLIK NDVLVTAHGC SGMGLLFAGL AHPDASVLCG TGLREVVQAK DIPPVLHVGA
CTDSTRASQI MAYTANAAAQ PNPAMPFAMV AADPAAEKTM GARYAFVLNG IETYSCVQDN
TLASDRFIDY VSNRLRTIVG AAMNWNPDPY RTSEDILCML DEKRAALGWP VRDYTIGTKE
EIEDKIPDTV EIGRSICTII