Gene Cag_0637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0637 
Symbol 
ID3747314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp908789 
End bp909991 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content42% 
IMG OID637773173 
ProductNADH dehydrogenase I, 49 kDa subunit 
Protein accessionYP_378953 
Protein GI78188615 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.447467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGAAT TAGAAAAAGC CGTGCCAAGT TCAGTAAGGG TAACACGGCA AAGTGAAAAC 
CTTGTTATTC TTGAAAAAGA TCTTGCCACC GAACAAATGG TGCTTGCAAT GGGTCCGCAA
CACCCCTCCA CGCACGGTGT GCTCAAGTTG GAGTGCTTAA CCGATGGTGA AGTGGTAACA
GAAGCTGAGC CAGTTCTTGG TTATTTGCAC CGTTGTTTTG AAAAAACGGC TGAAAATGTG
GATTATCCTG CCGTTGTGCC ATTTACCGAT CGTCTTGATT ACTTAGCAGC AATGAACAGT
GAGTTTGCTT ATGCACTTAC TGTTGAAAAA TTGCTTGATA TTGAAATTCC TCGTCGTGTT
GAGTTTATTC GTATTCTTGT AGCTGAACTT AACCGCATTG CTTCTCACCT TGTAGCTATA
GGTACTTATG GTATTGATTT GGGTGCCTTT ACGCCATTTC TCTTTTGCTT CCGTGATCGT
GAAATAATTC TTGGATTGCT TGAATGGGCA TCGGGCGCAC GAATGCTTTA TAATTATGTA
TGGGTTGGTG GGCTTGCTTA TGATGTGCCT GCTGATTTCT TAAAGCGTAT TCGTGAGTTT
TGTGCTTACT TCCGCCCAAA AGCCAAAGAG CTTGCTGATT TATTGACATC TAATGAAATT
TTTGTTAAGC GCACACATGG TATTGGAATT ATGCCTGCCG ATGTAGCTAT TAACTACGGT
TGGTCAGGGC CTATGCTCCG TGGTTCAGGC GTTGAGTGGG ATTTACGTCG TAACGATCCC
TATTCACTTT ATTCCGAACT TGATTTTAAT GTTTGTGTGC CTGATGGAAA ACACTCTGTT
ATAGGTGATT GTTTATCTCG CCACCTTGTT CGTGCCTATG AAATGGAGGA AAGTTTAAAT
ATTATTGAGC AGTGTATTGA TAAAATGCCA TCCTCTGATG GTTTTAATTC ACGCGCAGCT
ATTCCAAAGA AAATTCGTCC AAAAGCTGGT GAAGTGTACG GTAGAGCTGA AAATCCTCGT
GGCGAACTTG GTTTTTATAT CCAAAGTGAT GGAAAATCTA CAAAACCACT TCGTTGCAAA
GCGCGTTCCT CTTGTTTTGT TAATCTTTCA GCTATGAAAG ATCTTTCAAA AGGTCAATTG
ATTCCCGATC TTGTAGCAAT TATTGGTAGC ATTGATATTG TGCTTGGGGA GGTTGACCGA
TGA
 
Protein sequence
MQELEKAVPS SVRVTRQSEN LVILEKDLAT EQMVLAMGPQ HPSTHGVLKL ECLTDGEVVT 
EAEPVLGYLH RCFEKTAENV DYPAVVPFTD RLDYLAAMNS EFAYALTVEK LLDIEIPRRV
EFIRILVAEL NRIASHLVAI GTYGIDLGAF TPFLFCFRDR EIILGLLEWA SGARMLYNYV
WVGGLAYDVP ADFLKRIREF CAYFRPKAKE LADLLTSNEI FVKRTHGIGI MPADVAINYG
WSGPMLRGSG VEWDLRRNDP YSLYSELDFN VCVPDGKHSV IGDCLSRHLV RAYEMEESLN
IIEQCIDKMP SSDGFNSRAA IPKKIRPKAG EVYGRAENPR GELGFYIQSD GKSTKPLRCK
ARSSCFVNLS AMKDLSKGQL IPDLVAIIGS IDIVLGEVDR