Gene Cag_0638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0638 
Symbol 
ID3747315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp909988 
End bp911106 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content40% 
IMG OID637773174 
ProductNADH dehydrogenase I chain H 
Protein accessionYP_378954 
Protein GI78188616 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.621422 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTAA CTGCTTTATC GCAATTATCA CTCCCTCTTT TTATGGGCAC CACTCTTAAT 
GCTTGGTCAG ATGCACTTGC TGGTTTTACT CCTTGGGGTT TACCTGTAGG ATTGCTGATT
ATTGCTGCTA TTCCCCTTGT GTTTATTGCT TTATATGCCT TAACCTATGG TGTGTATGGT
GAGCGAAAAA TATCAGCTTT TATGCAAGAT CGCCTTGGAC CTATGGAGGT TGGTAAGTGG
GGTATTTTGC AGACATTAGC CGATATTCTT AAGCTTTTAC AAAAAGAGGA TATTGTGCCA
CTGTCAGCCG ATAAATTTCT TTTTGTTATT GGTCCTGGAG TGTTGTTTGT TGGTTCTTTT
TTAGCATTTG CGGTGTTGCC ATTTAGCCCT GCATTTATTG GGGCAAGTCT TAATGTTGGT
CTTTTTTATG CTGTTGGAAT TGTAGCACTT GAAGTAGTTG GTATTCTTGC CGCAGGTTGG
GGATCGAATA ATAAGTGGTC GTTGTATGGT GCTGTTCGAA GCGTAGCCCA AATTGTGAGC
TATGAAATTC CAGCATCAAT TGCATTGCTT TGTGGTGCTA TGATGGCAGG CACACTTGAT
ATGCAAAAAA TTACGATCTT GCAATCGGGA GAACTTGGTT TTGCTCATTT TTATCTTTTT
CAAAATCCAA TTGCTTGGTT ACCATTCCTT ATTTATTTTA TTGCTTCACT TGCTGAAACA
AACCGAGCAC CATTTGATAT TCCTGAAGCT GAATCGGAGT TAGTTGCAGG ATACTTTACA
GAATACTCAG GTATGAAGTT TGCGGTTATC TTTCTTGCTG AGTATGGTCG TATGTTTATG
GTGTCGGCTA TTATTTCTAT TGTATTTCTT GGTGGCTGGA ATTCGCCGCT TCCTAATATT
GGAGCTTTTG AGTTAAATAC ATGGACAAGT GGTGCGGTGT GGGGTGCATT TTGGATTATT
ATGAAAGGAT TTTTCTTTAT TTTTGTGCAG ATGTGGCTTC GTTGGACACT CCCTCGTTTA
AGGGTTGATC AGCTTATGTA TCTTTGCTGG AAAGTTCTTA CGCCGTTTGC TTTTGTCAGC
TTTGTGCTGA CTGCACTATG GGAAATATAT GTTCCTTAG
 
Protein sequence
MTVTALSQLS LPLFMGTTLN AWSDALAGFT PWGLPVGLLI IAAIPLVFIA LYALTYGVYG 
ERKISAFMQD RLGPMEVGKW GILQTLADIL KLLQKEDIVP LSADKFLFVI GPGVLFVGSF
LAFAVLPFSP AFIGASLNVG LFYAVGIVAL EVVGILAAGW GSNNKWSLYG AVRSVAQIVS
YEIPASIALL CGAMMAGTLD MQKITILQSG ELGFAHFYLF QNPIAWLPFL IYFIASLAET
NRAPFDIPEA ESELVAGYFT EYSGMKFAVI FLAEYGRMFM VSAIISIVFL GGWNSPLPNI
GAFELNTWTS GAVWGAFWII MKGFFFIFVQ MWLRWTLPRL RVDQLMYLCW KVLTPFAFVS
FVLTALWEIY VP