Gene GWCH70_3298 details

Gene Information

Locus tag: GWCH70_3298
Symbol:
ID: 7977204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 3324882
End bp: 3325982
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 47%
IMG OID: 644800065
Product: NADH dehydrogenase subunit D
Protein accession: YP_002951204
Protein GI: 239828580
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID: [TIGR01962] NADH dehydrogenase I, D subunit
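
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The variable names are hypothetical.

```python
# Values copied from the gene record.
start_bp = 3324882
end_bp = 3325982
gene_length_bp = 1101
protein_length_aa = 366

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein is one residue shorter than the codon count.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("protein length is consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions, all three checks should agree with the values listed in the record.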


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTCGGA CGGAAGAAAT GATTTTAAAC GTAGGACCGC AGCACCCGAG TACACACGGA 
GTTTTCCGCC TCATTTTAAA AATAGATGGA GAGATCATTC AAGAAGCGAA ACCTGTCATC
GGCTACCTCC ACCGCGGAAC GGAAAAGTTA GCGGAAAACT TACAATATAC GCAAATCATC
CCGTATACAG ACCGGATGGA TTATTTATCG GCAATGACCA ATAACTATGT TATTTGTCAT
GCAGTGGAAA CAATGATGGG CATTGAAGTT CCGGAACGAG CGGAATATTT GCGCGTTTTA
GCAATGGAAC TTGGCAGAAT CGCCAGCCAT CTTGTCTGGT GGGGGACGTA TTTGCTCGAC
CTTGGCGCCA CAAGCCCGTT TTTGTACGCA TTCCGCGAGC GGGAAATGAT TATTAATCTA
TTAAACGAGC TGTCAGGGGC GCGACTGACG TTCAATTACA TGCGCGTCGG CGGCGTGAAA
TGGGATGCGC CGGATGGATG GATTGAAAAA GTAAAACAAT TTGTCCCGTA TATGCGGGAA
AAACTCGCTG GTTATCATGA CCTTGTGACA GGAAATGAAA TTTTCCGCCA TCGTGTCATC
GGTGTTGGCA AATATACGAA AGAAGAGGCG ATCAATTATT CGTTAAGCGG CGTAAACTTG
CGTTGTACCG GCGTGAAATG GGACTTACGG AAAAACGAGC CGTATTCGAT TTATGACCGT
TTTGATTTTG ACATTCCGGT GCGGGAAGAA GGAGACTGCC TTGCCCGTTA TGAATGCCGC
TTGGCGGAAA TAGAAGAATC ATTAAAAATC ATCGAACAAG CATGTGAACA ATTTCCAAAA
AGCGGAGAAA TTATGGGGAA AGTGCCGCGC ATCATTAAAG CGCCGCCGGG AGAGACATTT
GTCCGCATTG AATCACCGCG CGGGGAAATC GGCTGTTACA TCGCCAGCGA TGGAAAGAAA
GAGCCGTACC GCATCAAATT CCGTCGGCCG TCGTTTTACA ATTTGCAAAT ACTCCCGAAA
CTGTTAAAAG GGGAAAATAT TGCGAATGTG ATTGCGATTC TTGGCTCGAT TGATATTGTG
CTCGGGGAGG TCGACGGATG A
 
Protein sequence
MLRTEEMILN VGPQHPSTHG VFRLILKIDG EIIQEAKPVI GYLHRGTEKL AENLQYTQII 
PYTDRMDYLS AMTNNYVICH AVETMMGIEV PERAEYLRVL AMELGRIASH LVWWGTYLLD
LGATSPFLYA FREREMIINL LNELSGARLT FNYMRVGGVK WDAPDGWIEK VKQFVPYMRE
KLAGYHDLVT GNEIFRHRVI GVGKYTKEEA INYSLSGVNL RCTGVKWDLR KNEPYSIYDR
FDFDIPVREE GDCLARYECR LAEIEESLKI IEQACEQFPK SGEIMGKVPR IIKAPPGETF
VRIESPRGEI GCYIASDGKK EPYRIKFRRP SFYNLQILPK LLKGENIANV IAILGSIDIV
LGEVDG