Gene Information |
Locus tag | GWCH70_3298 |
Symbol | |
ID | 7977204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3324882 |
End bp | 3325982 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644800065 |
Product | NADH dehydrogenase subunit D |
Protein accession | YP_002951204 |
Protein GI | 239828580 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | [TIGR01962] NADH dehydrogenase I, D subunit |
|
|
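The coordinate and length fields in the table above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3324882
end_bp = 3325982

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1101 366
```

Both results agree with the Gene Length (1101 bp) and Protein Length (366 aa) rows.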
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCGGA CGGAAGAAAT GATTTTAAAC GTAGGACCGC AGCACCCGAG TACACACGGA GTTTTCCGCC TCATTTTAAA AATAGATGGA GAGATCATTC AAGAAGCGAA ACCTGTCATC GGCTACCTCC ACCGCGGAAC GGAAAAGTTA GCGGAAAACT TACAATATAC GCAAATCATC CCGTATACAG ACCGGATGGA TTATTTATCG GCAATGACCA ATAACTATGT TATTTGTCAT GCAGTGGAAA CAATGATGGG CATTGAAGTT CCGGAACGAG CGGAATATTT GCGCGTTTTA GCAATGGAAC TTGGCAGAAT CGCCAGCCAT CTTGTCTGGT GGGGGACGTA TTTGCTCGAC CTTGGCGCCA CAAGCCCGTT TTTGTACGCA TTCCGCGAGC GGGAAATGAT TATTAATCTA TTAAACGAGC TGTCAGGGGC GCGACTGACG TTCAATTACA TGCGCGTCGG CGGCGTGAAA TGGGATGCGC CGGATGGATG GATTGAAAAA GTAAAACAAT TTGTCCCGTA TATGCGGGAA AAACTCGCTG GTTATCATGA CCTTGTGACA GGAAATGAAA TTTTCCGCCA TCGTGTCATC GGTGTTGGCA AATATACGAA AGAAGAGGCG ATCAATTATT CGTTAAGCGG CGTAAACTTG CGTTGTACCG GCGTGAAATG GGACTTACGG AAAAACGAGC CGTATTCGAT TTATGACCGT TTTGATTTTG ACATTCCGGT GCGGGAAGAA GGAGACTGCC TTGCCCGTTA TGAATGCCGC TTGGCGGAAA TAGAAGAATC ATTAAAAATC ATCGAACAAG CATGTGAACA ATTTCCAAAA AGCGGAGAAA TTATGGGGAA AGTGCCGCGC ATCATTAAAG CGCCGCCGGG AGAGACATTT GTCCGCATTG AATCACCGCG CGGGGAAATC GGCTGTTACA TCGCCAGCGA TGGAAAGAAA GAGCCGTACC GCATCAAATT CCGTCGGCCG TCGTTTTACA ATTTGCAAAT ACTCCCGAAA CTGTTAAAAG GGGAAAATAT TGCGAATGTG ATTGCGATTC TTGGCTCGAT TGATATTGTG CTCGGGGAGG TCGACGGATG A
|
Protein sequence | MLRTEEMILN VGPQHPSTHG VFRLILKIDG EIIQEAKPVI GYLHRGTEKL AENLQYTQII PYTDRMDYLS AMTNNYVICH AVETMMGIEV PERAEYLRVL AMELGRIASH LVWWGTYLLD LGATSPFLYA FREREMIINL LNELSGARLT FNYMRVGGVK WDAPDGWIEK VKQFVPYMRE KLAGYHDLVT GNEIFRHRVI GVGKYTKEEA INYSLSGVNL RCTGVKWDLR KNEPYSIYDR FDFDIPVREE GDCLARYECR LAEIEESLKI IEQACEQFPK SGEIMGKVPR IIKAPPGETF VRIESPRGEI GCYIASDGKK EPYRIKFRRP SFYNLQILPK LLKGENIANV IAILGSIDIV LGEVDG
|
| |
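The protein sequence can be derived from the gene sequence by codon-by-codon translation. The sketch below is one way to do that in plain Python, under these assumptions: the gene sequence is given 5' to 3' on the coding strand (as it appears to be here, since it begins with ATG), and the codon assignments of translation table 11 match those of the standard genetic code, with the two tables differing only in which codons may act as start codons. The names `CODON_TABLE` and `translate` are illustrative, not part of any database API.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: translation table 11 uses the same codon assignments as the
# standard code; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 20 bases of the gene sequence above.
print(translate("ATGCTTCGGA CGGAAGAAAT"))  # MLRTEE
```

The output matches the first six residues of the protein sequence in the record. Passing the full gene sequence to `translate` should allow the same comparison for the whole protein.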