Gene BAS5141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5141 
Symbol 
ID2852740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp5025425 
End bp5026918 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content42% 
IMG OID637508396 
ProductNADH dehydrogenase subunit M 
Protein accessionYP_031380 
Protein GI49188127 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.97602 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGATT TGTTATTAAC GTTCTTCATT TTTTCCCCGC TTTTAGGAAT TCTCTTACTT 
GCATTAACAC CGAAAAAAGA ATCGCGTACA GTGCGAGCGC TCGGTTTATT TGGAACAGTG
CTTCCATTTG GAATGGCTAT CGTCCTCGCT TGTACATATG CTTCTGGAAG GGATTTGTCT
CTATTTGATG AAAAAGTGAA ATGGATTAAA TTCGGGGATT TTGCAGCGGT AGATAAAAGG
TGGTTTTCCA TTTATTACGA ACTTGGTATA GATGGTTTAT CGCTCGTTAT GATGGTGCTG
ACGGCCCTTC TAGCGATGCT TGCAGCATTT ACAATTAAAA GAAATTTGAA AGCATTTTAT
ATGCTGTTAC TCATGTTAGA AATAGGGATG CTCGGCGTCT TCGCTGCTCA AAATTTAATG
TTGTTCTTTA TCTTCTTTGA AATTACATTG CCGCCGATGT TTTTATTAAT TGGGAAGTGG
GGGAAATTGT CGAGTGAAAA GGCTGCTTAT AGTTATTTAA TATATAACGG TATTGGCTCC
GCTATTTTAC TCATCGTTTT CTCGGTTTTA TTTGCGAAAA CAGGTACAAC AAATATTACA
GAACTGAAAG AAATATTAGC GAGTGTAGGT GCTGGGGGAG GAATGGTTGT CCCAAGTAGC
TTACAGTTCG GCTTGTTTCT TGCCATAATG ATTGCTTTCG CAATTAAATT ACCCGTTTTC
CCGTTACATC GCTGGATGGT CAATGTACAT ATTGAAGCGC ATCCGGCTGT AGTAATGCTT
CATGCAGGTG TTTTACTGAA GATTGGAGCG TACGGTATAA TTCGCTTCGG GCAGGGGCTA
TTCCCGGAAT ACTTTCGCGA GTTTGCAACG CTCATTGCGA TTTTAGGGGT TATCAATTTA
TTGTACGGAG CCTTCTTAGC GCTCATTCAA ACAGACTTTC GGAAGGTGCT TGCTTATTCT
AGTATTTCGC ATATGGGTAT TGTATTAATG GGCCTTGCGG CGTTAAATGC ACCAGGTACA
CAAGGGGCAC TATTCCAAGT TGTGTCGCAT GGCTTAATTG CAGCCTTACT CTTTTTCTTA
CTTGGTGTTA TAGAACAGCG TTTTGGAACG TCGGATATTA CAGCGCTTGG CGGACTTGCA
AAAAGTGTAC CAGTACTTAG CGGTTTCTTC TTAGCGGGAG GAATGGCATC GCTTGGATTG
CCGGGAATGT CTGGATTTGT TAGCGAGTTT CTTGCCTTTC TCGGTTTATT CCAAGGAGAG
CCAGTCATTG CTGCGGCCGG AGTACTTGGC ATCATTTTAA CAGCTGTATA CGTATTAAGA
GCAACACTGC AAGTAACATT TGGTAAGAAA GAGTGGGAAG CGAAAGCTGA TATACACGGA
TGGGAGTATG TTCCTATCTT GCTACTTATC TTCTGCATTA TTGCAATTGG CGTAATGCCA
GAAATACTAG GGGATCCGCT TCAAAATACA TTGAAAACAT TGGGGGTGAA GTAG
 
Protein sequence
MNDLLLTFFI FSPLLGILLL ALTPKKESRT VRALGLFGTV LPFGMAIVLA CTYASGRDLS 
LFDEKVKWIK FGDFAAVDKR WFSIYYELGI DGLSLVMMVL TALLAMLAAF TIKRNLKAFY
MLLLMLEIGM LGVFAAQNLM LFFIFFEITL PPMFLLIGKW GKLSSEKAAY SYLIYNGIGS
AILLIVFSVL FAKTGTTNIT ELKEILASVG AGGGMVVPSS LQFGLFLAIM IAFAIKLPVF
PLHRWMVNVH IEAHPAVVML HAGVLLKIGA YGIIRFGQGL FPEYFREFAT LIAILGVINL
LYGAFLALIQ TDFRKVLAYS SISHMGIVLM GLAALNAPGT QGALFQVVSH GLIAALLFFL
LGVIEQRFGT SDITALGGLA KSVPVLSGFF LAGGMASLGL PGMSGFVSEF LAFLGLFQGE
PVIAAAGVLG IILTAVYVLR ATLQVTFGKK EWEAKADIHG WEYVPILLLI FCIIAIGVMP
EILGDPLQNT LKTLGVK