Gene BAS5140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5140 
Symbol 
ID2852797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp5023903 
End bp5025423 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content39% 
IMG OID637508395 
ProductNADH dehydrogenase subunit N 
Protein accessionYP_031379 
Protein GI49188126 
COG category[C] Energy production and conversion 
COG ID[COG1007] NADH:ubiquinone oxidoreductase subunit 2 (chain N) 
TIGRFAM ID[TIGR01770] proton-translocating NADH-quinone oxidoreductase, chain N 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATGA ATACGTTACT TAGCTTATCG TGGCATCTCA TGGTGCCAGA ATTCATCATT 
CTCGGGGCTG CCATCCTTCT TTCCATATGT GATTTGTTTT TTAAGCTGAA CCATAGATAT
GTAGCGCTTG GTGCAATCGC GGCGGTCGTA TTAGCAATCG TGTCACTAAT TACGCTATAT
AGCGAACCAG CGGGAGATAT TTTAAATGGA TCGTTTGTGT TAGATGGATT TTCAAAAGGA
TTTAAAACGT TGTTGTTAGG CGGAGCCGCC CTCATCTTAT GCACCGCAAT GAGCGATGAT
AAGAAAAATC CAATTGAAGA TAAGGGAGAG TATTATTACT TATTTTTAAT GGCGCTTCTT
GGGGCGATGT TTATGGCTTC TAGCGTTGAT TTCGTTACAC TTTTCGTCGG TTTAGAATTG
CTTTCACTTT CTTCCTACAT TTTAGTAGGA ATACGAAAAA AGAACCGTGC ATCAAATGAG
GCGGCCATGA AATATGTCAT TAATGGAGGA ATTGGAACGG CAATTACACT CTTTGGAATG
AGTTACTTAT ACGGCATTAC AGGGTCAACT AATATCGTGG ATATGCAAAA AGTATTTGCC
GGGGAACTGG CTAGTGGCAT TCAGTTATTG CTAGCTCTCG CATTCCTACT CTTACTCGTT
GGGCTCTCAT TCAAAATTGC GACAGTGCCG TTTCATATGT GGGCACCTGA TGTGTATGAG
GGAGCGGCTA CACCTGTTAC TGCTTTTCTT GGAACAATTT CTAAAATGGC GGGTTTCTTA
CTCATTATTC GTTTGTTCCT CATGGTTTTT GCAAGTGTAT CAGTGCAAGG AGATATGCAA
TCTTTATACG GACGTATGAG CATATATATC GCTGTGCTAG CTAGCATTAC GATGATTATT
GGAAATGTAG TCGCATTAAA GCAATATAAC GTAAAACGCC TATTTGCTTA TTCGGGAATC
GCCCATGCGG GATATTTACT CGTTCCGCTT GTCGCATTAT CACCATTTAC GATGGATAGT
ATGTGGTTTT ACATGCTTGC GTATATGCTT ATGAATATAG GCGCGTTTGC AATTATCCAC
GGCTTAATCT TACAAAGCAA TAAAGAAAAT ATTACCATTT TCACTGGATT ATATAAAAGG
TCGCCATTTA CAGCGATTGT GATGACGATT TTTATTTTAT CGCTAGCGGG GATACCAGGA
ACAGCTGGTT TCATTGGGAA AATTAACATC TTTTTAGGGG CTCTTCATGT AGAGCCAGCT
CATTACGTAC TAGCTTCTAT TATGATGGGG ACGACAGTCA TTTCATTCGT ATATTATTTC
CGTATTTTAC AGCAAATGTT TTTCCGGACG GGAGAAGTAG AAGAGAAAAT TCGCTTGCCG
CTCAATATAA AGATTGTGAT GAGCTTTTGT GCAATTTCGA TTGTAATACT AGGGATTGTG
CCGATGATTG GATACAATTT CTTTTATGAA TATTTTCCAT TAATGAAAGA TTTCTTCTTC
TTAGGGAACG TGGTACAATA G
 
Protein sequence
MDMNTLLSLS WHLMVPEFII LGAAILLSIC DLFFKLNHRY VALGAIAAVV LAIVSLITLY 
SEPAGDILNG SFVLDGFSKG FKTLLLGGAA LILCTAMSDD KKNPIEDKGE YYYLFLMALL
GAMFMASSVD FVTLFVGLEL LSLSSYILVG IRKKNRASNE AAMKYVINGG IGTAITLFGM
SYLYGITGST NIVDMQKVFA GELASGIQLL LALAFLLLLV GLSFKIATVP FHMWAPDVYE
GAATPVTAFL GTISKMAGFL LIIRLFLMVF ASVSVQGDMQ SLYGRMSIYI AVLASITMII
GNVVALKQYN VKRLFAYSGI AHAGYLLVPL VALSPFTMDS MWFYMLAYML MNIGAFAIIH
GLILQSNKEN ITIFTGLYKR SPFTAIVMTI FILSLAGIPG TAGFIGKINI FLGALHVEPA
HYVLASIMMG TTVISFVYYF RILQQMFFRT GEVEEKIRLP LNIKIVMSFC AISIVILGIV
PMIGYNFFYE YFPLMKDFFF LGNVVQ