Gene BAS4251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4251 
Symbol 
ID2853149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4162324 
End bp4163556 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content37% 
IMG OID637507487 
ProductTolB domain-containing protein 
Protein accessionYP_030499 
Protein GI49187247 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG CTATAGCAAG TGTCATCGCA TTATTATTCA TATTTTTAAG TTCAATTACT 
ATTACGAAAG CAGAAAATAG TGGAGTAAAA ATTGCTTTTA TTCGTCATCA TGACCTCTGG
ATTAAAGTTG ATGGAAAAGA AAAACAACTT ACAAAAGGAG AATACATAAC AGGACCGAAG
TGGTCATATG ATGGGGAGTG GCTAGCATAT GTAAAAGGAG AGAAACAAAA TACTCTTGAG
TTATATCGGC TAAAAGATGG AAAGAAAGTT ACGCCGTTTC ACTCAGAAGT ATCAAATTAT
CAATGGTCAC CAACAGAAAA TATAATTGCA TTTATATTTA CAGGTACATT ACATACCTTC
AAGGTAGAAA AAAAGAATGC AGATTTTGAA AATGTATCGG CTGGTGTAGG TGATTATGCA
TGGTACCCGA ATGGAAAGAA GTTTCTTGTA TCTTCTGAAG CACACTTACT TCCAACTGGA
TGGACAGGAG CTCAGCTATA TGAAGTACAA AAAGATGCGC ATATGAATCC TCACAAAATG
AAGCATTTGT ATGCATTGCC AAATGAACAT GATGATTTCC TAGCGTTAGT TGCAAGTGGC
TTTCAGTGGT CACCAGATCA AAAGTGGATT TCATTTTTAG CAGTACCGAC AGCTTCATGG
TCAGCTGATA GCAATACGCT TTGCTTAGTT CGTGCAGATG GTAGTCGTTT TGAAAAGGTA
GATCAAATGT TATTAAACAC ACAATGGTTC AAATGGGCGC CAGCCAACAA TATATTGGCC
TATATTGAAG GAAGCGGGAG AGTTGCGTTA GAGAATAAAC ATTTAAAAGT AAAAGAATTG
CCAGCACTTC AGCAGAACAC ATTTACACCG AAAGGATATG TCGATTGGGA TTTTACATGG
AAGAACGATA ACGTAATTAT CGTTTCACGA GCAAAAGAAG CGGGGATAGA AACTCCACCA
GAAAAAAGGC CACTACCATC TTTATATGAG ATCGATAGTA CAAGCGACGA ACAACATCGA
ATCACAAAGC CACCTCATAG GCAAGGAGAT TATCATCCGC TCTTCATGAA TAAGAGTAAT
CAATTAATAT GGATACGTTC AGACCGTAAG AAAGCGGATG TATGGCTTGC TCATAAGGAT
GGAAAGCATG AAATGAAGTG GATTGAAAAT ATAGATGTAC CAGAGTGGTA TTACGAGAAA
TGGAATTGGG AACATGTTAT CTCGGTGAAA TAA
 
Protein sequence
MKKAIASVIA LLFIFLSSIT ITKAENSGVK IAFIRHHDLW IKVDGKEKQL TKGEYITGPK 
WSYDGEWLAY VKGEKQNTLE LYRLKDGKKV TPFHSEVSNY QWSPTENIIA FIFTGTLHTF
KVEKKNADFE NVSAGVGDYA WYPNGKKFLV SSEAHLLPTG WTGAQLYEVQ KDAHMNPHKM
KHLYALPNEH DDFLALVASG FQWSPDQKWI SFLAVPTASW SADSNTLCLV RADGSRFEKV
DQMLLNTQWF KWAPANNILA YIEGSGRVAL ENKHLKVKEL PALQQNTFTP KGYVDWDFTW
KNDNVIIVSR AKEAGIETPP EKRPLPSLYE IDSTSDEQHR ITKPPHRQGD YHPLFMNKSN
QLIWIRSDRK KADVWLAHKD GKHEMKWIEN IDVPEWYYEK WNWEHVISVK