Gene BAS5066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5066 
Symbol 
ID2849287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4940459 
End bp4941616 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content34% 
IMG OID637508321 
Producthypothetical protein 
Protein accessionYP_031305 
Protein GI49188052 
COG category[S] Function unknown 
COG ID[COG3274] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAA GCGCACCAGA ATTTAAAGTT TTGCAAAGCA TTGCATTCCT TGCTGTCGTT 
TTGCAAAGTT CGTTATTATA TACAATGAAT CAAGGAAATG TCTTACTTGA GCAATCTCTC
ATTATGGGCA TGCTATTTAA CCTTGCAAAA TTTTCGGCAC CTGCATTCAT ATTTATCGTT
GGATTTCATT TAATTCGTCA CTATACAAAG CAATTAGTAT ACAAAGAATA TATTTCTGAA
AAAGCCGCAC ATTTACTCAT TCCTTATTTC TTCTGGTCTA TTCTTTACTT ATTAACAACA
AACGATATGA TCACATTACA AGGCGGAATA AAAAGTGTAT TACTCGGAAC GGCTGCACCT
CACCTTTGGT ACGTAATTAT GATGTTCCAA ATTCACTTAT TGTTCCCTTT GCTGTGCACA
CTATTTTATT GGTTTCAAAA ACGAACAGAA AATAAAAAAG ACATATATAA ATATATGACC
ATCTTTGCTT GTCTATATTT CCTCTTAATG TGGTATTCTT CGCACTACAT TTTTAATGGA
GAGAAATTGA CTAGCTCAAC CATTTTACAT TATACAGATC GTTCCTTCTT CTTCTATTCG
TTCTATTTCG TCATGGGAGG AATCGCTGCT GTAGCACTAA AAACATGGCG GCTATTCGTC
ATGAAACATA TCCCGCTTAT CACAATCTTA TTTTTCATCT TATTTTTATT CATCAATTAT
GAGTTATTTA GTTTTTACGG CGCAAACTCT ATTCATTTAA CCGTTTCGAC TTATTTAAAA
CCGTCTATGT TTTTATATAT CGTATGCGAA ATTATTATAC TTTACGTGCT TTCTATTACA
ATCGTACAGC GACGCGGTTT CTTATATAAA GCTTTACGAT TTATCGGGAA TTACACGTAT
GGTGCTTATT TAGCTCACTT TTTCTTCTTG CAACTATGTA CAAAGTTTCT TTCTTTATTC
ACACTGCAAG AAAACACAAT ATTATATAGC TTATTATTAT TTACAATAAC GGCTACAATC
TCAATTTCAG CAATGGTCCT TTGTAGTACA CTACCATTTC ATACGTGGAT TACAGGACCG
TCTCCTAGGG CAACTGTGAG ATGGGCGAAG ATCGTACTTC GGAAACATCA TGAAAAAGTA
TGTAAACCAT ATCTTTGA
 
Protein sequence
MTQSAPEFKV LQSIAFLAVV LQSSLLYTMN QGNVLLEQSL IMGMLFNLAK FSAPAFIFIV 
GFHLIRHYTK QLVYKEYISE KAAHLLIPYF FWSILYLLTT NDMITLQGGI KSVLLGTAAP
HLWYVIMMFQ IHLLFPLLCT LFYWFQKRTE NKKDIYKYMT IFACLYFLLM WYSSHYIFNG
EKLTSSTILH YTDRSFFFYS FYFVMGGIAA VALKTWRLFV MKHIPLITIL FFILFLFINY
ELFSFYGANS IHLTVSTYLK PSMFLYIVCE IIILYVLSIT IVQRRGFLYK ALRFIGNYTY
GAYLAHFFFL QLCTKFLSLF TLQENTILYS LLLFTITATI SISAMVLCST LPFHTWITGP
SPRATVRWAK IVLRKHHEKV CKPYL