Gene BAS2687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS2687 
Symbol 
ID2848924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp2668447 
End bp2670183 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content33% 
IMG OID637505932 
Productsolute-binding family 5 protein 
Protein accessionYP_028945 
Protein GI49185693 
COG category[R] General function prediction only 
COG ID[COG4533] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.240712 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTATTT TAGATCAATA TATTGAACTA TGGTGTGCCT ATGGTAAAGG GAGACAAGAA 
GGCGAACAAT TTGAAGTAAC AGTACAAATG ATTGCAGAAA CATTATTTTG TACAGAACGT
AATAGTAAAT TAATTATCAA AAAGTTAGAT GAATTAAATT GGATTGCTTG GTTTCCAGGG
CGCGGAAGAG GGAATCGTTC TAAATTAATA TTTAAAAAGC AACCAATGAC ATTAATTTTA
GACAGAGGAA AAGAACTAAC GAAAAAAGGG GATGTAAAAA GCGGAATTTC ATTTGTGAAA
CGCTATAGCT CACAATTTCC GTCAGTAAAG GAAGAGTATG AAGTTTGGAT AGATTCAATA
TTTGGTCATA AAATAGAAAG GACATCCGAA GGGAGAAGAG ATGTACTTCG TTTGCAGGTT
CAAATGAATT TAGATATTGC ATTAGATCCG GTCTACGCTA CAATGCGATC AGAATGTCAT
ATGGTTAAAC ATATTTTTGA TACACTCGTA TATGTAAATG AGGAATCAAA CTATATAGAA
CCAAGGCTAG CTTTTCAATG GGAATATAAT GATGCAGAAA AGATATGGAC GTTTTATTTA
CGAAAAGGAG TTCACTTTCA TAATAGGAAA CAACTTACTG CACATGATGT TATACATTCA
TGGAATCGAT TTATGAAAGC TGAAAATAAC CCACATGCGT GGATGTTACA ACATATTGAA
AGCTTCCGCG CAGTAGATGA ATATGTTATT GAAATTCAGT TACGTACGGA AAATAGGATG
TTTTTACATA TGATAAGTGC AGAACAGTGT TCTATCGTAA AGGAAGATGA AGCACGAAAC
CTCATTGGAA CAGGCCCCTT TAAATTAAGC GAAAAGAATG CACATTTATT TGTATTGGAA
GCACATGATT TATATTATCG TGAAAGATCT TTTCTTGACC GAATTGAACT ATTGAATGTA
GAACAAAGTG TAAATACATA CGATATTTTA GTAAAGGCGC AGTATAAAGA TAAAGAAAAA
CATAATAAAG AATTATCTCG GCTTGAGTCG AACGTGACAT ATATAACATG CAATCTTGCA
AAAGAAGGAT CAATGCAAGA TTATATGTTC CGAAAAGCGT TATATAAAAT CATTCATGGC
CAAGCAATCG TTCAAGAACT CGGTGGAGAA CGTGGAGAAG TGGCAAAGGA AATACTATTA
GCTAGTGACA GTATAGTAGA GATTGAGGAA GATATAGAAA GTTTAATTAA AGAAAGTATG
TATCAAAATG AAGTGCTACA ACTTTACACA TTTACAGGAC AAGATCATGT AGAAGATGCG
CAATGGATAC AAAAAGAGTG TGCGAAGTAC GGTATTCGTG TAGAAAATAA TTTTCTTGAA
ATAGAAGAGT TATTGGAAAT AAATACGATA CAAAAGGCTG ATATGATGCA TGATAGTGCA
ACGATTAGCG AACGAATAGA AGATAGTCTA CTATACATGT TTCTTACAAA AAATAGTTTT
ATTCATGGGC AAAGCAGCAT GGACTTTCAT GCAACGTTAT CTCCTTATTT CAAACTAGAA
CAAGTAGAGA ATAGAGTTAC ACTGTTACGC GATATTGAGG ACACATTGTT ACGTCAAATT
CATGTTATTC CTTTATATCG CAACAAACAA CAAGTAACTT CTCATGAAAA AGTACAAAAT
ATAATGATTA ATTCACAAGG GTGGATCGAT TTTTATAAAA TATGGTTTAA ACCCTGA
 
Protein sequence
MFILDQYIEL WCAYGKGRQE GEQFEVTVQM IAETLFCTER NSKLIIKKLD ELNWIAWFPG 
RGRGNRSKLI FKKQPMTLIL DRGKELTKKG DVKSGISFVK RYSSQFPSVK EEYEVWIDSI
FGHKIERTSE GRRDVLRLQV QMNLDIALDP VYATMRSECH MVKHIFDTLV YVNEESNYIE
PRLAFQWEYN DAEKIWTFYL RKGVHFHNRK QLTAHDVIHS WNRFMKAENN PHAWMLQHIE
SFRAVDEYVI EIQLRTENRM FLHMISAEQC SIVKEDEARN LIGTGPFKLS EKNAHLFVLE
AHDLYYRERS FLDRIELLNV EQSVNTYDIL VKAQYKDKEK HNKELSRLES NVTYITCNLA
KEGSMQDYMF RKALYKIIHG QAIVQELGGE RGEVAKEILL ASDSIVEIEE DIESLIKESM
YQNEVLQLYT FTGQDHVEDA QWIQKECAKY GIRVENNFLE IEELLEINTI QKADMMHDSA
TISERIEDSL LYMFLTKNSF IHGQSSMDFH ATLSPYFKLE QVENRVTLLR DIEDTLLRQI
HVIPLYRNKQ QVTSHEKVQN IMINSQGWID FYKIWFKP