Gene BAS2126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS2126 
Symbol 
ID2851177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp2125689 
End bp2127200 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content36% 
IMG OID637505375 
Productglycine betaine/L-proline ABC transporter permease 
Protein accessionYP_028388 
Protein GI49185136 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component
[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.887969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTGATT TTATACAAAC GTTTCAAGAA CGAAAAATAG AATTACTAAC TGCATTAAGT 
GAGCATTTAC AAATATCGCT TATTTCATTA TTTTTTGCAG TAATTATTGC GGTACCGCTG
GGCATTTTAT TAACGAGAAA AGAAAGAATG GCTGAGTTGA TTATAGGAAC TTCTGCAGTT
ATGCAGACCG TCCCATCACT TGCATTACTC GGACTATTAA TTCCGTTGGT AGGAATCGGA
AAAATCCCGG CCGTTATTGC ATTAGTTGTA TATGCACTAT TACCTATTTT ACGCAATACA
TATACAGGAA TACGGGAGTT AGATGAATCT TTAATCGAAG CAGCGAAAGC TATGGGGATG
AATAGTTGGA GAAGACTGTG GAAGGTAGAG CTTCCTCTTG CCTTACCAAT TATTATGGCC
GGTATTCGTA CGGCAATGGT ATTAATTGTT GGAACAGCTA CATTGGCAGC TCTTATAGGT
GCTGGTGGAC TCGGTAAACT CATATTACTT GGTATTGATC GGAATGATCA TGCACTTATT
ATTTTAGGGG CCGTACCAGC CGCGTTACTC GCTTTATTCT TTGATGTAGT ACTTCGAGTG
CTTGAGAAGC CAAAACGCTC GTCTAAGCGT GTTATATTGA CGATATGTAT CGTTTGTATA
ATGGTTGCTT CTCCATTTCT TTGGAATACA GAAAAGAAAG ATATTGTTAT TGCGGGCAAA
CTTGGATCAG AACCTGAAAT TTTAATTCAA ATGTATAAGC AGCTTATTGA ACAAGATACC
GATTTACACG TACAATTAAA ACCAGGTCTT GGAAAAACAG CATTTGTATT TGAAGCGTTG
AAATCAGGAG AAGTAGATAT ATATCCTGAG TTTTCGGGAA CAGCTTTATC TACTTTCGTG
AAAGAAGAGC CAAAAAGTAC AAATCGGGAT GAAGTATATG AGCAGGCTCG CGTTGGAATG
GAAAAGAAAT ATAATATGGT TATGTTGAAG CCGATGGAGT ATAACAATAC ATATGCACTG
GCTATGCCGA AAAAAATAGC AGATCAAAAT AATATAAATA CAATCTCTGA TTTAGGAAAT
ATTGCACAGG ATACGAAAGT AGGATTCACA TTAGAATTTG CTGATCGTGA AGACGGTTAT
AAAGGAATGC AAAAGTTATA TAACTATAAG TTTTCAAATG TGAAAACGAT GGAACCGAAA
TTACGTTACA GTGCAATTCA ATCAGGGGAT GTTAACGTAA TCGATGCATA TTCAACAGAT
AGTGAATTGG AGCAATACGG ACTTAAAGCG CTAAAAGATG ATAAAGGGTT ATTCCCACCA
TACCAAGGTG CACCATTATT AAGAAAAGAG ACATTACAAA AATATCCTGA ACTTGAAAAA
GTATTAAATA AATTATCTGG AAAGATTACA GATGAAGAAA TGCGAAAAAT GAATTATGAA
GTAAATGTGA ATGGTAAAAA TAGTGAAGAA GTAGCGAAAC AATTTTTACA AAAAGAGAAT
TTACTTCGTT AA
 
Protein sequence
MTDFIQTFQE RKIELLTALS EHLQISLISL FFAVIIAVPL GILLTRKERM AELIIGTSAV 
MQTVPSLALL GLLIPLVGIG KIPAVIALVV YALLPILRNT YTGIRELDES LIEAAKAMGM
NSWRRLWKVE LPLALPIIMA GIRTAMVLIV GTATLAALIG AGGLGKLILL GIDRNDHALI
ILGAVPAALL ALFFDVVLRV LEKPKRSSKR VILTICIVCI MVASPFLWNT EKKDIVIAGK
LGSEPEILIQ MYKQLIEQDT DLHVQLKPGL GKTAFVFEAL KSGEVDIYPE FSGTALSTFV
KEEPKSTNRD EVYEQARVGM EKKYNMVMLK PMEYNNTYAL AMPKKIADQN NINTISDLGN
IAQDTKVGFT LEFADREDGY KGMQKLYNYK FSNVKTMEPK LRYSAIQSGD VNVIDAYSTD
SELEQYGLKA LKDDKGLFPP YQGAPLLRKE TLQKYPELEK VLNKLSGKIT DEEMRKMNYE
VNVNGKNSEE VAKQFLQKEN LLR