Gene BAS2597 details


Gene Information

Locus tag: BAS2597
Symbol:
ID: 2848015
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 2590058
End bp: 2591263
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 37%
IMG OID: 637505843
Product: glycine betaine/L-proline ABC transporter ATP-binding protein
Protein accession: YP_028856
Protein GI: 49185604
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4175] ABC-type proline/glycine betaine transport system, ATPase component
TIGRFAM ID: [TIGR01186] glycine betaine/L-proline transport ATP binding subunit
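
The coordinate and length fields above are consistent with one another, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 2590058
end_bp = 2591263

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1206 401
```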


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAATA CAAAGGTGCG TGTTGAAAAC GTAACAAAAG TATTCGGGAA ACATCCGCAA 
AGAGCCCTTT CCTTATTAAA AGAAGGAAAA AGCAAATCGG AAATTTTGAA AGAGACAGGA
ATGAACGTTG GTGTGAAAAA AGCAACGTTT GAAGTATACT CTGGAGAAAT TTTCGTTATT
ATGGGTTTGT CAGGTAGCGG GAAATCTACG TTAGTTCGTA TGTTAAATCA ATTAATTAAA
CCGACAGCGG GCCATATATA CATAGATGGT GAAGATATCG CGACGATGGG CAAGGAAGAA
TTACGGAGAG TAAGAAGAAC GAAAATGAGC ATGGTATTTC AGAAATTCGC TCTATTTCCT
CACCGTACCG TTTTGCAAAA CGTTGCATAT GGATTAGAAA TACAAGGAGT TCCAGTAGAA
GAACGTGAGA AAAAAGCATT AGAATCATTG AAATTAGTTG GACTTGATCA TCATAAAGAT
AATTATCCAA GTCAACTGAG TGGTGGTATG CAGCAGCGTG TTGGCATTGC AAGAGCGCTT
ACGAATGATC CAGATGTTTT ACTTATGGAT GAGTCATTTA GTGCATTAGA TCCACTTATT
CGAAAAGAAA TGCAAGATGA GCTATTAGAA CTTCAAGATA AAATGGAAAA AACAATTATT
TTTATTACGC ATGATTTAGA TGAAGCGCTT CGAATTGGTG ATCGAATTGC ATTGATGAAA
GACGGAGAAG TTGTCCAAAT TGGAACACCA GAAGAAATTA TGATGAGTCC TGCCAATGAA
TTTGTAGAGA AGTTCGTGGC GGACGTGAAT TTAGGAAAAG TAATTACAGC AGAATCTATT
TTAAAACGAC CGGAGACTTT ATTAATTGAT CGTGGACCAC GTGTAGCACT TCAAATTATG
AGGAACGCCG GGGTTTCTAC TGTGTATGTT GTTAATAAAA AGTATGAATT TTTAGGTATA
TTAACTGCGG ATGATGCAAG TAAAGCCGTT CAAAAACAGT GGCCGATTGC CGATTTATTA
CTTAACGATA TTCCGCACGT GTATTTAGAT ACTTTATTAG AGGAAACGTA TGCAAAAATG
GCAGAGATGA AATATCCGTT ACCCGTAATT GATGAGAAGA AAAGACTTAG AGGAATCATT
AAACGTGAAA GTGTCATTCA AGCTCTTGCA GGAAACATTG AGGAAGAGGT GAAAGACGAT
GAATAG
 
Protein sequence
MDNTKVRVEN VTKVFGKHPQ RALSLLKEGK SKSEILKETG MNVGVKKATF EVYSGEIFVI 
MGLSGSGKST LVRMLNQLIK PTAGHIYIDG EDIATMGKEE LRRVRRTKMS MVFQKFALFP
HRTVLQNVAY GLEIQGVPVE EREKKALESL KLVGLDHHKD NYPSQLSGGM QQRVGIARAL
TNDPDVLLMD ESFSALDPLI RKEMQDELLE LQDKMEKTII FITHDLDEAL RIGDRIALMK
DGEVVQIGTP EEIMMSPANE FVEKFVADVN LGKVITAESI LKRPETLLID RGPRVALQIM
RNAGVSTVYV VNKKYEFLGI LTADDASKAV QKQWPIADLL LNDIPHVYLD TLLEETYAKM
AEMKYPLPVI DEKKRLRGII KRESVIQALA GNIEEEVKDD E
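
As a rough illustration of how the gene and protein sequences above relate, the sketch below translates nucleotide text in the 10-base block layout used on this page. It is a minimal example, not the database's own method. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is written out in the usual TCAG order; translation table 11 uses the same amino acid assignments as the standard code for these codons, so start-codon handling is not modelled here.

```python
from itertools import product

# Amino acids for all 64 codons, in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGGATAATA CAAAGGTGCG TGTTGAAAAC"
print(translate(first_blocks))  # prints: MDNTKVRVEN

# Final three codons of the gene sequence (the last is the stop codon).
print(translate("GATGAATAG"))  # prints: DE
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the protein sequence and the GC content field reported in the record.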