Gene BAS5089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5089 
Symbol 
ID2849784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4968263 
End bp4969780 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content38% 
IMG OID637508344 
Productglycine betaine transporter 
Protein accessionYP_031328 
Protein GI49188075 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAAC TGACAAAAAC ATTCATCGTT TCATTAACAT TATGTATTGC ATTTACACTT 
TGGGGGATTA TTCCCGAATC TATTATTGGA AAAGGTAGCC TAGGAAATGT AACAACCGCA
ATTCAAACTG CATTAGTTAG TAAGTTTGGA TGGTCCTATA TTATTTCTGT TTCTATTATT
TTAGGTGTGT CTATCTTTTT AATTGTTTCG AAATACGGTT CTATTCGTTT AGGTAAAGAT
GATGACGAGC CTGATTATAG TTATATGACA TGGTTTGCTA TGTTATTTAG TGCTGGTATG
GGTATCGGCT TAGTCTTCTG GGGCGTTGCG GAACCATTAA ACCATTTGTA TGCACCTCCG
TTTGGAGAGA GTGCAACTGA GGAAAGTGCA CGTCTTGCAC TGCGTTTTTC ATTTTTCCAT
TGGGGATTAC ATCCTTGGGG ACTATATGCA TTTGTAGCGC TTTGTATTGC TTACTTTACT
TTTAGAAAAG GAAAAGCAAG TACAATTAGT GCGACAGTAG GACCGTTATT TAAAGGCGGG
GACCATGGAC GTATTGCTCA TTTATTTGAT GTGTTAGCTG TTTTCGCGAC TGTGTTTGGT
GTGGCAACAT CATTAGGTCT TGGTGCAAAA CAAATTGCCG GTGGTGTTAG TTATTTAACA
TCCATCCCGA ATTCATTAAC GACTCAGTTA GTTATTATCG CAATCGTAAC AGTGTTATTT
ATGTTATCTG CGCAAACAGG TCTTGATAAA GGAATTAAAT ATTTAAGTAA TACGAATATT
ATTTTGGCAT TTGCACTTAT GATTATTGTA TTATTTGTGG GTCCAACAAA CTTTATTATG
AATTACTTCA CCTCAACAAT TGGTGCTTAC ATTCAGGAAT TACCAAGCAT GAGTTTCCGA
TTAAGTCCAT TAGATGAAGG TGGAAACCAA TGGATTCAAT CGTGGACAAT TTTCTATTGG
GCATGGTGGA TTGCATGGTC ACCATTCGTA GGTACATTTA TTGCTCGTGT TTCACGAGGA
CGTACCATTC GTGAGTTTGT TATCGGTGTG TTACTCGTAC CGACCGTAAT TGGTGCCCTT
TGGTTCTCTG TTTTCGGCGG AACTGGTATT CATATGGAGC TGTTCGGTGA TGCACATATT
TTTGAAAAAG TGAAAGAGAT GGGAACAGAA GTAGGGTTAT TCGCTATGTT TGACCAGATG
GGAAGCTTTG GATCGGCTTT ATCTGTTCTA GCTATTCTTC TTATTTCTAC ATTCTTTATT
ACATCTGCAG ATTCAGCGAC ATTCGTTTTA GGAATGTTAA CAACACATGG TAGTTTAAAT
CCGCCAAACC GCATTAAAAT GATCTGGGGT ATCGTTTTAG CAGCCTTAGC TTCTATCTTA
TTATATGTAG GTGGCTTAGA GGCCTTACAA ACGGCAGCTA TCATTGCAGC ATTCCCATTC
GTTTTTGTTA TTTTCTTTAT GATGGCAGCC TTATTTAAAG AGTTACAAAA AGAAGGACGT
ATGAAGCGTC ATAAATAA
 
Protein sequence
MRKLTKTFIV SLTLCIAFTL WGIIPESIIG KGSLGNVTTA IQTALVSKFG WSYIISVSII 
LGVSIFLIVS KYGSIRLGKD DDEPDYSYMT WFAMLFSAGM GIGLVFWGVA EPLNHLYAPP
FGESATEESA RLALRFSFFH WGLHPWGLYA FVALCIAYFT FRKGKASTIS ATVGPLFKGG
DHGRIAHLFD VLAVFATVFG VATSLGLGAK QIAGGVSYLT SIPNSLTTQL VIIAIVTVLF
MLSAQTGLDK GIKYLSNTNI ILAFALMIIV LFVGPTNFIM NYFTSTIGAY IQELPSMSFR
LSPLDEGGNQ WIQSWTIFYW AWWIAWSPFV GTFIARVSRG RTIREFVIGV LLVPTVIGAL
WFSVFGGTGI HMELFGDAHI FEKVKEMGTE VGLFAMFDQM GSFGSALSVL AILLISTFFI
TSADSATFVL GMLTTHGSLN PPNRIKMIWG IVLAALASIL LYVGGLEALQ TAAIIAAFPF
VFVIFFMMAA LFKELQKEGR MKRHK