Gene BAS1666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1666 
Symbol 
ID2849507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1681448 
End bp1682719 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content37% 
IMG OID637504919 
Productproton/sodium-glutamate symporter 
Protein accessionYP_027932 
Protein GI49184680 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT TTGGATTAGC GACACAAATT TTTGTCGCAC TTGTTTTAGG GATTGTAGTA 
GGGGCAATCT TCTATGGTAA TAAAACGGCG ATTTCTTATA TCACACCAAT TGGGGATATA
TTTATTCACT TAATTAAAAT GATTGTAGTA CCGATTGTTA TTTCAGCATT AATTGTTGCG
GTAGCTGGTG TAGGCGATAT GAAGAAGCTT GGGAAACTAG GCGGAAAGAC AATTCTTTAT
TTTGAAATCA TTACGACGAT CGCTATTTTA ATGGGATTAC TTGCGGCGAA TATATTCCAG
CCAGGTACTG GCGTTGATAT GAATAACTTA CAGCAAAGTG ACATTTCTTC TTATAAACAA
ACAGCAGATG CTACAGAGAA GCAAGGATTC GCTGAGACGA TTGTTCATAT TGTACCGAAA
AACGTATTTG AATCGATTGC ACAAGGTGAC TTATTACCGA TTATTTTCTT CTCAGTATTA
TTCGGTTTAG GAGTTGCAGC AATTGGGGAA AAAGGAAAGC CTGTCTTTAA CTTCTTTGAA
GGTGTACTCG AAGCGATGTT TTGGGTTACA AATCAAGTTA TGAAATTTGC ACCATTTGGT
GTATTCGCAT TAATTGCGGT TACGGTTGCA AAATTTGGTG TAGCAACACT ACTTCCTTTA
GGAAAACTAG TGCTAGCTGT ATACGTAACT GTTATACTAT TCGTTGTGAT TGTATTAGGT
ATTAATGCAC GAATGGTTGG CGTAAATATT TTTACATTAA TGAAAATTTT AAAAGAAGAA
CTAATTCTTT CATTTACGAC AGCAAGTTCA GAAGCTGTTT TACCTAATAT AATGAGAAAA
ATGGAAGAGT TCGGTTGTCC AAAGGCAGTT GCCTCTTTCG TAATTCCGAC AGGTTATACA
TTTAACTTGA CTGGATCAGC TATTTATCAA GCGTTAGCGG CATTATTTGT TACACAAATG
TACGGTGTGC ACATGTCACT GACAGAGCAA ATAACGTTAT TATTCGTTCT CATGTTAACA
TCCAAAGGTA TGGCGGGAGT TCCAGGTGCA TCGTTCGTTG TTGTATTAGC AACGTTAGGT
TCAATGGGGT TACCACTAGA AGGTATCGCG TTAATTGCGG GAATTGACCG CATTTTAGAT
ATGATTCGCT CATCTGTCAA TGTATTAGGA AATGCATTAG CGGCCATTGT TATGTCGAAG
TGGGAAGGCG AATTCGATAA TGAGAAAGCA AAACAATATG TAGAAACAGT CAAAGAAACA
AAAGCAGCAT AA
 
Protein sequence
MKKFGLATQI FVALVLGIVV GAIFYGNKTA ISYITPIGDI FIHLIKMIVV PIVISALIVA 
VAGVGDMKKL GKLGGKTILY FEIITTIAIL MGLLAANIFQ PGTGVDMNNL QQSDISSYKQ
TADATEKQGF AETIVHIVPK NVFESIAQGD LLPIIFFSVL FGLGVAAIGE KGKPVFNFFE
GVLEAMFWVT NQVMKFAPFG VFALIAVTVA KFGVATLLPL GKLVLAVYVT VILFVVIVLG
INARMVGVNI FTLMKILKEE LILSFTTASS EAVLPNIMRK MEEFGCPKAV ASFVIPTGYT
FNLTGSAIYQ ALAALFVTQM YGVHMSLTEQ ITLLFVLMLT SKGMAGVPGA SFVVVLATLG
SMGLPLEGIA LIAGIDRILD MIRSSVNVLG NALAAIVMSK WEGEFDNEKA KQYVETVKET
KAA