Gene BAS5065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5065 
Symbol 
ID2848421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4939093 
End bp4940310 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content38% 
IMG OID637508320 
Productproton/glutamate symporter family protein 
Protein accessionYP_031304 
Protein GI49188051 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCAT ATCGCTTTCC ACTTATTTTA TTATCTTCTA TCCTAATTGG TGGTTTCATT 
GGTTATTTCA TGGGTACCGA TGCCGTTGCT TTAAAGCCGC TTGGTGACAT TTTCTTAAAC
TTAATGTTTA CGATTGTTGT ACCGTTAGTG TTCTTTAGCA TCGCGTCATC TATTGCTAAT
ATGGATGGAT TAAAACGTTT CGGTAAAATT ATGTCTAGTA TGGCTGGGAC TTTCTTATTT
ACAAGTATTT TAGCTGCTAT TTTTATGATT ATTGTCGTGA AAGTATTCCC GCCAGCACAA
GGTGTTGTAT TAGAATTAAC ACAACCTGAC AAAGCTGAAA AAGCTGTTAG CGTTGCAGAT
CAAATTGTTG GTATTCTAAC AGTATCTGAC TTCTCGAAGT TACTATCTCG TGAAAATATG
TTAGCTCTTA TTTTCTTCTC TATTTTAATG GGGATTGCAA CTTCAGCAGT TGGTGAAAAA
GGAAAACCAT TCGCTACATT CTTACAAGCT GGTGCAGAAA TTTCAATGAA AGTTGTATCT
TTCATTATGT ACTACGCTCC AATTGGACTT GCTGCTTACT TCGCAGCATT AGTTGGTGAA
TTCGGACCAC AACTTCTTGG AACTTACTTC CGAGCAGCAA TGGTATACTA TCCAGCGTCT
CTCATTTACT TCTTTGTATT CTTTACGTTC TATGCATACC TTGCAGGTCG CAAGCAAGGT
GTACAAGTAT TTTGGAAGAA CATGGTCTCT CCTACAGTTA CATCACTAGC AACTTGTAGT
AGTGCTGCTA GTATTCCAGC GAACTTAGAA GCAACGAAGA AAATGGGTAT CTCTTCTGAT
GTTCGTGAAA CAGTTATCCT TCTTGGATCT ACACTTCATA AAGACGGATC TGTTTTAGGC
GGCGTATTAA AAATTGCTTT CTTATTCGGT ATTTTCAACA TGGAATTCGA AGGACCGAAA
ACATTAGCAA TCGCACTTGT TGTTTCTCTA TTAGTAGGAA CAGTAATGGG CGCTATTCCA
GGCGGCGGTA TGATTGGTGA AATGTTAATC GTTTCTCTAT ACGGATTCCC GCCAGAAGCA
TTACCAATTA TTGCAGCAAT TAGTACAATC ATTGATCCTC CTGCAACAAT GTTAAACGTA
ACAGCAGATA ACGCTTGTGC CGTAATGACA GCTCGCCTTG TAGAAGGTAA GAACTGGATC
AAAAACAAAT TTGCTTAA
 
Protein sequence
MKAYRFPLIL LSSILIGGFI GYFMGTDAVA LKPLGDIFLN LMFTIVVPLV FFSIASSIAN 
MDGLKRFGKI MSSMAGTFLF TSILAAIFMI IVVKVFPPAQ GVVLELTQPD KAEKAVSVAD
QIVGILTVSD FSKLLSRENM LALIFFSILM GIATSAVGEK GKPFATFLQA GAEISMKVVS
FIMYYAPIGL AAYFAALVGE FGPQLLGTYF RAAMVYYPAS LIYFFVFFTF YAYLAGRKQG
VQVFWKNMVS PTVTSLATCS SAASIPANLE ATKKMGISSD VRETVILLGS TLHKDGSVLG
GVLKIAFLFG IFNMEFEGPK TLAIALVVSL LVGTVMGAIP GGGMIGEMLI VSLYGFPPEA
LPIIAAISTI IDPPATMLNV TADNACAVMT ARLVEGKNWI KNKFA