Gene BAS2639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS2639 
Symbol 
ID2849510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp2625262 
End bp2626752 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content37% 
IMG OID637505885 
Productsodium/alanine symporter family protein 
Protein accessionYP_028898 
Protein GI49185646 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1115] Na+/alanine symporter 
TIGRFAM ID[TIGR00835] amino acid carrier protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.658687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCAAT TAGTAGAGTG GTTAGTAGGG CAAGTGTGGA GTATTGGTTT AGTTGTTTTC 
GCGTTAGGAG CAGGTGTGTA TTTTACAATT GCAACTCGTT TTCTTCAAAT TCGTTATTTT
AAAGAGATGA TTAAACTATT ATTTGAAGGG AAGAGCTCAG AAACGGGAAT ATCATCCTTT
CAAGCATTTT GTTTAGCTTT ATCAGGCAGG GTTGGAATAG GTAATATTGC AGGGGTCGCG
ACAGCTATCG CTTTTGGCGG GCCTGGAGCT GTATTTTGGA TGTGGGTAAT GGCTCTTTTA
GGAGCAGCTA GTGCCTTTGT CGAATCAACA TTATCTCAAA TATATAAAAG TAAAGTTGAA
AATGAATATC GCGGTGGTAC ACCGTATTTC ATTGAAAAAG GCTTAAACAT GAAATGGTTT
GCAGTCATTG TAGCGGTCGT TGTAACACTT TCATATGGTG TTTTATTACC AGGTATTCAA
TCTAGTAGTA TCGCAGTTGG ATTCGAAAAC TCTAATGGGA TTAGCAAATA TATAACTGGT
ATCTTGTTAG TTGTATTATT AGCAGCAATT ATTTTTGGTG GCGTAAAGAG AATTGCTGGC
GTTTCTCAAA TGCTCGTTCC ATTTATGGCA ATTGGTTATG TAATTGTTAC ATGTATCGTA
TTAATTGCGA ATGTAAAAGA AATCCCAAGT ATGTTCGCTT TAATTTTCTC AAGTGCTTTT
GGTGTGAATG AAATGTTTGG TGGAATCGTC GGTGCAGCAA TCGCGTGGGG CGTAAAGTGC
GCTGTATTTT CTAACGTTGC TGGCGTTGGA GAAGCGACGT ATAGTTCGGC CGCGGCTGAA
GTATCTCATC CAGCAAAACA AGGGTTAGTT CAAGCGTTTT CTGTATACAT TGATACAATT
GTCGTATGTA CAGCGACAGC TCTTATGATC TTAATAACAG GTATGTATAA TGTTATACCT
GAAGGGAAAA GCGCTATCGT AAAGAATATA GGGAATGTTG ATGCGGGTCC AATTTATACA
CAACAAGCAG TTGAAACTGT TATGACAGGG TTTGGTCCAT TATTCATTTC AATCGCAATT
TTCTTCTTCG CATTTACAAC ATTACTTGCA TACTACTATA TCGCTGAAAC GACACTTACT
TATTTAGACC GTGAACTTAA GCATAGTTGG TTAAAACCAG TTTTGAAAAT TGGATTTTTA
ATTATGGTTT ACATCGGTAG TGTAGAATCA GCATCGCTTT TATGGAATCT TGGAGATTTA
GGAATCGGTA GTATGGCATG GTTAAACTTA ATCGCGATTC TACTATTAAG TAAAATCGCA
TTAAAAGTGT TAAAAGATTA TGAAACGCAG AAAAAAGAAG GGAAAGATCC CGTGTTTGAT
CCTAAAAATG TGGGAATTGA AGGTTTAACA TTTTGGGAAG AAAGAAGTAA AGAGGTTGCA
AGAAAAAACT CAAAAGAACA AGCGGTAGTG GATGATAGTC TGAAATTGTA G
 
Protein sequence
MEQLVEWLVG QVWSIGLVVF ALGAGVYFTI ATRFLQIRYF KEMIKLLFEG KSSETGISSF 
QAFCLALSGR VGIGNIAGVA TAIAFGGPGA VFWMWVMALL GAASAFVEST LSQIYKSKVE
NEYRGGTPYF IEKGLNMKWF AVIVAVVVTL SYGVLLPGIQ SSSIAVGFEN SNGISKYITG
ILLVVLLAAI IFGGVKRIAG VSQMLVPFMA IGYVIVTCIV LIANVKEIPS MFALIFSSAF
GVNEMFGGIV GAAIAWGVKC AVFSNVAGVG EATYSSAAAE VSHPAKQGLV QAFSVYIDTI
VVCTATALMI LITGMYNVIP EGKSAIVKNI GNVDAGPIYT QQAVETVMTG FGPLFISIAI
FFFAFTTLLA YYYIAETTLT YLDRELKHSW LKPVLKIGFL IMVYIGSVES ASLLWNLGDL
GIGSMAWLNL IAILLLSKIA LKVLKDYETQ KKEGKDPVFD PKNVGIEGLT FWEERSKEVA
RKNSKEQAVV DDSLKL