Gene GBAA_1246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_1246 
Symbol 
ID2815302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp1200812 
End bp1202290 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content37% 
IMG OID637788190 
Productsodium/proline symporter family protein 
Protein accessionYP_017861 
Protein GI47526512 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000709794 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACGC AGATGTTAAC TTTAACTTCT ATCTCTATTT ACATGCTCGG GATGTTAGTA 
ATTGGCTATT TCGCCTATAA ACGAACGTCC AACTTAACAG ATTATATGCT TGGCGGGCGT
ACACTAGGCC CCGCAGTAAC AGCATTAAGT GCTGGAGCAT CCGATATGAG TGGTTGGCTT
TTAATGGGCT TACCCGGTGC AATGTTTAGC GTTGGATTAA GTAGTAGTTG GATTGCGATC
GGCCTAACAC TAGGCGCATA CGCAAACTGG CTATATGTTG CTCCTCGCTT ACGTACCTAC
TCTGAAATTG CAAACAACTC TATTACTATC CCAGAATTTT TGGAACATCG CTTCCAAGAC
AAATCCCATA TGCTACGCTT AGTATCCGGA CTTGTTATTA TGATTTTCTT TACTTTTTAT
GTAGCTTCAG GATTAGTTTC AGGCGCTGTA TTATTTGAAA ATTCATTTGG TATGAACTAC
CATGTTGGAT TATTCATTGT TGCAGGCGTT GTTGTAGCTT ACACGTTATT TGGTGGTTTC
TTAGCAGTAA GTTGGACAGA CTTCGTGCAA GGAATCATTA TGGTAATTGC TCTTATTCTT
GTTCCTACCG TTACAATTAT GAATGTAAAT GGGCTTGGTC CAGCATTTAG CACAATTAAA
TCAATTGATC CAACATTATT AGACATTTTT AAAGGCACTT CTGTATTAGG TATTATTTCA
TTATTCGCAT GGGGCCTTGG TTATGTTGGA CAACCACATA TTATCGTACG ATTTATGGCA
ATTTCTTCTG TAAAAGAAAT TAAAAGTGCA AGACGAATTG GTATGAGCTG GATGATTTTC
TCTGTTGTTG GAGCTATGTT TACTGGTCTT ATCGGTATTG CATACTACTC AGACAAAGGA
TTAAAGCTAT CCAATCCAGA GACAATTTTC CTTGAACTAG GAAAGATTTT ATTCCATCCG
CTTATTACTG GATTTTTATT AGCCGCTATT TTAGCAGCAA TTATGAGTAC AATCTCATCT
CAGTTACTCG TTACTTCTAG TGCCATAACT GAAGACTTAT ATCGTACTTT CTTTAAACGT
TCTGCTTCTG ATAAAGAGCT TGTATTTGTC GGCCGTATGG CTGTACTTGT TATAGCATTA
GTTGGATGTG CATTAGCGTT TAAACAAAAT GATACGATTT TAGCTCTTGT TGGATACGCT
TGGGCTGGAT TTGGCTCTTC ATTCGGACCT GCTATTTTAT TAAGCTTATA TTGGAAACGT
ATGACGAAGT GGGGCGCACT TGCTGGTATG ATTTCTGGTG CCGCTACAGT CATTATTTGG
ACTCAATTCA AATTCTTAAA AGAATTCTTA TATGAAATGA TCCCTGGTTT CACTATTAGT
TTACTAGTAA TCGTAATTGT TAGTTTACTG ACACAGCCTT CAAAAGAAAT TGAAGAGCAA
TTTGAGAATT TCGAAAAACA ACATAGTGAT AATCTATAA
 
Protein sequence
MSTQMLTLTS ISIYMLGMLV IGYFAYKRTS NLTDYMLGGR TLGPAVTALS AGASDMSGWL 
LMGLPGAMFS VGLSSSWIAI GLTLGAYANW LYVAPRLRTY SEIANNSITI PEFLEHRFQD
KSHMLRLVSG LVIMIFFTFY VASGLVSGAV LFENSFGMNY HVGLFIVAGV VVAYTLFGGF
LAVSWTDFVQ GIIMVIALIL VPTVTIMNVN GLGPAFSTIK SIDPTLLDIF KGTSVLGIIS
LFAWGLGYVG QPHIIVRFMA ISSVKEIKSA RRIGMSWMIF SVVGAMFTGL IGIAYYSDKG
LKLSNPETIF LELGKILFHP LITGFLLAAI LAAIMSTISS QLLVTSSAIT EDLYRTFFKR
SASDKELVFV GRMAVLVIAL VGCALAFKQN DTILALVGYA WAGFGSSFGP AILLSLYWKR
MTKWGALAGM ISGAATVIIW TQFKFLKEFL YEMIPGFTIS LLVIVIVSLL TQPSKEIEEQ
FENFEKQHSD NL