Gene BCAH820_1314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_1314 
Symbol 
ID7191971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp1252087 
End bp1253565 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content37% 
IMG OID643554726 
Productsodium/proline symporter family protein 
Protein accessionYP_002450266 
Protein GI218902432 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value7.01893e-22 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTACGC AGATGTTAAC TTTAACTTCT ATCTCTATTT ACATGCTCGG GATGTTAGTA 
ATTGGCTATT TCGCCTATAA ACGAACGTCC AACTTAACAG ATTATATGCT TGGCGGGCGT
ACACTAGGCC CCGCAGTAAC AGCATTAAGT GCTGGAGCAT CCGATATGAG TGGTTGGCTT
TTAATGGGCT TACCCGGTGC AATGTTTAGC GTTGGATTAA GTAGTAGTTG GATTGCGATC
GGCCTAACAC TAGGCGCATA CGCAAACTGG CTATATGTTG CTCCTCGCTT ACGTACCTAC
TCTGAAATTG CAAACAACTC TATTACTATC CCAGAATTTT TGGAACATCG CTTCCAAGAC
AAATCCCATA TGCTACGCTT AGTATCCGGA CTTGTTATTA TGATTTTCTT TACTTTTTAT
GTAGCTTCAG GATTAGTTTC AGGCGCTGTA TTATTTGAAA ATTCATTTGG TATGAACTAC
CATGTTGGAT TATTCATTGT TGCAGGCGTT GTTGTAGCTT ACACGTTATT TGGTGGTTTC
TTAGCAGTAA GTTGGACAGA CTTCGTGCAA GGAATCATTA TGGTAATTGC TCTTATTCTT
GTTCCTACTG TTACAATTAT GAATGTAAAT GGGCTTGGTC CAGCATTTAG CACAATTAAA
TCAATTGATC CAACATTATT AGACATTTTT AAAGGCACTT CTGTATTAGG TATTATTTCA
TTATTCGCAT GGGGCCTTGG TTATGTTGGA CAACCACATA TTATCGTACG ATTTATGGCA
ATTTCTTCTG TAAAAGAAAT TAAAAGTGCA AGACGAATTG GTATGAGCTG GATGATTTTC
TCTGTTGTTG GAGCTATGTT TACTGGTCTT ATCGGTATTG CATACTACTC AGACAAAGGA
TTAAAGCTAT CCAATCCAGA GACAATTTTC CTTGAACTAG GAAAAATTTT ATTCCACCCG
CTTATTACTG GATTTTTATT AGCCGCTATT TTAGCAGCAA TTATGAGTAC AATCTCATCT
CAGTTACTCG TTACTTCTAG TGCCATAACT GAAGACTTAT ATCGTACTTT CTTTAAACGT
TCTGCTTCTG ATAAAGAGCT TGTATTTGTC GGCCGTATGG CTGTACTTGT TATTGCATTA
GTTGGATGTG CATTAGCGTT TAAACAAAAT GATACGATTT TAGCTCTTGT TGGATACGCT
TGGGCTGGAT TTGGCTCTTC ATTCGGACCT GCTATTTTAT TAAGCTTATA TTGGAAACGT
ATGACGAAGT GGGGCGCACT TGCTGGTATG ATTTCCGGTG CCGCTACAGT CATTATTTGG
ACTCAATTCA AATTCTTAAA AGAATTCTTA TATGAAATGA TTCCTGGTTT CACTATTAGT
TTACTAGTAA TCATAATTGT TAGTTTACTA ACACAGCCTT CAAAAGAAAT TGAAGAGCAA
TTTGAGGATT TCGAAAAACA ACATAGTGAT AATCTATAA
 
Protein sequence
MSTQMLTLTS ISIYMLGMLV IGYFAYKRTS NLTDYMLGGR TLGPAVTALS AGASDMSGWL 
LMGLPGAMFS VGLSSSWIAI GLTLGAYANW LYVAPRLRTY SEIANNSITI PEFLEHRFQD
KSHMLRLVSG LVIMIFFTFY VASGLVSGAV LFENSFGMNY HVGLFIVAGV VVAYTLFGGF
LAVSWTDFVQ GIIMVIALIL VPTVTIMNVN GLGPAFSTIK SIDPTLLDIF KGTSVLGIIS
LFAWGLGYVG QPHIIVRFMA ISSVKEIKSA RRIGMSWMIF SVVGAMFTGL IGIAYYSDKG
LKLSNPETIF LELGKILFHP LITGFLLAAI LAAIMSTISS QLLVTSSAIT EDLYRTFFKR
SASDKELVFV GRMAVLVIAL VGCALAFKQN DTILALVGYA WAGFGSSFGP AILLSLYWKR
MTKWGALAGM ISGAATVIIW TQFKFLKEFL YEMIPGFTIS LLVIIIVSLL TQPSKEIEEQ
FEDFEKQHSD NL