Gene BCG9842_B4055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B4055 
Symbol 
ID7181918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp1191013 
End bp1192491 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content37% 
IMG OID643549010 
Productsodium/proline symporter family protein 
Protein accessionYP_002444681 
Protein GI218896270 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000025968 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACGC AGATGTTAAT TTTAACTTCT ATCTCTATTT ACATGCTCGG GATGCTAATT 
ATCGGCTATT TCGCTTATAA GCAAACATCC AACTTAACAG ATTATATGCT TGGCGGGCGT
ACACTAGGCC CCGCAGTAAC AGCATTAAGT GCTGGAGCAG CTGATATGAG TGGTTGGCTT
TTAATGGGCT TACCCGGTGC AATGTTTAGC GTTGGATTAA GTAGTAGCTG GATTGCGATT
GGCCTAACAT TAGGCGCATA TGCAAACTGG CTTTATGTCG CTCCTCGCTT ACGTACCTAC
TCTGAAATTG CAAATAACTC TATTACTATC CCAGAATTTT TGGAACACCG TTTCCACGAC
AAATCCCATA TGCTACGTTT AGTATCTGGA CTTGTTATTA TGATATTTTT CACGTTTTAT
GTAGCTTCAG GATTTGTCTC TGGTGCTGTA TTATTCGAAA ATTCATTTGG ACTCAATTAC
CATGTTGGTC TTCTTATCGT TGGTGGAGTT GTCGTAGCTT ACACATTATT TGGTGGATTT
TTAGCTGTAA GTTGGACAGA CTTCGTGCAA GGAATCATTA TGGTAGTCGC TCTTATTCTT
GTTCCAGTCG TAACAATTAT GCACGTAAAT GGACTTGGTC CAGCATTTGA AACAATTAAA
TCTATCGATC CGGCATTATT AGATATTTTT AAAGGTACTT CTGTATTAGG AATTATTTCA
TTATTCGCAT GGGGCCTTGG TTATGTTGGA CAACCACATA TTATTGTACG CTTTATGGCG
ATTTCTTCTG TAAAAGAAAT TAAAAGTGCA CGACGTATTG GTATGAGCTG GATGATTTTC
TCTGTTGCTG GAGCTATGTT TACTGGCCTT ATCGGTATTG CATACTATTC AAAAGCAGGT
TTAAAACTTT CTGATCCAGA AACGATTTTC GTTGAACTTG GCACTATTTT ATTCCATCCA
CTTATTACTG GATTTTTATT AGCAGCTATT TTAGCAGCTA TTATGAGTAC AATTTCTTCT
CAACTTCTCG TTACTTCGAG TGCAGTAACA GAAGACTTAT ATAGAACATT CTTTAAGCGT
GATGCTTCTG ATAAAGAACT TGTATTTGTC GGTCGTATGG CTGTTCTTGT TATTGCTTTA
ATTGGATGTG CATTAGCACT TAAACAAAAT GATACGATTT TAGCTCTTGT TGGATATGCT
TGGGCTGGGT TCGGTTCTTC ATTCGGACCT GCTATTTTAT TAAGCTTATA TTGGAAACGT
ATGACGAAAT GGGGCGCGCT TGCTGGTATG GTTTCTGGTG CCGCTACTGT TATTATATGG
ACTCAATTCA AATTCTTAAA AGATTTCTTA TATGAAATGA TTCCAGGTTT CGCTATTAGT
TTACTAGCTA TCGTAATTGT TAGTTTACTA ACACAACCTT CAAAAGAAGT TGAAGAGCAA
TTTGAGAATT TCGAAAAACA ACATAGTCAT AATCTATAA
 
Protein sequence
MSTQMLILTS ISIYMLGMLI IGYFAYKQTS NLTDYMLGGR TLGPAVTALS AGAADMSGWL 
LMGLPGAMFS VGLSSSWIAI GLTLGAYANW LYVAPRLRTY SEIANNSITI PEFLEHRFHD
KSHMLRLVSG LVIMIFFTFY VASGFVSGAV LFENSFGLNY HVGLLIVGGV VVAYTLFGGF
LAVSWTDFVQ GIIMVVALIL VPVVTIMHVN GLGPAFETIK SIDPALLDIF KGTSVLGIIS
LFAWGLGYVG QPHIIVRFMA ISSVKEIKSA RRIGMSWMIF SVAGAMFTGL IGIAYYSKAG
LKLSDPETIF VELGTILFHP LITGFLLAAI LAAIMSTISS QLLVTSSAVT EDLYRTFFKR
DASDKELVFV GRMAVLVIAL IGCALALKQN DTILALVGYA WAGFGSSFGP AILLSLYWKR
MTKWGALAGM VSGAATVIIW TQFKFLKDFL YEMIPGFAIS LLAIVIVSLL TQPSKEVEEQ
FENFEKQHSH NL