Gene BCZK3347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK3347 
SymbolopuE 
ID3027023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp3478588 
End bp3480069 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content37% 
IMG OID637547566 
Productsodium/proline symporter 
Protein accessionYP_084932 
Protein GI52141901 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTGAAGA TTGAGATTAT GGTTTCGCTT GCTATTTATA TGGCAGGTAT GTTGTATATC 
GGGTATTGGT CTTATAAGAA GACATCCGAT TTATCAGATT ATATGTTAGG CGGAAGAGGA
CTCGGTCCAG CAGTTACAGC TTTATCAGCC GGTGCTTCTG ACATGAGTGG TTGGATGCTT
ATGGGATTAC CGGGTGCGAT GTATGCGACA GGGTTGTCCA GTGTATGGAT CGCGATAGGT
TTATTAATAG GCGCTTATGC AAACTATTTA ATTCTCGCGC CGCGTTTACG AACATATACG
GAAGTAGCAA ATGATTCAAT TACGATTCCA GATTTTTTAG AGAATCGGTT TAAAGATCGT
ACGAAAATAC TTCGTTTTGT CTCCGCTATC GTCATTTTAG TATTTTTCAC ATTTTATGCG
TCAGCTGGTT TGGTTTCAGG TGGACGTTTG TTTGAAAATT CTTTTAACCT TGATTATAAA
ATTGGTTTAT TTGTAACTGT CGGTGTCGTT GTTGCTTATA CACTATTCGG TGGTTTTTTA
GCAGTAAGTT GGACCGACTT TGTGCAAGGT TGTATTATGT TTATTGCTCT TGTATTAGTT
CCAATTGTAG CTTTTACAGA TGTCGGTGGT GTAACAGAAA CATTCAATAC AATTAAGCAA
GTTGATGCAT CGCATTTAGA TATGTTTAAA GGGACTACAA TACTTGGCAT TATTTCATTT
TTAGCATGGG GCCTTGGGTA TTTTGGTCAA CCACATATTA TTGTCCGCTT TATGGCAATT
ACCTCTATTA AAGATTTAAA AACTTCTCGT AGAATCGGTA TCGGTTGGAT GACGATTTCA
ATTATAGGTG CAATGCTTAC TGGTCTAGTT GGTATTGCTT ATTACGCTAA AAATAATGCG
ACATTACAAG ATCCGGAAAT GGTCTTTGTA ACATTCTCAA ATATTTTATT CCATCCGTAC
ATTACTGGAT TTTTATTATC AGCTATTTTG GCTTCGATTA TGAGTAGTAT TTCCTCGCAA
TTACTTGTTA TTTCAAGTGC TGTAACGGAA GATTTCTATA AAACATTTTT CCGTCGTAAA
GCAAGTGATA AAGAACTTGT ATTTATCGGT AGGCTGTCAG TATTAGTAGT AGCGATGATT
GCAGTTGTTT TAGCGTATCA TCCGAGTGAT ACAATTTTAA CGCTTGTTGG ATATGCTTGG
GCAGGATTTG GATCAGCATT CGGACCAGCA ATTTTATTAA GTTTATATTG GAAGAGAACG
AACAAATGGG GCGTTCTTGC TGGGATGATT GTCGGTGCAT TAGTTGTTAT CACTTGGGTA
CAAATTCCAA GTTTAAAAGC GACTATGTAT GAGATGGTAC CTGGATTCTT CTGTAGCTTA
TTAGCTGTTA TTATCGTAAG TTTAGTAACG AAAGAACCAG TTAAAGCAAT ACATCGTGAA
TTTAATGAGA TGGAAGCAGT ATTGGAAGAG GAAACAAAAT AA
 
Protein sequence
MVKIEIMVSL AIYMAGMLYI GYWSYKKTSD LSDYMLGGRG LGPAVTALSA GASDMSGWML 
MGLPGAMYAT GLSSVWIAIG LLIGAYANYL ILAPRLRTYT EVANDSITIP DFLENRFKDR
TKILRFVSAI VILVFFTFYA SAGLVSGGRL FENSFNLDYK IGLFVTVGVV VAYTLFGGFL
AVSWTDFVQG CIMFIALVLV PIVAFTDVGG VTETFNTIKQ VDASHLDMFK GTTILGIISF
LAWGLGYFGQ PHIIVRFMAI TSIKDLKTSR RIGIGWMTIS IIGAMLTGLV GIAYYAKNNA
TLQDPEMVFV TFSNILFHPY ITGFLLSAIL ASIMSSISSQ LLVISSAVTE DFYKTFFRRK
ASDKELVFIG RLSVLVVAMI AVVLAYHPSD TILTLVGYAW AGFGSAFGPA ILLSLYWKRT
NKWGVLAGMI VGALVVITWV QIPSLKATMY EMVPGFFCSL LAVIIVSLVT KEPVKAIHRE
FNEMEAVLEE ETK