Gene BCZK1128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK1128 
SymbolopuE 
ID3023010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp1231950 
End bp1233428 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content38% 
IMG OID637545361 
Productsodium/proline symporter; osmoregulated proline transporter 
Protein accessionYP_082728 
Protein GI52144101 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000418657 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACGC AGATGTTAAC TTTAACTTCT ATCTCTATTT ACATGCTCGG GATGTTAGTA 
ATTGGCTATT TTGCCTATAA ACGAACGTCC AACTTAACAG ATTATATGCT TGGCGGGCGT
ACACTAGGTC CCGCAGTAAC GGCATTAAGT GCTGGAGCAT CCGATATGAG TGGTTGGCTT
TTAATGGGCT TACCCGGTGC AATGTTTAGC GTTGGATTAA GTAGTAGTTG GATTGCGATC
GGCCTAACAC TAGGCGCATA CGCAAACTGG CTATATGTCG CTCCTCGCTT ACGTACCTAC
TCAGAAATTG CAAACAACTC TATTACTATC CCAGAATTTT TGGAACATCG CTTCCAAGAC
AAATCCCATA TGCTACGCTT AGTATCTGGA CTTGTTATTA TGATTTTCTT TACTTTTTAT
GTAGCTTCAG GATTAGTTTC AGGCGCTGTA TTATTTGAAA ATTCATTTGG TATGAACTAC
CATGTTGGAT TATTCATTGT TGCAGGCGTT GTTGTAGCTT ACACGTTATT TGGGGGTTTC
TTAGCAGTAA GTTGGACAGA CTTCGTGCAA GGAATCATTA TGGTGATTGC TCTTATTCTT
GTTCCTACTG TTACAATTAT GAATGTAAAT GGGCTTGGTC CAGCATTTAG CACAATTAAA
TCAATTGATC CAACATTATT AGACATTTTT AAAGGCACTT CTGTATTAGG TATTATTTCA
TTATTCGCAT GGGGCCTTGG TTATGTTGGA CAACCACATA TTATCGTACG CTTTATGGCG
ATTTCTTCTG TAAAAGAAAT TAAAAGTGCA AGACGAATTG GTATGAGCTG GATGATTTTC
TCTGTTGTCG GAGCTATGTT TACTGGTCTT ATCGGTATTG CATACTACTC AGACAAAGGA
TTAAAACTAT CCAATCCAGA GACAATTTTC CTTGAACTGG GAAAAATTTT ATTCCACCCA
CTTATTACTG GATTTTTATT AGCCGCTATT TTAGCAGCGA TTATGAGTAC AATCTCATCT
CAGTTACTCG TGACTTCTAG TGCCATAACT GAAGACTTAT ATCGTACTTT CTTTAAACGT
TCTGCTTCTG ATAAAGAGCT TGTATTTGTC GGCCGTATGG CTGTACTTGT TATTGCATTA
GTTGGATGTA CATTAGCGTT TAAACAAAAT GATACGATTT TAGCTCTTGT TGGATACGCT
TGGGCTGGAT TTGGCTCTTC ATTCGGACCT GCTATTTTAT TAAGCTTATA TTGGAAACGT
ATGACGAAGT GGGGCGCACT TGCTGGTATG ATTTCTGGTG CCGCTACAGT CATTATTTGG
ACTCAATTCA AATTCTTAAA AGAATCCTTA TATGAAATGA TTCCTGGTTT CACTATTAGT
TTACTAGTAA TCGTAATTGT TAGTTTACTA ACACAGCCTT CAAAAGAAAT TGAAGATCAA
TTTGAGGATT TCGAAAAACA ACATAGTGAT AATCTATAA
 
Protein sequence
MSTQMLTLTS ISIYMLGMLV IGYFAYKRTS NLTDYMLGGR TLGPAVTALS AGASDMSGWL 
LMGLPGAMFS VGLSSSWIAI GLTLGAYANW LYVAPRLRTY SEIANNSITI PEFLEHRFQD
KSHMLRLVSG LVIMIFFTFY VASGLVSGAV LFENSFGMNY HVGLFIVAGV VVAYTLFGGF
LAVSWTDFVQ GIIMVIALIL VPTVTIMNVN GLGPAFSTIK SIDPTLLDIF KGTSVLGIIS
LFAWGLGYVG QPHIIVRFMA ISSVKEIKSA RRIGMSWMIF SVVGAMFTGL IGIAYYSDKG
LKLSNPETIF LELGKILFHP LITGFLLAAI LAAIMSTISS QLLVTSSAIT EDLYRTFFKR
SASDKELVFV GRMAVLVIAL VGCTLAFKQN DTILALVGYA WAGFGSSFGP AILLSLYWKR
MTKWGALAGM ISGAATVIIW TQFKFLKESL YEMIPGFTIS LLVIVIVSLL TQPSKEIEDQ
FEDFEKQHSD NL