Gene BCZK3840 details

Gene Information

Locus tag: BCZK3840
Symbol:
ID: 3026700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_006274
Strand:
Start bp: 3969485
End bp: 3970774
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 40%
IMG OID: 637548054
Product: xanthine/uracil permease family protein
Protein accession: YP_085420
Protein GI: 52141409
COG category: [R] General function prediction only
COG ID: [COG2252] Permeases
TIGRFAM ID:
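
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 3969485
END_BP = 3970774


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1290
print(protein_length(length))  # 429
```

Both results agree with the Gene Length and Protein Length fields in the record.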


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000802373
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGGAA TACTTGAAAG AACTTTTAAA TTAGGTTTAC ACGAAACATC ACCAAAACAA 
GAAGTTTTAG CTGGAGTTAC GTCATTTTTC ACAATCGTAT ACATTATGAT TGTAAATGCA
TCCATTTTAT CAGATGCTGG CATTCCTCTT GAAGCTGGAA TTTTAGCAAC TGTTTTCAGT
TCATTTGTCG GATGTCTGCT CATGGCATTT TGGGCAAATG CACCCGCTAT TCTTGTCCCT
GGTATGGGTG TAAATGCATT CTTCACGTAC ACTGCTGTCC ATACGCTCGG ATTAACTTGG
CAGGAAGCAT TAGCAGCTGT CTTCATCTCT GGTATTATTT TTGCAATTGC TGCGTTTACA
CCAATCGCTC GCGTGCTTTC CGTATCGATT CCAAAGTCAT TAAAGGAAGC TATTACTGTC
GGTATCGGAT TGTTTTTAGC GTTTATCGGA TTGCAAAAAG GTGGTTTAGT CGTTTCGAAT
CCAAATACTG CTGTTGCAAT GGGGAAATTA AGTAACCCTG TCGTTCTTGC GACCGTGCTT
ACTCTTATCG TTGCACTCGT ATTATTTATT CGTAATGTAC GTGGAAACTT TTTATGGACG
ATTGCAATAG GAACTGGTAT CGCATGGCTA TTTGGTCTTG TGGATACAAG TCAAATTGGA
AATAGTTCAT TTTCATTCGC TAATTACGGC GATGTGTTTG GAGCTATGTC ATTTGGCAAA
CTCTCTTCCT TACCGTTTTG GATTGCAACA TTCTCTTTAA GCATGGTGCT TATTTTTGAG
AACATGGGAC TTCTGCATGG TTTATTAGAA GATGACCGTA AATTCCCACG TGCTTACCAA
GCCAATGCAA TTTCAGCAAT GACATGTGGT CTATTTGGCA CAAGCCCTAC AGTATCAACA
GTAGAGAGTG CCGCAGGTAT TACTGCAGGC GGAAAGACAG GTCTGACGTC TATCGTTACA
GGGCTGTTAT TCTTTGCATC ACTGTTTGCA CTTCCGTTTG TCAAACTAAT TCCTGATAGT
GCCATTGCAC CAATCCTAAT TATTATTGGC GGCCTGATGA TTACAAGCAT TCAACAAATT
CCTCTGAACG ATTTTTCAGA AGGATTTCCA GCGTTCTTAA TTATCGTCAT GATTCCGCTC
ACATATAGTA TCGCTGATGG CATTGCGTTC GGATTTATTG CTTATCCTAT TTTAAAAGTT
GCTCTTGGAA AGCGTAAAGA AGTCGCACCG TCTATGTATA TCATTACATG CTTATTCTTA
GCCATGTTCG TATTACATGC TATTGGTTAA
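
The GC content field can be recomputed from a sequence printed in this layout. The sketch below assumes GC content means the fraction of G and C bases over all bases, and that the blanks and line breaks in the listing are purely cosmetic. The function name is hypothetical.

```python
def gc_fraction(seq_text):
    """Fraction of G or C bases, ignoring the spaces and line breaks of the listing."""
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# The first 20 bases of the gene sequence above, in the same 10-base grouping.
print(gc_fraction("ATGAAAGGAA TACTTGAAAG"))  # 0.3
```

Passing the full gene sequence text to the same function gives the value to compare against the rounded percentage in the record.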
 
Protein sequence
MKGILERTFK LGLHETSPKQ EVLAGVTSFF TIVYIMIVNA SILSDAGIPL EAGILATVFS 
SFVGCLLMAF WANAPAILVP GMGVNAFFTY TAVHTLGLTW QEALAAVFIS GIIFAIAAFT
PIARVLSVSI PKSLKEAITV GIGLFLAFIG LQKGGLVVSN PNTAVAMGKL SNPVVLATVL
TLIVALVLFI RNVRGNFLWT IAIGTGIAWL FGLVDTSQIG NSSFSFANYG DVFGAMSFGK
LSSLPFWIAT FSLSMVLIFE NMGLLHGLLE DDRKFPRAYQ ANAISAMTCG LFGTSPTVST
VESAAGITAG GKTGLTSIVT GLLFFASLFA LPFVKLIPDS AIAPILIIIG GLMITSIQQI
PLNDFSEGFP AFLIIVMIPL TYSIADGIAF GFIAYPILKV ALGKRKEVAP SMYIITCLFL
AMFVLHAIG
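
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the sequence is given 5' to 3' on the coding strand and starts in frame. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which start codons are permitted; this sketch does not handle alternative start codons. All names are hypothetical.

```python
from itertools import product

# Amino acids for the 64 codons, with bases ordered T, C, A, G at each
# position and the third position varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text):
    """Translate a DNA listing codon by codon, stopping at the first stop codon.

    Spaces and line breaks are ignored; a trailing incomplete codon is dropped.
    """
    bases = "".join(seq_text.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first 30 bases of the gene sequence above.
print(translate("ATGAAAGGAA TACTTGAAAG AACTTTTAAA"))  # MKGILERTFK
```

The output matches the first ten residues of the protein sequence listed above.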