Gene pE33L466_0046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagpE33L466_0046 
SymbolopuD 
ID3399949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_007103 
Strand
Start bp43534 
End bp45132 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content37% 
IMG OID637659887 
Productglycine betaine transporter 
Protein accessionYP_245551 
Protein GI67077931 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.275522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAAA AAGAAAACAG TGTATTCTAT ATCTCCATCT TGCTTACAAC CTTATTTATT 
ATATGGGGTG TAATTCCAGC AAGTTGGATA CAAGGCTATG ATTTACAAAG TGTTACCTCT
TCTCTAAACG TATTTATTCT CAATAAGTTT GGATGGTTTT ACTCTCTACT TATGACCACA
ATGATTGTTT TAGCTGGTTA TTTAGCATTT TCAAAGTATG GCTCTATTCG TCTTGGTAAA
GATGATGAAA AACCACAATA TAGTTACCTC TCTTGGCTTT CCATGTTATT TGGAGCAGGA
ATGGGCATCG GACTTCTCTT TTATGGAATT ACTGAACCCA TTTCACACTT TGGAGCACCG
CTTACTGGTG AACCTGGAAC TGAGGAAAGC GCTAAAGTTG CTATGCAATA TTCATTTTTC
CACTGGGGTC TCTTCCCTTG GTCACTTTAT GCAATAGTCG CTCTAACAAT TGCATATTTT
ACATTCCGAA AACAAAAGGG TAGTACAATT GGCGCTACAG TTACTCCATT ATTTAATCGA
TCGAAACATT CTCCTATTGG AAAAACAGTA GATATTTTAG CTGTTTTAGC AACTGTATTT
GGAATCGTAC CATCCGTAGG AATCGGTGCA CAACAAATCG CTGGAGGATT AAGCTATTTA
TTACCTTCTA TTAATAATAC AATTGGTACA CACCTCGTAC TTATTGCCAT TTTCACTGTA
CTCTATTTAA CAAGTGCACA AACTGGCTTA GATCGTGGGA TTAAATATTT GAGTAACTTA
AACTTCTCCC TCGCTGGAAT ATTACTCGTT TCCTTTTTAA TTTTAGGACC AACTGTCTTT
ATCATGAAGT ATTTTACTTC CACACTAGGC TCTTATATCG AAGCATTACC AAGCATGGGA
TTAAATTTAG GTGCTTTTAG CAAAGAATCC TCTTCTTGGA TTGAAAATTG GACTATTTTT
TACTGGGGTT GGTGGATTTC TTGGTCTCCA TTCGTCGGTA CATTCATTGC CCGTGTTTCT
CGCGGACGTA CAATTCGTGA ATTTATTATC GGCGTCGTAT TAATCCCAAC ATTCATTTGT
ACATTTTGGT TCGCTGTCTT TGGCGGTACA GCTATTCATA TGGAAATGTT CCAATCACTT
GGAATTGCTG ATGAAATTGC GAAAAATGGT ACCGAAATTG GGTTATTTGC TGTCATTTCA
CATTTACCAT TCTCAACATT TTTGACCATT ATTGGACTAA TTTTAATCGC AACATTCTTC
GTTACCTCTG CTGACTCCGC AACATTTGTT GTTGCTATGC AAACAAGCAA TGGTAACTTA
TCACCAAAAA ACAGTTTGAA GCTTATTTGG GGATTAACAA TTACAGCTAT TGCAGCCATT
CTCTTGCAAG CTGGTGGGTT AAATGCACTG CAAATTGCAG CGATCATAGC AGCTTTACCA
TTCTCTATTG TAGTCGTTCT TATGGTCACG TCGCTCTTTA AAGAATTACG AAAAGAGAAT
ATCATTACTC AAACGAAACA ATCACCGCAA AAAATAAAAA AGTTATATAA TAGAAAAGCA
CCTCTTCCTG AACTGTACCC TATAAAATGG ACATTTTAA
 
Protein sequence
MIKKENSVFY ISILLTTLFI IWGVIPASWI QGYDLQSVTS SLNVFILNKF GWFYSLLMTT 
MIVLAGYLAF SKYGSIRLGK DDEKPQYSYL SWLSMLFGAG MGIGLLFYGI TEPISHFGAP
LTGEPGTEES AKVAMQYSFF HWGLFPWSLY AIVALTIAYF TFRKQKGSTI GATVTPLFNR
SKHSPIGKTV DILAVLATVF GIVPSVGIGA QQIAGGLSYL LPSINNTIGT HLVLIAIFTV
LYLTSAQTGL DRGIKYLSNL NFSLAGILLV SFLILGPTVF IMKYFTSTLG SYIEALPSMG
LNLGAFSKES SSWIENWTIF YWGWWISWSP FVGTFIARVS RGRTIREFII GVVLIPTFIC
TFWFAVFGGT AIHMEMFQSL GIADEIAKNG TEIGLFAVIS HLPFSTFLTI IGLILIATFF
VTSADSATFV VAMQTSNGNL SPKNSLKLIW GLTITAIAAI LLQAGGLNAL QIAAIIAALP
FSIVVVLMVT SLFKELRKEN IITQTKQSPQ KIKKLYNRKA PLPELYPIKW TF