Gene BCAH820_5329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_5329 
SymbolopuD2 
ID7187628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp5026464 
End bp5027981 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content38% 
IMG OID643558739 
Productglycine betaine transporter 
Protein accessionYP_002454249 
Protein GI218906415 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value7.52581e-23 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGAAAC TGACAAAAAC ATTCATCGTT TCATTAACAT TATGTATTGC ATTTACACTT 
TGGGGGATTA TTCCCGAATC TATTATTGGA AAAGGTAGCC TAGGAAATGT AACAACCGCA
ATTCAAACTG CATTAGTTAG TAAGTTTGGA TGGTTCTATA TTATTTCTGT TTCTATTATT
TTAGGTGTGT CTATCTTTTT AATTGTTTCG AAATACGGTT CTATTCGTTT AGGTAAAGAT
GATGACGAGC CTGATTATAG TTATATGACA TGGTTTGCTA TGTTATTTAG TGCTGGTATG
GGTATCGGCT TAGTTTTCTG GGGCGTTGCG GAACCATTAA ACCATTTGTA TGCACCTCCG
TTTGGAGAGA GTGCAACTGA GGAAAGTGCA CGTCTTGCAC TGCGTTTTTC ATTTTTCCAT
TGGGGATTAC ATCCTTGGGG ACTATATGCA TTTGTAGCGC TTTGTATTGC TTACTTTACT
TTTAGAAAAG GAAAAGCAAG TACAATTAGT GCGACAGTAG GGCCGTTATT TAAAGGCGGG
GACCATGGAC GTATTGCTCA TTTATTTGAT GTGTTAGCTG TTTTCGCAAC TGTGTTTGGC
GTGGCAACAT CATTAGGTCT TGGTGCAAAA CAAATTGCCG GTGGTGTTAG TTATTTAACA
TCCATCCCGA ATTCATTAAC GACTCAGTTA GTTATTATTG CAATCGTAAC AGTGTTATTT
ATGTTATCTG CGCAAACAGG TCTTGATAAA GGAATTAAAT ATTTAAGTAA TACGAATATT
ATTTTGGCAT TTGCACTTAT GATTATTGTA TTATTTGCGG GTCCAACAAA CTTTATCATG
AATTACTTCA CCTCAACGAT TGGTGCTTAT ATTCAGGAAT TGCCAAGCAT GAGTTTCCGA
TTAAGTCCAT TAGATGAAGG TGGAAACCAA TGGATTCAAT CGTGGACAAT TTTCTATTGG
GCATGGTGGA TTGCATGGTC ACCATTCGTA GGTACATTTA TTGCTCGTGT TTCACGAGGA
CGTACCATTC GTGAGTTTGT TATCGGTGTG TTACTCGTAC CGACCGTAAT TGGTGCCCTT
TGGTTCTCTG TTTTCGGCGG AACTGGTATT CATATGGAGC TGTTCGGTGA TGCACATATT
TTTGAAAAAG TGAAAGAGAT GGGAACAGAA GTAGGGTTAT TCGCTATGTT TGACCAGATG
GGAAGCTTTG GATCGGCTTT ATCTGTTCTA GCTATTCTTC TTATTTCTAC ATTCTTTATT
ACATCTGCAG ATTCAGCGAC ATTCGTTTTA GGAATGTTAA CGACACATGG TAGTTTAAAT
CCGCCAAACC GCATTAAAAT GATCTGGGGT ATCGTTTTAG CAGCCTTAGC TTCTATCTTA
TTATATGTAG GTGGCTTAGA GGCCTTACAA ACGGCAGCTA TCATTGCAGC GTTCCCATTC
GTCTTCGTTA TTTTCTTTAT GATGGCAGCC TTATTTAAAG AGTTACAAAA AGAAGGACGT
ATGAAGCGTC ATAAATAA
 
Protein sequence
MRKLTKTFIV SLTLCIAFTL WGIIPESIIG KGSLGNVTTA IQTALVSKFG WFYIISVSII 
LGVSIFLIVS KYGSIRLGKD DDEPDYSYMT WFAMLFSAGM GIGLVFWGVA EPLNHLYAPP
FGESATEESA RLALRFSFFH WGLHPWGLYA FVALCIAYFT FRKGKASTIS ATVGPLFKGG
DHGRIAHLFD VLAVFATVFG VATSLGLGAK QIAGGVSYLT SIPNSLTTQL VIIAIVTVLF
MLSAQTGLDK GIKYLSNTNI ILAFALMIIV LFAGPTNFIM NYFTSTIGAY IQELPSMSFR
LSPLDEGGNQ WIQSWTIFYW AWWIAWSPFV GTFIARVSRG RTIREFVIGV LLVPTVIGAL
WFSVFGGTGI HMELFGDAHI FEKVKEMGTE VGLFAMFDQM GSFGSALSVL AILLISTFFI
TSADSATFVL GMLTTHGSLN PPNRIKMIWG IVLAALASIL LYVGGLEALQ TAAIIAAFPF
VFVIFFMMAA LFKELQKEGR MKRHK