Gene BCG9842_B4495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B4495 
Symbol 
ID7181644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp774063 
End bp775265 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content39% 
IMG OID643548573 
Productmajor facilitator family transporter 
Protein accessionYP_002444244 
Protein GI218895833 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000363259 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0000000000000205787 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGGAGAAG CAATACTCGT AAAACGAGAA CCGTTATGGA CAAAAGAGTT TGTTGCTTTA 
ATTTTTGCAA ACTTATGTAT GTTTTTAGGG TTTCAAATGT TAATTCCAAC CTTACCTGTT
TATGTGAAAG AAATTGGTGG CACAAGTTCC AATATCGGAT TTGTTGTCGG TATGTTTACC
GTTGCGGCAC TTTTTGTTAG ACCGCTAACT GGGAACGCCT TGCAAAAATT TAATAAAAAA
ATCATTTTAA TGATCGGTAC TGCTATCTGT TTACTCGCTA TGGGCAGTTA CCTTTTCGCC
TCAACTATCT TTCTCTTGCT TGCTGTTCGT ATTTTACACG GAGCTGGTTT CGGTATTACA
ACGACTACAT ATGGAACTGT CGTTTCTGAT TTAATTCCCT CAGCTCGCCG CGGAGAAGGC
ATGGGATATT TTGGCCTTTC TGGAACAATT GCAATGGCCC TCGGTCCACT TATAGGACTA
TGGCTCATGC AAACATATAA CTTCACAATT CTTTTTTTAT GTGCACTATC GTGCACAATT
GTTTCATTAA TATTAACGAA ACTACTTCAA ATCCAAAAAA CGAAACAGCC GCCACAACAA
TCATCTAGTA CTTTTCTCGA TGGATTTATT GAGCGTAAAG CTTTACTTCC TTCATTATTA
ATATTATGTA TTACATTAAT GTACGGAGGA ATCGGAAGTT TTATCACACT ATTTGCTACA
GAAGTCGGCA TAGCTGATAT TAGCCTCTTC TTTTTATGTA ATGCACTAGC AATTGCTGTA
ACTCGTCCAT TCTCTGGAAG GCTATATGAT GCGAAAGGCC ATACATTCGT CATCATTCCG
GGAGTTATTA TAACGTTTAC AGGCATTATT TTATTGTCGT ATACGACGAC AATTCCGAGC
TTAATTATTG CTGCAGCATG TTACGGAAGT GGTTTTGGAG CGATCCAACC TGCACTACAA
GCATGGATGA TTGACCGCGT AGCACCGCAT CGACGCGGCG TAGCAACAGC TACATTCTTC
TCCGCATTTG ACCTTGGAAT CGGCGCTGGA GCGATTATAT TTGGATTTAT TGCTCATTTT
ACAAACTATG CAACTGTATA TCGTTACTCC TCTCTATTAC TTATTGCTTT TCTCTTCATT
TACATTACAA GCATAAGAAA ACAAAAGTAT GGCGATAAAA ACATGGAAAA AGCTGCTGGA
TAA
 
Protein sequence
MGEAILVKRE PLWTKEFVAL IFANLCMFLG FQMLIPTLPV YVKEIGGTSS NIGFVVGMFT 
VAALFVRPLT GNALQKFNKK IILMIGTAIC LLAMGSYLFA STIFLLLAVR ILHGAGFGIT
TTTYGTVVSD LIPSARRGEG MGYFGLSGTI AMALGPLIGL WLMQTYNFTI LFLCALSCTI
VSLILTKLLQ IQKTKQPPQQ SSSTFLDGFI ERKALLPSLL ILCITLMYGG IGSFITLFAT
EVGIADISLF FLCNALAIAV TRPFSGRLYD AKGHTFVIIP GVIITFTGII LLSYTTTIPS
LIIAAACYGS GFGAIQPALQ AWMIDRVAPH RRGVATATFF SAFDLGIGAG AIIFGFIAHF
TNYATVYRYS SLLLIAFLFI YITSIRKQKY GDKNMEKAAG