Gene BCG9842_B0037 details


Gene Information

Locus tag: BCG9842_B0037
Symbol:
ID: 7184548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 4962786
End bp: 4964000
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 38%
IMG OID: 643552984
Product: major facilitator family transporter
Protein accession: YP_002448626
Protein GI: 218900215
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
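
The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (neither is stated in the record), the arithmetic can be checked like this. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4962786, 4964000)
print(length)                     # 1215
print(protein_length_aa(length))  # 404
```

Both results agree with the Gene Length and Protein Length values listed in the record.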


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0000000189169
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGAGTTTAT ATACAGTACC ATCTTCAGAA GTTCAGACGA ATCGAAGAAG TATGTTTGCT 
TTATTAGCAC TAGCAATTAG TGCATTTGGG ATTGGTACGA CCGAGTTTAT TAGTGTCGGT
TTACTACCAT CTATTTCGAA AGATTTAAAT GTTTCGGTTA CAACAGCTGG TTTAACCGTT
TCTTTATATG CATTAGGAGT AGCGTTTGGT GCTCCAGTAT TGACGTCATT AACAGCTAGT
ATGTCGCGAA AGACGTTGTT AATGTGGATT ATGATTATTT TCATTATTGG CAATGGTATT
GCGGCAGTAG CAACAAGTTT TACTGTGCTA CTTATCGCGA GGGTTGTATC TGCGTTTGCG
CATGGGGTTT TTATGTCAAT TGGATCTACA ATTGCGGCTG CGCTTGTACC TGAGAATAAA
CGTGCAAGTG CAATAGCATT TATGTTTACT GGTTTAACTG TTGCGACTAT TACAGGTGTA
CCAATTGGAA CATTTATCGG TCAACAATTT GGCTGGAGAG CATCGTTTAT GGTTATTGTG
GCAATTGGAG TCGTTGCTTT AATCGCCAAC AGTATGCTAA TTCCTTCTAA CTTAAAAAAG
GGTACGTCTG TATCATTTCA CGATCAATTT AAGTTGATTA CGAACGGAAG ACTATTACTT
GTATTTATCA TTACTGCATT AGGATACGGG GGAACATTCG TAACATTTAC GTATTTATCT
CCGTTATTAC AAGAAGTAAC AGGATTTAAA GCAAGTACCG TTACAATTAT TTTGTTAGTA
TACGGAATTG CAATTGCGAT TGGGAATATG GTCGGCGGAA AATTATCAAA TTATAATCCG
ATTCGAGCGT TATTTTACAT GTTCTTTATT CAAGCGATTG TATTATTTGT TTTAACATTT
ACAGCGCCAT TTAAAGTTGC TGGATTCATT ACAATTATTT TTATGGGACT ATTTGCGTTT
ATGAATGTCC CAGGGCTACA AGTGTATGTC GTTATCTTGG CTGAGAGGTT TGTACCTAGC
GCTGTTGATG TAGCATCAGC GATTAACATT GCAGCATTTA ACGCAGGAAT TGCTCTTGGT
GCTTACTTGG GCGGTATTGT AACGAACTCG TTAGGATTAA TTCATACGAC TTGGGTAGGC
GGAATTATGG TAGTAGGTGC TGTGATTTTA ACAGCGTGGA GCATGACATT AGAAAAAAGA
GATCAAGTAA AATAA
 
Protein sequence
MSLYTVPSSE VQTNRRSMFA LLALAISAFG IGTTEFISVG LLPSISKDLN VSVTTAGLTV 
SLYALGVAFG APVLTSLTAS MSRKTLLMWI MIIFIIGNGI AAVATSFTVL LIARVVSAFA
HGVFMSIGST IAAALVPENK RASAIAFMFT GLTVATITGV PIGTFIGQQF GWRASFMVIV
AIGVVALIAN SMLIPSNLKK GTSVSFHDQF KLITNGRLLL VFIITALGYG GTFVTFTYLS
PLLQEVTGFK ASTVTIILLV YGIAIAIGNM VGGKLSNYNP IRALFYMFFI QAIVLFVLTF
TAPFKVAGFI TIIFMGLFAF MNVPGLQVYV VILAERFVPS AVDVASAINI AAFNAGIALG
AYLGGIVTNS LGLIHTTWVG GIMVVGAVIL TAWSMTLEKR DQVK
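
The gene and protein sequences above can be related with a short translation sketch. It assumes the standard codon assignments, which NCBI translation table 11 shares with table 1, and it reads an alternative start codon such as TTG as methionine when it is the first codon. The helper names (`clean`, `translate_cds`, `gc_percent`) are hypothetical, and this is an illustration rather than the method the database used.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100 * sum(base in "GC" for base in dna) / len(dna)


first_row = "TTGAGTTTAT ATACAGTACC ATCTTCAGAA GTTCAGACGA ATCGAAGAAG TATGTTTGCT"
print(translate_cds(first_row))  # MSLYTVPSSEVQTNRRSMFA
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the protein sequence and the GC content field to be checked the same way.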