Gene SbBS512_E2135 details


Gene Information

Locus tag: SbBS512_E2135
Symbol: malE
ID: 6271526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 1943662
End bp: 1944894
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 40%
IMG OID: 641726169
Product: maltose/maltodextrin-binding periplasmic protein MalE
Protein accession: YP_001880661
Protein GI: 187731820
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:
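
The coordinates and lengths listed above are mutually consistent under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1943662, 1944894)
print(length)                     # 1233
print(protein_length_aa(length))  # 410
```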


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value: 0.905565
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCTTT CAAATATCGT CACAGTTATA ATTCTTGCAA TTTCCAGTAC GCTAACACCC 
CAGGCGATGG CAGAAAAATT AATTCCTGAA ACTGATGCTG AGTTGTTAGT TTGGTCTGAT
GCCACAAGCG TGGAATATAT GAAATATGCA GCAAAGGAAT TCAATAAAGA TTTTGGCTAC
AAGGTAAAGT TTACATTTCG CAATATAGCG CCAATGGATG CAGCATCAAG AATTATGCAG
GATGGGGGTA CGACTCGTGT AGCTGATGTA GCTGAAATTG AACATGATAC CCTGGGGCGG
TTAGTCGTTG CTGGCGGGGT TATGGAAAAC ATGGTCTCAG CTGAGCGGAT TAAAAAAACA
TTTATTCCAG GCGCAGTATC GGCAGCTACA TATAATAACA TCAGCTATGG TTTTCCTGTA
AGTTTCGCAA CGCTGGCGCT TTTCTATAAT AAGGATTTGT TAAACACCGC ACCAAAAACA
TTCGAAGAAA TCAATACTTT CAGTGAAAAG TTTAATAATT CATCCGAGCA TAAATATGCT
CTGCTATGGG ATGTACAAAA TTATTATGTT TCACGTATGT TTATTACCTT GTATGGTGCC
AACGAATTCG GTAAAATCGG TAACGATCCT AAAGCTCTAG GCATCGCTTC ATCTGAAGCG
AAGAAAGGGC TAGAGACGAT GAAACGCTTA AAGAAAGCGA ATCCCTCTAA TCCTCTTGAT
ATGGGTAATC CACAAGTTCT AAGAGGTCTG TTTAATGAAG GTAAAGTTGC TGCTGTAATC
GACGGACCTT GGTCCATACA AGGTTACATT GACAGCGGAA TCAATTTTGG CGTGACACGC
ATCCCAACAT TAGATGGTCA TCAGCCTCGC ACATTTTCAA CAGTACGGCT GGCCGTTGTA
AGCTCATTTA CCGCATATCC TCATGCCGCT GAATTATTCG CTGACTACCT GACAACGGAT
AAGATGTTGA TGAAGCGTTA TGAAATGACG AATCTCATCC CACCAATCGA TTCACTTATG
AACAAAATTA GCCAGACAGG TAGTGAAGCT ATAAAAGCAA TTATTGCTCA AGCTAATTAT
TCTGACGCAA TGCCATCAAT ACCGGAAATG TCTTATTTAT GGTCTCCAAT GACTAATGCT
ATTTTGGCTA CCTGGGTTGA GAATAAAACA CCGGATGAAG TTTTAAATCA TGCACAAACA
ATTATTGAAG AACAACTTTC GCTTCAGGAG TAA
 
Protein sequence
MKLSNIVTVI ILAISSTLTP QAMAEKLIPE TDAELLVWSD ATSVEYMKYA AKEFNKDFGY 
KVKFTFRNIA PMDAASRIMQ DGGTTRVADV AEIEHDTLGR LVVAGGVMEN MVSAERIKKT
FIPGAVSAAT YNNISYGFPV SFATLALFYN KDLLNTAPKT FEEINTFSEK FNNSSEHKYA
LLWDVQNYYV SRMFITLYGA NEFGKIGNDP KALGIASSEA KKGLETMKRL KKANPSNPLD
MGNPQVLRGL FNEGKVAAVI DGPWSIQGYI DSGINFGVTR IPTLDGHQPR TFSTVRLAVV
SSFTAYPHAA ELFADYLTTD KMLMKRYEMT NLIPPIDSLM NKISQTGSEA IKAIIAQANY
SDAMPSIPEM SYLWSPMTNA ILATWVENKT PDEVLNHAQT IIEEQLSLQE
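
The gene and protein sequences above are displayed in blocks of ten characters, so the spacing has to be removed before they can be compared. The sketch below, in plain Python with no external libraries, shows one way to translate the gene sequence and check it against the protein sequence. It assumes that translation table 11 assigns the same amino acids as the standard code (the two differ only in permitted start codons), and it does not handle alternative start codons or ambiguous bases. The names `clean` and `translate` are hypothetical helpers, not part of any existing tool.

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 uses the same amino-acid assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove display spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed line of the gene sequence and of the protein sequence.
first_dna = "ATGAAGCTTT CAAATATCGT CACAGTTATA ATTCTTGCAA TTTCCAGTAC GCTAACACCC"
first_protein = "MKLSNIVTVI ILAISSTLTP"
print(translate(first_dna) == clean(first_protein))  # True
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence applies the same check to the whole record.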