Gene EcSMS35_4498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4498 
SymbollamB 
ID6142859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4595521 
End bp4596861 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content52% 
IMG OID641619314 
Productmaltoporin 
Protein accessionYP_001746426 
Protein GI170684303 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4580] Maltoporin (phage lambda and maltose receptor) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGATTA CTCTGCGCAA ACTTCCTCTG GCGGTTGCCG TCGCAGCGGG CGTAATGTCT 
GCTCAGGCAA TGGCTGTTGA TTTCCACGGC TATGCACGTT CCGGTATTGG CTGGACAGGT
AGCGGCGGTG AACAACAGTG TTTCCAGACT ACCGGTGCTC AAAGTAAATA CCGTCTTGGC
AACGAATGTG AAACTTATGC TGAATTAAAA TTGGGTCAGG AAGTGTGGAA AGAGGGCGAT
AAGAGCTTCT ATTTCGACAC TAACGTGGCC TATTCCGTCG CACAACAGAA TGACTGGGAA
GCTACCGATC CGGCCTTCCG TGAAGCAAAC GTGCAGGGTA AAAACCTGAT CGAATGGCTG
CCAGGCTCCA CCATCTGGGC AGGTAAGCGC TTCTACCAAC GTCATGACGT TCATATGATC
GACTTCTACT ACTGGGATAT TTCTGGTCCT GGTGCCGGTC TGGAAAACAT CGATGTTGGC
TTCGGTAAAC TCTCTCTGGC AGCAACCCGC TCCTCTGAAG CTGGTGGTTC TTCCTCTTTC
GCCAGCAACA ATATTTATGA CTATACCAAC GAAACCGCGA ACGACGTTTT CGACGTGCGT
TTAGCGCAGA TGGAAATCAA CCCGGGCGGC ACATTAGAAC TGGGTGTCGA CTACGGTCGT
GCCAACCTGC GTGATAACTA TCGTCTGGTT GATGGCGCAT CGAAAGACGG TTGGTTATTC
ACTGCTGAAC ATACTCAGAG TGTCCTGAAG GGCTTTAACA AGTTTGTTGT TCAGTACGCT
ACTGACTCGA TGACCTCACA GGGTAAAGGT CTGTCGCAGG GTTCTGGCGT CGCGTTTGAT
AACGAAAAAT TTGCCTACAA TATCAACAAC AACGGTCACA TGCTGCGTAT CCTCGACCAC
GGTGCGATCT CCATGGGCGA TAACTGGGAC ATGATGTACG TGGGTATGTA CCAGGATATC
AACTGGGATA ACGACAACGG CACCAAGTGG TGGACCGTCG GTATTCGCCC GATGTACAAG
TGGACGCCAA TCATGAGCAC CGTGATGGAA ATCGGCTACG ACAACGTCGA ATCCCAGCGC
ACCGGCGACA AGAACAATCA GTACAAAATT ACCCTTGCAC AACAATGGCA GGCTGGCGAC
AGCATCTGGT CACGCCCGGC TATTCGTGTC TTCGCAACCT ACGCCAAGTG GGATGAGAAA
TGGGGTTACG ACTACACCGG TAACGCCAAT ACCAACACTA ACTTCGGCAA AGCCGTTCCT
GCTGATTTCA ACGGCGGCAG CTTCGGTCGT GGCGACAGCG ACGAGTGGAC CTTCGGTGCC
CAGATGGAAA TCTGGTGGTA A
 
Protein sequence
MMITLRKLPL AVAVAAGVMS AQAMAVDFHG YARSGIGWTG SGGEQQCFQT TGAQSKYRLG 
NECETYAELK LGQEVWKEGD KSFYFDTNVA YSVAQQNDWE ATDPAFREAN VQGKNLIEWL
PGSTIWAGKR FYQRHDVHMI DFYYWDISGP GAGLENIDVG FGKLSLAATR SSEAGGSSSF
ASNNIYDYTN ETANDVFDVR LAQMEINPGG TLELGVDYGR ANLRDNYRLV DGASKDGWLF
TAEHTQSVLK GFNKFVVQYA TDSMTSQGKG LSQGSGVAFD NEKFAYNINN NGHMLRILDH
GAISMGDNWD MMYVGMYQDI NWDNDNGTKW WTVGIRPMYK WTPIMSTVME IGYDNVESQR
TGDKNNQYKI TLAQQWQAGD SIWSRPAIRV FATYAKWDEK WGYDYTGNAN TNTNFGKAVP
ADFNGGSFGR GDSDEWTFGA QMEIWW