Gene EcSMS35_1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1371 
SymbolmanX 
ID6143134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1357074 
End bp1358045 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content50% 
IMG OID641616249 
ProductPTS system, mannose-specific IIAB component 
Protein accessionYP_001743429 
Protein GI170680281 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2893] Phosphotransferase system, mannose/fructose-specific component IIA
[COG3444] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB 
TIGRFAM ID[TIGR00824] PTS system, mannose/fructose/sorbose family, IIA component
[TIGR00854] PTS system, mannose/fructose/sorbose family, IIB component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000647863 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0229749 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCATTG CTATTGTTAT AGGCACACAT GGTTGGGCTG CAGAGCAGTT GCTTAAAACC 
GCAGAAATGC TGTTAGGCGA GCAGGAAAAC GTCGGCTGGA TCGATTTCGT TCCAGGTGAA
AATGCCGAAA CGCTGATTGA AAAGTACAAC GCTCAGTTGG CAAAACTCGA CACCACTAAA
GGCGTGCTGT TTCTCGTTGA TACATGGGGA GGCAGCCCAT TCAATGCTGC CAGCCGCATT
GTCGTCGACA AAGAGCATTA TGAAGTCATT GCAGGCGTTA ACATTCCAAT GCTCGTGGAA
ACGTTAATGG CCCGTGATGA TGACCCAAGC TTTGATGAAC TGGTTGCGCT GGCAGTAGAA
ACAGGCCGTG AAGGCGTGAA AGCTCTGAAA GCCAAACCGG TTGAAAAAGC CGCGCCAGCA
CCCGCTGCCG CAGCACCAAA AGCGGCTCCA ACTCCGGCAA AACCAATGGG ACCAAACGAC
TACATGGTTA TTGGCCTTGC GCGTATCGAC GACCGTCTGA TTCACGGTCA GGTCGCCACC
CGCTGGACCA AAGAAACCAA TGTCTCCCGT ATTATTGTTG TTAGTGATGA AGTGGCTGCG
GATACCGTTC GTAAGACACT GCTCACCCAG GTTGCACCTC CGGGCGTAAC AGCACACGTA
GTTGATGTTG CCAAAATGAT TCGCGTCTAC AACAACCCGA AATATGCTGG CGAACGCGTA
ATGCTGTTAT TTACCAACCC AACAGATGTA GAGCGTCTCG TTGAAGGCGG CGTGAAAATC
ACCTCTGTTA ACGTCGGTGG TATGGCATTC CGTCAGGGTA AAACCCAGGT GAATAACGCG
GTTTCGGTTG ATGAAAAAGA TATCGAGGCG TTCAAGAAAC TGAATGCGCG CGGTATTGAG
CTGGAAGTCC GTAAGGTTTC CACCGATCCG AAACTGAAAA TGATGGATCT GATCAGCAAA
ATCGATAAGT AA
 
Protein sequence
MTIAIVIGTH GWAAEQLLKT AEMLLGEQEN VGWIDFVPGE NAETLIEKYN AQLAKLDTTK 
GVLFLVDTWG GSPFNAASRI VVDKEHYEVI AGVNIPMLVE TLMARDDDPS FDELVALAVE
TGREGVKALK AKPVEKAAPA PAAAAPKAAP TPAKPMGPND YMVIGLARID DRLIHGQVAT
RWTKETNVSR IIVVSDEVAA DTVRKTLLTQ VAPPGVTAHV VDVAKMIRVY NNPKYAGERV
MLLFTNPTDV ERLVEGGVKI TSVNVGGMAF RQGKTQVNNA VSVDEKDIEA FKKLNARGIE
LEVRKVSTDP KLKMMDLISK IDK