Gene EcSMS35_1416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1416 
SymbolydjK 
ID6143322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1400084 
End bp1401463 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content49% 
IMG OID641616294 
Productmajor facilitator family transporter 
Protein accessionYP_001743474 
Protein GI170683522 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.888006 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAGA TAACAAAACC GCATTGTGGT GCTCGGCTGG ATCGCTTACC GGATTGCCGC 
TGGCATTCGT CAATGTTTGC TATCGTCGCG TTTGGCTTGC TTGTCTGCTG GAGTAATGCC
GTTGGTGGCT TGATCCTCGC GCAGCTGAAA GCGTTGGGCT GGACAGATAA TTCCACCACT
GCCACATTCT CAGCAATCAC GACCGCCGGA ATGTTTCTCG GTGCGCTGGT TGGCGGCATC
ATCGGTGACA AAACCGGTCG CAGAAATGCG TTCATCCTCT ATGAGGCCAT TCATATTGCC
TCGATGGTGG TCGGTGCTTT TTCACCGAAT ATGGATTTCC TTATTGCCTG CCGTTTTGTG
ATGGGCGTTG GGCTGGGAGC TTTACTGGTG ACGCTGTTTG CTGGTTTCAC CGAATATATG
CCCGGTAGAA ATCGCGGAAC GTGGTCCAGT CGGGTTTCTT TTATTGGCAA CTGGTCATAT
CCGCTCTGTT CATTGATTGC GATGGGACTC ACGCCGCTGA TTAGTGCAGA GTGGAACTGG
CGAGTACAAC TGCTTATCCC TGCAATATTG TCGCTTATCG CTACGGCGCT GGCCTGGCGC
TACTTTCCTG AATCCCCGCG CTGGCTGGAA TCGCGCGGAC GGTATCAGGA AGCCGAGAAA
GTCATGCGGA GTATAGAAGA AGGCGTCATA CGCCAGACGG GAAAACCTTT GCCGCCCGTG
GTTATTGCTG ATGACGGTAA AGCGCCACAA GCGGTGCCGT ATTCAGCCTT ACTGACAGGA
GTATTACTGA AACGCGTGAT ATTAGGTTCT TGTGTGCTGA TTGCCATGAA CGTTGTGCAG
TACACACTAA TTAACTGGTT GCCAACAATA TTCATGACCC AGGGGATTAA TTTAAAAGAC
TCGATTGTTT TAAATACCAT GAGTATGTTT GGTGCGCCAT TTGGTATATT TATTGCCATG
CTGGTGATGG ATAAAATTCC GCGTAAAACA ATGGGTGTGG GGCTATTAAT CCTGATTGCG
GTGCTCGGAT ATATCTATTC ACTGCAAACC AGTATGTTGC TCATAACGCT GATTGGTTTC
TTCCTGATTA CTTTCGTCTA TATGTACGTT TGCTATGCCT CGGCAGTGTA TGTCCCTGAG
ATCTGGCCGA CAGAGGCAAA ATTACGTGGC TCCGGTCTGG CGAATGCAGT AGGGCGAATC
AGTGGTATTG CCGCACCTTA TGCCGTTGCA GTGCTGCTCA GTAGTTATGG CGTAACGGGA
GTCTTTATTC TTCTGGGGGC GGTTTCAATT ATTGTCGCAA TTGCTATCGC CACCATTGGA
ATTGAAACCA AAGGTGTCTC CGTTGAAAGT TTAAGTATTG ATGCAGTAGT CAATAAATAA
 
Protein sequence
MEQITKPHCG ARLDRLPDCR WHSSMFAIVA FGLLVCWSNA VGGLILAQLK ALGWTDNSTT 
ATFSAITTAG MFLGALVGGI IGDKTGRRNA FILYEAIHIA SMVVGAFSPN MDFLIACRFV
MGVGLGALLV TLFAGFTEYM PGRNRGTWSS RVSFIGNWSY PLCSLIAMGL TPLISAEWNW
RVQLLIPAIL SLIATALAWR YFPESPRWLE SRGRYQEAEK VMRSIEEGVI RQTGKPLPPV
VIADDGKAPQ AVPYSALLTG VLLKRVILGS CVLIAMNVVQ YTLINWLPTI FMTQGINLKD
SIVLNTMSMF GAPFGIFIAM LVMDKIPRKT MGVGLLILIA VLGYIYSLQT SMLLITLIGF
FLITFVYMYV CYASAVYVPE IWPTEAKLRG SGLANAVGRI SGIAAPYAVA VLLSSYGVTG
VFILLGAVSI IVAIAIATIG IETKGVSVES LSIDAVVNK