Gene EcSMS35_1621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1621 
SymbolydfJ 
ID6146155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1610990 
End bp1612357 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content47% 
IMG OID641616497 
Productinner membrane metabolite transport protein ydfJ 
Protein accessionYP_001743675 
Protein GI170680764 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00533306 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATAG AAAAACACGA AAGAAGCACT AAGGATTTGG TGAAAGCAGC AGTATCGGGA 
TGGCTGGGCA CTGCGCTTGA ATTTATGGAT TTCCAGTTAT ATTCGCTCGG CGCAGCGTTA
GTGTTTCATG AAATATTTTT TCCTGAATCA TCAACGGCAA TGGCGTTAAT TCTGGCAATG
GGAACCTACG GTGCAGGTTA TGTGGCGCGT ATTGTCGGAG CATTTATTTT CGGCAAAATG
GGCGACAGAA TCGGGCGTAA AAAAGTGCTC TTTATTACCA TAACTATGAT GGGGATCTGT
ACCACCTTAA TTGGTGTGCT GCCGACCTAT GCACAAATTG GTGTGTTTGC GCCAATTTTG
CTGGTGACGC TGCGTATTAT TCAGGGGCTG GGGGCAGGCG CGGAAATTTC CGGTGCCGGT
ACGATGCTGG CGGAATATGC GCCAAAAGGT AAGCGCGGAA TTATCTCCTC ATTTGTTGCT
ATGGGGACTA ACTGCGGAAC CTTGAGCGCA ACGGCAATCT GGGCCTTTAT GTTCTTCATT
CTCAGTAAAG AGGAACTGTT GGCGTGGGGA TGGCGTATAC CGTTCCTGGC GAGCGTTGTC
GTGATGGTCT TTGCTATCTG GTTGCGTATG AATCTGAAAG AAAGCCCGGT CTTTGAGAAG
GTTAACGACA GCAACCAACC TACAGCAAAA CCTGCACCTG CTGGTAGCAT GTTCCAGAGC
AAATCCTTCT GGCTGGCAAC AGGGCTGCGT TTTGGTCAGG CTGGTAACTC AGGTTTAATT
CAGACTTTCC TTGCAGGCTA TTTAGTGCAG ACGTTATTGT TTAACAAAGC AATTCCAACA
GATGCATTGA TGATCAGTTC GATTCTCGGC TTTATGACCA TTCCGTTCCT TGGTTGGTTA
TCCGATAAAA TTGGTCGCCG GATCCCGTAT ATTATTATGA ATACCTCTGC GATTGTGCTG
GCATGGCCAA TGCTTTCTAT CATCGTAGAT AAAAGCTATG CCCCGAGCAC CATTATGGTT
GCACTGATTG TGATTCATAA CTGTGCGGTG CTGGGATTAT TTGCTCTGGA AAACATTACC
ATGGCAGAAA TGTTCGGCTG TAAAAACCGC TTTACCCGGA TGGCCATTTC TAAAGAAATT
GGTGGTCTTA TCGCTTCCGG TTTTGGTCCT ATCCTGGCGG GTATTTTCTG CACCATGACG
GAATCCTGGT ATCCGATCGC AATTATGATC ATGGCATATT CAGTGATTGG TTTAATTTCT
GCGCTGAAAA TGCCAGAAGT GAAAGACCGT GATTTAAGTG CGCTGGAAGA CGCTGCGGAA
GATCAACCGC ATGTTGTCAG AGCTGCGCAA CCTTCCAGAA GTCTTTAA
 
Protein sequence
MTIEKHERST KDLVKAAVSG WLGTALEFMD FQLYSLGAAL VFHEIFFPES STAMALILAM 
GTYGAGYVAR IVGAFIFGKM GDRIGRKKVL FITITMMGIC TTLIGVLPTY AQIGVFAPIL
LVTLRIIQGL GAGAEISGAG TMLAEYAPKG KRGIISSFVA MGTNCGTLSA TAIWAFMFFI
LSKEELLAWG WRIPFLASVV VMVFAIWLRM NLKESPVFEK VNDSNQPTAK PAPAGSMFQS
KSFWLATGLR FGQAGNSGLI QTFLAGYLVQ TLLFNKAIPT DALMISSILG FMTIPFLGWL
SDKIGRRIPY IIMNTSAIVL AWPMLSIIVD KSYAPSTIMV ALIVIHNCAV LGLFALENIT
MAEMFGCKNR FTRMAISKEI GGLIASGFGP ILAGIFCTMT ESWYPIAIMI MAYSVIGLIS
ALKMPEVKDR DLSALEDAAE DQPHVVRAAQ PSRSL