Gene Information |
Locus tag | EcSMS35_4638 |
Symbol | |
ID | 6144675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4738863 |
End bp | 4740395 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641619454 |
Product | hypothetical protein |
Protein accession | YP_001746562 |
Protein GI | 170680990 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
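
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon that is not counted in the protein. Under those assumptions it reproduces the reported Gene Length and Protein Length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4738863
end_bp = 4740395

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1533 510
```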
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000291773 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0355337 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGAAAA ACCCCGTAAG TATACCACAC ACCGTCTGGC ACGCCGACGA TATCCGCCGC GGAGAACGCG AGGCGGTAGA TGCGCTGGGG CTCACACTCT ATGAGCTGAT GCTTCGCGCT GGCGAAGCCG CATTCCAGGT GTGTCGTTCG GCTTATCCTG ACGCCCGCCA CTGGCTGGTG CTGTGCGGTC ATGGTAATAA CGGCGGCGAT GGCTACGTGG TCGCGCGACT GGCCAAAGCG GTCGGTATTG ATGTCACGCT GCTGGCCCAG GAAAGTGACA AACCGTTGCC GGAAGAGGCC GCGCTGGCAC GCGAAGCATG GTTAAACGCG GGAGGCGAGA TCCATGCTTC GAATATTGTC TGGCCCGAAT CGGTAGATCT GATTGTTGAT GCGCTGCTCG GTACCGGATT ACAGCAAGCG CCTCGTGAAT CCATTAGCCA GTTAATCGAC CACGCTAATT CCCATCCTGC GCCGATTGTG GCGGTTGATA TCCCTTCCGG CCTGCTGGCT GAAACTGGCG CTACGCCAGG CGCGGTGATC AACGCCGATC ACACCATCAC TTTTATTGCG CTGAAACCAG GCTTGCTCAC TGGAAAAGCG CGGGATGTTA CAGGACAACT GCATTTTGAC TCACTGGGGC TGGATAGCTG GCTGGCAGGT CAGGAGACGA AAATTCAGCG GTTTTCGGCA GAACAACTTT CTCACTGGCT GAAACCGCGT CGCCCGACTT CGCATAAAGG CGATCACGGG CGGCTGGTGA TTATCGGTGG CGATCACGGC ACGGCGGGGG CTATTCGTAT GACGGGGGAA GCGGCGCTGC GTGCTGGTGC TGGTTTAGTC CGAGTACTGA CCCGCAGTGA GAACATTGCG CCGCTGCTGA CTGCACGACC AGAATTGATG GTGCATGAAC TGACTATGGA CTCTCTTACC GAAAGCCTGG AATGGGCCGA TGTGGTGGTG ATTGGTCCCG GTCTGGGCCA GCAAGAGTGG GGGAAAAAAG CCCTGCAAAA AGTTGAGAAT TTTCGCAAAC CGATGTTGTG GGATGCCGAT GCATTGAACC TGCTGGCAAT CAATCCCGAT AAGCGTCACA ATCGCGTGAT CACGCCGCAT CCTGGCGAGG CCGCACGGTT GTTAGGCTGT TCCGTCGCTG AAATTGAAAG TGACCGCTTA CATTGCGCCC AACGTCTGGT ACAACGTTAT GGCGGCGTAG CGGTGCTGAA AGGTGCCGGA ACCGTGGTCG CCGCCCATTC TGACGCTTTA GGCATTATTG ATGTCGGAAA TGCAGGCATG GCGAGCGGCG GCATGGGCGA TGTGCTCTCT GGTATTATTG GCGCATTGCT TGGGCAAAAA ATGAGCCCTT ATGATGCAGC CTGTGCGGGC TGTGTCGCGC ACGGTGCGGC AGCTGACGTA CTGGCGGCGC GTTTTGGAAC GCGCGGGATG CTGGCAACCG ATCTCTTTTC CACGCTACAG CGTATTGTTA ACCCGGAAGT GACTGATAAA AACCATGATG AATCGAGTAA TTCCGCTCCC TGA
Protein sequence | MKKNPVSIPH TVWHADDIRR GEREAVDALG LTLYELMLRA GEAAFQVCRS AYPDARHWLV LCGHGNNGGD GYVVARLAKA VGIDVTLLAQ ESDKPLPEEA ALAREAWLNA GGEIHASNIV WPESVDLIVD ALLGTGLQQA PRESISQLID HANSHPAPIV AVDIPSGLLA ETGATPGAVI NADHTITFIA LKPGLLTGKA RDVTGQLHFD SLGLDSWLAG QETKIQRFSA EQLSHWLKPR RPTSHKGDHG RLVIIGGDHG TAGAIRMTGE AALRAGAGLV RVLTRSENIA PLLTARPELM VHELTMDSLT ESLEWADVVV IGPGLGQQEW GKKALQKVEN FRKPMLWDAD ALNLLAINPD KRHNRVITPH PGEAARLLGC SVAEIESDRL HCAQRLVQRY GGVAVLKGAG TVVAAHSDAL GIIDVGNAGM ASGGMGDVLS GIIGALLGQK MSPYDAACAG CVAHGAAADV LAARFGTRGM LATDLFSTLQ RIVNPEVTDK NHDESSNSAP
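
The relationship between the two sequences can be illustrated by translating the start of the gene sequence. This is a minimal sketch, not the pipeline that produced the record: the `translate` helper and `CODON_TABLE` are hypothetical names, and the codon-to-amino-acid mapping used is the standard one, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG). Whitespace is stripped first because the record prints sequences in blocks of ten.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record above.
first_blocks = "ATGAAGAAAA ACCCCGTAAG TATACCACAC"
print(translate(first_blocks))  # MKKNPVSIPH
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function should yield the full protein, although that has not been checked here.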