Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0972 |
Symbol | gatC |
ID | 6144264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 981806 |
End bp | 983161 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615859 |
Product | PTS system, galactitol-specific IIC component |
Protein accession | YP_001743051 |
Protein GI | 170683704 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3775] Phosphotransferase system, galactitol-specific IIC component |
TIGRFAM ID | [TIGR00827] PTS system, galactitol-specific IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.487394 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCAG AAGTCATGCG TTATATTCTC GACCTCGGCC CTACGGTGAT GCTGCCGATT GTCATCATTA TTTTTTCTAA AATATTAGGC ATGAAGGCAG GCGATTGCTT TAAAGCGGGT CTGCATATCG GGATTGGCTT TGTTGGCATT GGCCTTGTGA TTGGCTTAAT GCTGGATTCC ATTGGTCCGG CGGCGAAAGC GATGGCGGAA AATTTCGACC TGAATCTGCA TGTGGTCGAT GTCGGCTGGC CGGGCTCTTC ACCCATGACC TGGGCGTCGC AAATTGCGCT GGTGGCGATT CCGATTGCGA TTCTGGTTAA CGTGGCGATG TTACTGACCC GTATGACGCG GGTGGTAAAT GTTGATATCT GGAATATCTG GCATATGACC TTCACCGGCG CGTTGCTGCA TCTGGCAACC GGTTCATGGA TGATAGGGAT GGCGGGTGTG GTAATTCACG CGGCGTTTGT TTATAAGCTC GGCGACTGGT TTGCCCGCGA TACCCGAAAT TTCTTTGAGC TGGAAGGCAT TGCCATCCCG CACGGTACGT CGGCGTATAT GGGGCCAATT GCGGTGCTGG TGGATGCTAT CATCGAGAAA ATCCCAGGCG TTAACCGAAT TAAATTTAGT GCCGACGATA TTCAGCGCAA ATTTGGTCCG TTTGGCGAGC CTGTCACCGT GGGTTTTGTG ATGGGGCTGA TTATCGGCAT CCTCGCGGGT TACGATGTCA AAGGTGTATT GCAGCTGGCG GTAAAAACGG CGGCGGTTAT GCTGCTGATG CCACGGGTGA TTAAACCCAT CATGGATGGT TTAACGCCCA TCGCTAAGCA GGCCCGTAGT CGTTTACAGG CGAAGTTCGG CGGTCAGGAG TTCCTGATTG GCCTGGATCC CGCATTACTG CTGGGGCATA CGGCGGTGGT ATCGGCAAGC CTGATTTTTA TCCCGCTCAC CATTTTAATT GCTGTTTGTG TGCCGGGTAA TCAGGTGCTG CCGTTTGGCG ACCTTGCCAC CATCGGCTTC TTTGTGGCGA TGGCGGTCGC GGTGCATCGT GGAAATCTGT TCCGCACCTT AATCTCGGGT GTCATCATTA TGAGCATCAC CCTGTGGATC GCGACGCAAA CTATTGGTTT GCACACCCAA CTGGCGGCTA ATGCGGGCGC GTTAAAAGCC GGGGGTATGG TGGCTTCAAT GGATCAGGGC GGTTCTCCCA TTACCTGGTT ACTGATTCAG GTTTTCTCCC CGCAAAATAT TCCCGGTTTC ATTATTATCG GCGCAATTTA TCTGACCGGT ATTTTCATGA CCTGGCGTAG AGCGCGTGGT TTTATTAAAC AAGAGAAAGC CGTTCTCGCA GAATAA
|
Protein sequence | MFSEVMRYIL DLGPTVMLPI VIIIFSKILG MKAGDCFKAG LHIGIGFVGI GLVIGLMLDS IGPAAKAMAE NFDLNLHVVD VGWPGSSPMT WASQIALVAI PIAILVNVAM LLTRMTRVVN VDIWNIWHMT FTGALLHLAT GSWMIGMAGV VIHAAFVYKL GDWFARDTRN FFELEGIAIP HGTSAYMGPI AVLVDAIIEK IPGVNRIKFS ADDIQRKFGP FGEPVTVGFV MGLIIGILAG YDVKGVLQLA VKTAAVMLLM PRVIKPIMDG LTPIAKQARS RLQAKFGGQE FLIGLDPALL LGHTAVVSAS LIFIPLTILI AVCVPGNQVL PFGDLATIGF FVAMAVAVHR GNLFRTLISG VIIMSITLWI ATQTIGLHTQ LAANAGALKA GGMVASMDQG GSPITWLLIQ VFSPQNIPGF IIIGAIYLTG IFMTWRRARG FIKQEKAVLA E
|
| |