Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3071 |
Symbol | gatC |
ID | 6966799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2843259 |
End bp | 2844614 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643386903 |
Product | PTS system, galactitol-specific IIC component |
Protein accession | YP_002271371 |
Protein GI | 209397449 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3775] Phosphotransferase system, galactitol-specific IIC component |
TIGRFAM ID | [TIGR00827] PTS system, galactitol-specific IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0000414376 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTTCAG AAGTCATGCG TTATATTCTC GATCTCGGCC CTACGGTGAT GTTGCCGATT GTCATCATTA TTTTTTCTAA AATTTTAGGC ATGAAGGCAG GCGATTGCTT TAAAGCGGGT CTGCATATCG GGATTGGCTT TGTTGGCATT GGCCTTGTGA TTGGCTTAAT GCTGGATTCC ATTGGCCCGG CGGCGAAAGC GATGGCGGAA AATTTCGACC TGAATCTGCA TGTGGTCGAT GTCGGCTGGC CGGGCTCTTC ACCAATGACC TGGGCGTCGC AAATTGCGCT GGTGGCGATT CCGATTGCGA TTCTGGTTAA CGTGGCGATG TTACTGACCC GTATGACGCG GGTGGTAAAT GTTGATATCT GGAATATCTG GCATATGACC TTCACCGGCG CGTTGCTGCA TCTGGCAACC GGTTCATGGA TGATAGGGAT GGCAGGTGTG GTAATTCACG CGGCGTTTGT TTATAAGCTC GGCGACTGGT TTGCCCGCGA TACCCGAAAT TTCTTTGAGC TGGAAGGCAT TGCTATTCCG CACGGTACGT CGGCGTATAT GGGGCCGATT GCGGTGCTGG TCGATGCTAT CATCGAGAAA ATCCCAGGCG TTAACCGTAT TAAATTTAGC GCCGACGATA TTCAGCGCAA ATTTGGCCCG TTTGGCGAGC CTGTCACCGT GGGTTTTGTG ATGGGGCTGA TTATCGGCAT CCTCGCGGGT TACGATGTCA AAGGTGTATT GCAGCTGGCG GTAAAAACGG CGGCGGTGAT GCTGCTAATG CCACGGGTGA TTAAACCCAT CATGGATGGT TTAACGCCCA TCGCTAAGCA GGCCCGTAGT CGTTTACAGG CGAAGTTCGG CGGTCAGGAG TTCCTGATTG GCCTGGATCC CGCATTACTG CTGGGGCATA CGGCGGTGGT ATCGGCAAGC CTGATTTTTA TTCCGCTCAC CATTTTAATT GCTGTTTGTG TTCCTGGTAA TCAGGTACTG CCGTTTGGCG ATCTTGCCAC TATCGGCTTC TTCGTGGCGA TGGCGGTCGC GGTGCATCGT GGAAATCTGT TCCGCACCTT AATCTCGGGT GTCATCATTA TGAGCATCAC CCTGTGGATC GCGACGCAAA CTATTGGTTT GCACACCCAA CTGGCGGCTA ATGCTGGGGC GTTAAAAGCC GGGGGTATGG TGGCTTCAAT GGATCAGGGC GGTTCTCCCA TTACCTGGTT ACTGATTCAG GTTTTCTCCC CGCAAAATAT TCCCGGTTTC ATTATTATCG GCGCAATTTA TCTTACCGGT ATTTTCATGA CCTGGCGTAG AGCGCGTGGC TTTATTAAAC AAGAGAAAGT CGTTCTCGCA GAATAA
|
Protein sequence | MFSEVMRYIL DLGPTVMLPI VIIIFSKILG MKAGDCFKAG LHIGIGFVGI GLVIGLMLDS IGPAAKAMAE NFDLNLHVVD VGWPGSSPMT WASQIALVAI PIAILVNVAM LLTRMTRVVN VDIWNIWHMT FTGALLHLAT GSWMIGMAGV VIHAAFVYKL GDWFARDTRN FFELEGIAIP HGTSAYMGPI AVLVDAIIEK IPGVNRIKFS ADDIQRKFGP FGEPVTVGFV MGLIIGILAG YDVKGVLQLA VKTAAVMLLM PRVIKPIMDG LTPIAKQARS RLQAKFGGQE FLIGLDPALL LGHTAVVSAS LIFIPLTILI AVCVPGNQVL PFGDLATIGF FVAMAVAVHR GNLFRTLISG VIIMSITLWI ATQTIGLHTQ LAANAGALKA GGMVASMDQG GSPITWLLIQ VFSPQNIPGF IIIGAIYLTG IFMTWRRARG FIKQEKVVLA E
|
| |