Gene ECD_02018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02018 
SymbolgatC 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2070726 
End bp2072081 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content51% 
IMG OID 
Productgalactitol-specific enzyme IIC component of PTS 
Protein accessionACT43844 
Protein GI253978174 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTCAG AAGTCATGCG TTATATTCTC GACCTCGGCC CTACGGTGAT GCTGCCGATT 
GTCATCATTA TTTTTTCTAA AATATTAGGC ATGAAGGCAG GCGATTGCTT TAAAGCGGGT
CTGCATATCG GGATTGGCTT TGTTGGCATT GGCCTTGTGA TTGGCTTAAT GCTGGATTCC
ATTGGCCCGG CGGCGAAAGC GATGGCGGAA AATTTCGACC TGAATCTGCA TGTGGTCGAT
GTTGGCTGGC CGGGCTCTTC ACCAATGACC TGGGCGTCGC AAATTGCGCT GGTGGCGATT
CCGATTGCGA TTCTGGTTAA CGTGGCGATG TTACTGACCC GTATGACGCG GGTGGTAAAT
GTTGATATCT GGAATATCTG GCATATGACC TTCACCGGCG CGTTGCTGCA TCTGGCAACC
GGTTCATGGA TGATAGGGAT GGCAGGTGTG GTAATTCACG CGGCGTTTGT TTATAAGCTC
GGCGACTGGT TTGCCCGCGA TACCCGAAAT TTCTTTGAGC TGGAAGGTAT TGCTATTCCG
CACGGTACGT CGGCGTATAT GGGGCCGATT GCGGTGCTGG TCGATGCTAT CATCGAGAAA
ATCCCAGGCG TTAACCGAAT TAAATTTAGC GCCGACGATA TTCAGCGCAA ATTTGGTCCA
TTTGGCGAGC CTGTCACCGT GGGTTTTGTG ATGGGGCTGA TTATCGGCAT CCTCGCGGGT
TACGATGTCA AAGGTGTATT GCAGCTGGCG GTAAAAACGG CGGCAGTGAT GCTGCTAATG
CCACGGGTGA TTAAACCCAT CATGGATGGT TTAACGCCCA TCGCTAAGCA GGCTCGTAGT
CGTTTACAGG CGAAGTTCGG CGGTCAGGAG TTCCTGATTG GCCTTGATCC GGCGTTGCTG
CTGGGACATA CGGCGGTGGT ATCGGCAAGC CTGATTTTTA TCCCACTCAC CATTTTAATT
GCTGTTTGTG TGCCGGGTAA TCAGGTGCTG CCGTTTGGCG ATCTTGCCAC CATCGGCTTC
TTCGTGGCGA TGGCGGTCGC CGTGCATCGT GGAAATCTGT TCCGCACCTT AATCTCGGGT
GTCATCATTA TGAGCATCAC CCTGTGGATC GCGACGCAAA CTATTGGTTT GCACACCCAA
CTGGCGGCTA ATGCTGGGGC GTTAAAAGCC GGGGGTATGG TGGCTTCAAT GGATCAGGGC
GGTTCTCCCA TTACCTGGTT ACTGATTCAG GTTTTCTCCC CGCAAAATAT TCCCGGTTTC
ATTATTATCG GTGCAATTTA TCTGACCGGT ATTTTCATGA CCTGGCGTAG AGCGCGTGGC
TTTATTAAAC AAGAGAAAGT CGTTCTCGCA GAATAA
 
Protein sequence
MFSEVMRYIL DLGPTVMLPI VIIIFSKILG MKAGDCFKAG LHIGIGFVGI GLVIGLMLDS 
IGPAAKAMAE NFDLNLHVVD VGWPGSSPMT WASQIALVAI PIAILVNVAM LLTRMTRVVN
VDIWNIWHMT FTGALLHLAT GSWMIGMAGV VIHAAFVYKL GDWFARDTRN FFELEGIAIP
HGTSAYMGPI AVLVDAIIEK IPGVNRIKFS ADDIQRKFGP FGEPVTVGFV MGLIIGILAG
YDVKGVLQLA VKTAAVMLLM PRVIKPIMDG LTPIAKQARS RLQAKFGGQE FLIGLDPALL
LGHTAVVSAS LIFIPLTILI AVCVPGNQVL PFGDLATIGF FVAMAVAVHR GNLFRTLISG
VIIMSITLWI ATQTIGLHTQ LAANAGALKA GGMVASMDQG GSPITWLLIQ VFSPQNIPGF
IIIGAIYLTG IFMTWRRARG FIKQEKVVLA E