Gene EcolC_1620 details

Gene Information

Locus tag: EcolC_1620
Symbol:
ID: 6066075
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1800839
End bp: 1801906
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 54%
IMG OID: 641601035
Product: imidazole glycerol-phosphate dehydratase/histidinol phosphatase
Protein accession: YP_001724605
Protein GI: 170019651
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0131] Imidazoleglycerol-phosphate dehydratase
        [COG0241] Histidinol phosphatase and related phosphatases
TIGRFAM ID: [TIGR01261] histidinol-phosphatase
            [TIGR01656] histidinol-phosphate phosphatase family domain
            [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA
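
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1800839
END_BP = 1801906
GENE_LENGTH_BP = 1068
PROTEIN_LENGTH_AA = 355


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1068
print(coding_codons(GENE_LENGTH_BP))   # 355
```

Under those assumptions, both results agree with the Gene Length and Protein Length values listed in the record.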


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.522162
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.800439
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAGA AGTATCTTTT TATCGATCGC GATGGAACCC TGATTAGCGA ACCGCCGAGT 
GATTTTCAGG TGGACCGTTT TGACAAACTC GCCTTTGAAC CGGGCGTGAT CCCGGAGCTG
CTAAAGCTGC AAAAAGCGGG CTACAAGCTG GTGATGATCA CTAATCAGGA TGGTCTTGGA
ACACAAAGTT TCCCGCAGGC GGATTTCGAT GGCCCGCACA ACCTGATGAT GCAGATCTTC
ACCTCGCAAG GCGTACAGTT TGATGAAGTG CTGATTTGTC CGCACCTGCC CGCCGATGAG
TGCGACTGCC GTAAGCCGAA AGTAAAACTG GTGGAGCGTT ATCTGGCTGA GCAAGCGATG
GATCGCGCCA ACAGTTATGT GATTGGCGAT CGCGCGACCG ACATTCAACT GGCGGAAAAC
ATGGGCATTA ATGGTTTACG CTACGACCGC GAAACCCTGA ACTGGCCGAT GATTGGCGAG
CAACTCACTA AACGAGACCG TTACGCCCAT GTAGTGCGCA ACACCAAAGA GACGCAAATT
GACGTCCAGG TGTGGCTGGA TCGTGAAGGT GGCAGCAAGA TTAATACCGG CGTTGGCTTC
TTTGATCACA TGCTGGATCA GATCGCCACC CACGGCGGTT TCCGTATGGA AATCAACGTC
AAAGGCGACC TCTATATCGA CGATCACCAC ACCGTCGAAG ATACCGGCCT GGCGCTGGGT
GAAGCGTTAA AAATTGCCCT TGGCGACAAA CGCGGTATTT GCCGCTTTGG TTTTGTGCTG
CCGATGGACG AATGCCTTGC CCGCTGCGCG CTGGATATCT CTGGTCGCCC GCACCTGGAA
TATAAAGCCG AGTTTACCTA CCAGCGCGTG GGCGATCTCA GCACCGAGAT GATCGAGCAC
TTCTTCCGTT CGCTCTCATA CACCATGGGC GTGACCCTGC ACCTGAAAAC CAAAGGTAAA
AACGATCACC ACCGTGTAGA GAGCCTGTTC AAAGCCTTTG GTCGCACCCT GCGCCAGGCC
ATCCGCGTGG AAGGCGATAC CCTGCCCTCG TCGAAAGGAG TGCTGTAA
 
Protein sequence
MSQKYLFIDR DGTLISEPPS DFQVDRFDKL AFEPGVIPEL LKLQKAGYKL VMITNQDGLG 
TQSFPQADFD GPHNLMMQIF TSQGVQFDEV LICPHLPADE CDCRKPKVKL VERYLAEQAM
DRANSYVIGD RATDIQLAEN MGINGLRYDR ETLNWPMIGE QLTKRDRYAH VVRNTKETQI
DVQVWLDREG GSKINTGVGF FDHMLDQIAT HGGFRMEINV KGDLYIDDHH TVEDTGLALG
EALKIALGDK RGICRFGFVL PMDECLARCA LDISGRPHLE YKAEFTYQRV GDLSTEMIEH
FFRSLSYTMG VTLHLKTKGK NDHHRVESLF KAFGRTLRQA IRVEGDTLPS SKGVL
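
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch and not the tool used to generate the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in the permitted start codons), and it does not handle alternative start codons or ambiguous bases. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAGTCAGA AGTATCTTTT TATCGATCGC"
print(translate(first_blocks))  # MSQKYLFIDR
```

Passing the full gene sequence to `translate` and `gc_percent` allows the listed protein sequence and GC content to be checked against the record.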