Gene EcolC_1591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1591 
Symbol 
ID6065933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1767595 
End bp1768818 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content55% 
IMG OID641601007 
Productputative glycosyl transferase 
Protein accessionYP_001724577 
Protein GI170019623 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.933401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAC TGGTCTACGG CATTAACTAC TCGCCGGAGT TAACCGGCAT CGGCAAATAC 
ACCGGAGAGA TGGTGGAATG GCTGGCGGCA CAAGGTCATG AGGTGCGGGT TATTACCGCA
CCGCCTTACT ACCCGCAATG GCAGGTGGGC GAGAACTATT CCGCCTGGCG GTACAAACGA
GAAGAGGGGG CCGCCACGGT GTGGCGCTGC CCGCTGTACG TGCCAAAACA GCCGAGCACC
CTGAAACGCC TGTTACATCT CGGCAGTTTT GCCGTCAGCA GTTTTTTTCC TCTGATGGCG
CAACGTCGCT GGAAGCCGGA TCGCATTATC GGCGTGGTGC CAACGCTGTT TTGCACGCCG
GGAATGCGCC TGCTGGCGAA ACTCTCTGGT GCGCGTACCG TGCTGCATAT TCAGGATTAC
GAAGTAGATG CCATGCTGGG GCTGGGCCTT GCCGGAAAAG GCAAAGGCGG CAAAGTGGCA
CAGCTGGCGA CGGCGTTCGA ACGTAGCGGA CTGCATAACG TCGATAACGT CTCCACGATT
TCGCGTTCGA TGATGAATAA AGCCATCGAT AAAGGCGTGG CGGCGGAAAA CGTCATCTTC
TTCCCCAACT GGTCGGAAAT TGCCCGTTTT CAGCATATTG CAGATGCCGA TGTTGATGCC
CTTCGTAACC AGCTTGGCCT GCCGGATAAC AAAAAAATCA TTCTTTACTC CGGCAATATT
GGTGAAAAGC AGGGGCTGGA AAACGTTATT GAAGCTGCCG ATCTCCTGCG CGATGAACCG
CTGATTTTTG CCATTGTCGG GCAGGGCGGC GGCAAAGCGC GGCTGGAAAA AATGGCGCAA
CAGCGTGGTC TGCGCAACAT GCAATTTTTT CCGCTGCAAT CGTATGACGC TTTACCCGCA
CTGCTGAAGA TGGGCGATTG CCATCTGGTG GTGCAAAAAC GCGGCGCGGC AGATGCCGTA
TTGCCGTCGA AACTGACCAA TATTCTGGCG GTAGGCGGTA ACGCGGTGAT TACTGCAGAA
GCCCACACAG AACTGGGACA GCTTTGCGAA ACCTTTCCGG GCATTGCGGT TTGCGTTGAA
CCGGAATCGG TCGAGGCGCT GGTGGCGGGG ATCCGTCAGG CGCTCCTGCT GCCCAAACAC
AACACGGTGG CACGTGAATA TGCCGAACGC ACACTCGATA AAGAGAACGT GTTACGTCAA
TTTATAAATG ATATTCGGGG ATAA
 
Protein sequence
MKILVYGINY SPELTGIGKY TGEMVEWLAA QGHEVRVITA PPYYPQWQVG ENYSAWRYKR 
EEGAATVWRC PLYVPKQPST LKRLLHLGSF AVSSFFPLMA QRRWKPDRII GVVPTLFCTP
GMRLLAKLSG ARTVLHIQDY EVDAMLGLGL AGKGKGGKVA QLATAFERSG LHNVDNVSTI
SRSMMNKAID KGVAAENVIF FPNWSEIARF QHIADADVDA LRNQLGLPDN KKIILYSGNI
GEKQGLENVI EAADLLRDEP LIFAIVGQGG GKARLEKMAQ QRGLRNMQFF PLQSYDALPA
LLKMGDCHLV VQKRGAADAV LPSKLTNILA VGGNAVITAE AHTELGQLCE TFPGIAVCVE
PESVEALVAG IRQALLLPKH NTVAREYAER TLDKENVLRQ FINDIRG