Gene EcolC_1597 details

Gene Information

Locus tag: EcolC_1597
Symbol:
ID: 6065495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1776302
End bp: 1777522
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 55%
IMG OID: 641601013
Product: glycosyl transferase group 1
Protein accession: YP_001724583
Protein GI: 170019629
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
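
The coordinates and lengths listed above agree with each other, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1776302
end_bp = 1777522

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1221 406
```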


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.308201
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTCG GCTTCTTTTT ACTGAAATTT CCGCTGTCGT CAGAAACCTT CGTTCTTAAC 
CAAATCACTG CGTTTATTGA TATGGGCTTT GAGGTGGAGA TTGTCGCGTT GCAAAAAGGC
GATACAGAGA ACACCCACGC GGCATGGACG AAATACAACC TTGCCGCCAG AACCCGCTGG
TTACAGGACG AACCTACGGG CAAAGTGGCG AAGCTGCGCC ACCGCGCCAG CCAGACCTTA
CGCGGCATTC ATCGTAAAAA TACCTGGCAG GCGCTCAACC TCAAGCGCTA TGGCGCAGAG
TCGCGGAACC TGATTTTGTC TGCCATTTGC GGTCAGGTCG CAACACCATT TCATACCGAT
GTCTTTATCG CTCATTTTGG TCCTGCGGGG GTAACCGCAG CAAAACTACG CGAACTGGGT
GTCATTCGCG GCAAAATTGC CACCATCTTC CACGGTATTG ATATCTCCAG TCGGGAAGTG
CTCAACCACT ACACTCCCGA ATATCAACAA CTGTTTCGAC GTGGCGACCT GATGTTACCG
ATAAGCGATC TGTGGGCCGG AAGGCTGCAA AAAATGGGCT GTCCGAGGGA AAAAATCGCC
GTATCGCGCA TGGGCGTGGA CATGACGCGT TTTAGCCCGC GTCCGGTGAA AGCGCCCGCA
ACGCCGCTGG AAATCATTTC CGTCGCACGC TTAACCGAGA AAAAAGGCCT GCATGTGGCG
ATCGAAGCCT GCCGGCAGTT GAAAGAGCAG GGCGTGGCAT TTCGCTATCG CATCCTCGGC
ATTGGCCCGT GGGAACGACG CCTGCGCACC CTCATCGAAC AATATCAACT GGAAGATGTG
GTGGAGATGC CGGGCTTTAA ACCGAGCCAT GAAGTGAAAG CGATGCTCGA CGACGCGGAT
GTCTTCCTGT TGCCATCGGT TACAGGTGCG GATGGTGATA TGGAAGGCAT TCCGGTGGCG
CTAATGGAAG CGATGGCGGT CGGCATTCCG GTGGTTTCTA CTCTGCATAG CGGAATACCG
GAACTGGTGG AGGCTGACAA ATCCGGCTGG CTGGTGCCTG AGAACGATGC TCGCGCACTG
GCGCAACGAC TGGCGGCGTT TAGCCAACTG GACACCGACG AATTGGCTCC GGTCGTCAAA
CGCGCGCGCG AAAAAGTTGA ACACGATTTT AACCAGCAGG TGATCAATCG AGAACTCGCC
AGCTTGCTGC AGGCTTTATA G
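
A GC percentage like the one reported in the Gene Information section can be computed from a sequence written in this layout. The sketch below is one possible way to do it and is not necessarily how the database derived its figure. The function name `gc_content` is hypothetical, and the example input is only the first two 10-base groups of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used in the listing) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence above, for illustration only.
print(gc_content("ATGAAGGTCG GCTTCTTTTT"))  # 40.0
```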
 
Protein sequence
MKVGFFLLKF PLSSETFVLN QITAFIDMGF EVEIVALQKG DTENTHAAWT KYNLAARTRW 
LQDEPTGKVA KLRHRASQTL RGIHRKNTWQ ALNLKRYGAE SRNLILSAIC GQVATPFHTD
VFIAHFGPAG VTAAKLRELG VIRGKIATIF HGIDISSREV LNHYTPEYQQ LFRRGDLMLP
ISDLWAGRLQ KMGCPREKIA VSRMGVDMTR FSPRPVKAPA TPLEIISVAR LTEKKGLHVA
IEACRQLKEQ GVAFRYRILG IGPWERRLRT LIEQYQLEDV VEMPGFKPSH EVKAMLDDAD
VFLLPSVTGA DGDMEGIPVA LMEAMAVGIP VVSTLHSGIP ELVEADKSGW LVPENDARAL
AQRLAAFSQL DTDELAPVVK RAREKVEHDF NQQVINRELA SLLQAL
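
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this in Python under stated assumptions: it uses the standard codon assignments, which translation table 11 shares with table 1, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAAGGTCG GCTTCTTTTT ACTGAAATTT"))  # MKVGFFLLKF
```

The same function applied to the last 12 bases of the gene (`CAGGCTTTAT AG`) yields `QAL`, matching the end of the protein sequence.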