Gene EcolC_1608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1608 
Symbol 
ID6067678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1788690 
End bp1789832 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content29% 
IMG OID641601024 
Productglycosyl transferase group 1 
Protein accessionYP_001724594 
Protein GI170019640 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.372835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.872915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAAAG TTTTAATAGT GTTCCATGAC GGTGATAAAA AAAGTGGTGC AACAGCCTCA 
ATGTTAGATC TTTTAACAGA GCTGATCAAA TGTAAAGATT TGAAAATAAT ATGCTTAATT
CCAAAGTATG GGTCTTTATA TGATGAACTA AAAAGATTAA GAATTAAAAC TTATGTTATT
AAATATTATA GTGGGAGATA TAGTCTTGGG AGTTATAAAG GTACTGTTTG GAATATCATA
AAAACCTTAA TAAAACAGTT AATTACCTTT TTATCTTTTA TGAAAGTTAA AAATAAATTT
ATGAATATCG ATGTCGTTTA TACGAATACA TCTGATAACT ATATGGGATT GTTACTCTCA
ATTTTTCTTA AAAAAAAGAA TATTTTTCAT ATTAGGGAAT TCGGTTTAGA AGATCAACAC
CAAAAGCATA TAATTACAGA TCATTTATAT TACTCGTTGG TAAACAAATA TGCTAATGAA
GTTATAGTTA TATCAGAAGC GTTAAAAAAT AAAATAATAA AATACATTAC AGGAAATAAT
CTTAATCTAA TATATGATGA TGTCCATATC CAAAACAAAC CAATGCTAAA TTATGCTAAT
TCGGCTAGGC TGCGGAAGTT CATTATTATT GGCACATTAT GTGAGGGGAA AGGTCAAAAA
ATAGCCATAG AGGCTATGCA TAATCTGATT CGTGAAGGAT ATTTATGTCA CTTAAAAATA
ATTGGAAATA ATAGAGTTCC ATATGCAAGT TATCTTAATA AAATTGTTGC TGATTATAAT
TTGAGTGATT ATGTAGAGTT TATGGGATTT AGGCATGATC TTGATCAAAT AAGGCTTGAC
AACGATGTTT GCTTAATTCC CTCTCTTTCT GAAGCTTTTG GTAGAGTTAC CATTGAATCA
ATGGCTGCAG GTATGATAGT TGTTGCTAGT GATTCTGGTG CTAGTAAAGA GATCATCAAT
GATGGTATAA ACGGTTTTTT GTTTTCTTCA GGTTCGGTAA GTGATCTTAC TAGCGTACTT
AAAAAAATCC TTGATGTAGA GTCTAATAAT TTAGAATGTA TAAGGAAAAG GGCTTTGGTT
GACTCACAAA AATATACATC AGGACATGCA GCTTCATCAA TTTATAATTT GATCATTAAC
TGA
 
Protein sequence
MTKVLIVFHD GDKKSGATAS MLDLLTELIK CKDLKIICLI PKYGSLYDEL KRLRIKTYVI 
KYYSGRYSLG SYKGTVWNII KTLIKQLITF LSFMKVKNKF MNIDVVYTNT SDNYMGLLLS
IFLKKKNIFH IREFGLEDQH QKHIITDHLY YSLVNKYANE VIVISEALKN KIIKYITGNN
LNLIYDDVHI QNKPMLNYAN SARLRKFIII GTLCEGKGQK IAIEAMHNLI REGYLCHLKI
IGNNRVPYAS YLNKIVADYN LSDYVEFMGF RHDLDQIRLD NDVCLIPSLS EAFGRVTIES
MAAGMIVVAS DSGASKEIIN DGINGFLFSS GSVSDLTSVL KKILDVESNN LECIRKRALV
DSQKYTSGHA ASSIYNLIIN