Gene EcolC_3265 details


Gene Information

Locus tag: EcolC_3265
Symbol:
ID: 6066857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3574542
End bp: 3575738
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 50%
IMG OID: 641602680
Product: glycosyl transferase family protein
Protein accession: YP_001726214
Protein GI: 170021260
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
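
The coordinate and length fields listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Protein Length excludes the terminal stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3574542
end_bp = 3575738

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1197 398
```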


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCT GGATATTTAT CTGTATGTCC ATAGCAATGT TGCTATGGTT TTTAAGTACG 
CTAAGACGTA AACCCAGTCA AAAGAAAGGC TGTATTGACG CCATTATACC TGCGTATAAC
GAAGGCCCGT GTCTGGCGCA GTCACTGGAT AATCTACTGC GTAACCCTTA TTTTTGCCGG
GTAATTTGCG TTAACGACGG CTCCACGGAC AATACCGAAG CGGTCATGGC GGAAGTCAAA
CGCAAATGGG GCGACCGCTT TGTTGCCGTC ACGCAAAAAA ATACCGGTAA AGGTGGTGCG
CTGATGAATG GCCTCAATTA TGCCACCTGC GACCAGGTTT TTTTAAGTGA TGCCGACACC
TATGTTCCGC CCGATCAAGA CGGAATGGGC TATATGCTGG CAGAAATTGA GCGCGGTGCT
GATGCCGTAG GCGGCATTCC CTCTACTGCG TTGAAAGGCG CGGGGCTGTT ACCGCACATC
CGCGCGACCG TAAAGTTGCC GATGATTGTT ATGAAGCGCA CGCTACAGCA GCTCCTGGGC
GGCGCACCGT TTATTATCAG CGGTGCCTGC GGAATGTTCC GTACTGATGT ATTGCGTAAG
TTCGGTTTCT CGGATCGTAC TAAAGTCGAA GACCTTGATC TCACCTGGAC ATTAGTGGCA
AACGGCTACC GTATTCGGCA GGCGAATCGC TGCATCGTAT ACCCACAGGA ATGCAACAGC
CCGCGTGAGG AGTGGCGTCG CTGGCGGCGT TGGATTGTGG GATACGCGGT CTGTATGCGC
CTGCATAAAA GACTTTTATT TAGCCGCTTC GGTATCTTCA GTATATTTCC TATGCTGTTG
GTTGTGCTTT ATGGCGTTGG GATTTATCTC ACTACCTGGT TTAATGAATT CATCACCACC
GGGCCGCATG GTGTGGTGTT GGCAATGTTT CCGCTTATCT GGATCGGCGT AGTTTGTGTT
ATTGGTGCTT TTAGCGCCTG GTTTCATCGT TGCTGGTTGT TGGTGCCTTT AGCGCCGCTT
TCCGTTGTGT ATGTATTATT AGCTTATGCC ATCTGGATTA TTTATGGACT TATTGCCTTT
TTTACTGGAC GCGAACCTCA GCGCGACAAA CCCACCCGCT ATTCCGCACT GGTGGAAGCG
TCAACCGCTT ATTCCCAACC TTCTGTCACA GGAACTGAAA AACTATCTGA AGCTTAA
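
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. As a minimal sketch, the helper functions below (hypothetical names, not part of the record) strip the layout and compute the G+C fraction. Only the first row is used as sample input here, so the printed value will not match the whole-gene GC content field; all rows would need to be pasted in for that.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence only; paste every row for the whole-gene value.
first_row = "ATGAAAACCT GGATATTTAT CTGTATGTCC ATAGCAATGT TGCTATGGTT TTTAAGTACG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```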
 
Protein sequence
MKTWIFICMS IAMLLWFLST LRRKPSQKKG CIDAIIPAYN EGPCLAQSLD NLLRNPYFCR 
VICVNDGSTD NTEAVMAEVK RKWGDRFVAV TQKNTGKGGA LMNGLNYATC DQVFLSDADT
YVPPDQDGMG YMLAEIERGA DAVGGIPSTA LKGAGLLPHI RATVKLPMIV MKRTLQQLLG
GAPFIISGAC GMFRTDVLRK FGFSDRTKVE DLDLTWTLVA NGYRIRQANR CIVYPQECNS
PREEWRRWRR WIVGYAVCMR LHKRLLFSRF GIFSIFPMLL VVLYGVGIYL TTWFNEFITT
GPHGVVLAMF PLIWIGVVCV IGAFSAWFHR CWLLVPLAPL SVVYVLLAYA IWIIYGLIAF
FTGREPQRDK PTRYSALVEA STAYSQPSVT GTEKLSEA
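
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below is an illustration under stated assumptions, not the database's own method: it builds the standard codon table, whose amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons, which are not modelled here). The function name `translate` and the constants are hypothetical. The two inputs are the first 30 and the last 27 bases of the gene sequence.

```python
# Standard genetic code in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a blocked DNA string in frame 1, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGAAAACCT GGATATTTAT CTGTATGTCC"))  # prints: MKTWIFICMS
print(translate("GGAACTGAAA AACTATCTGA AGCTTAA"))     # prints: GTEKLSEA
```

Both outputs agree with the first and last blocks of the protein sequence listed above.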