Gene EcolC_3178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3178 
Symbol 
ID6066084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3483855 
End bp3484844 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content46% 
IMG OID641602594 
Productglycosyl transferase family protein 
Protein accessionYP_001726128 
Protein GI170021174 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATA AACCGTTTAT TTCAGTGAGC ATAAAAACGC TCAATGAATC TGAGTGTATT 
GAGAAAACTA TCGATAGCAT CCGTCAGCAG CTTAGAGGTT ATCCACATAA AATTATCGTT
GCCGATAGCC TGTCCACGGA TAACACCCAA CAACTGGCGG GTGACAAAGG GGTAATGGTG
GTTTCGCTCA CCGAGCCGGG CGATCGCTGT TGCGGCGTTG GACATCAGCT CGGTTATCTC
TACAGCGAAG GGGATTACAT TCTGCTGATG GACGGTGATA TGGAGCTTGA ACCCGGCTTT
ATTGATCGCG CGGTGACCTT TCTGGAAGCA AATTCAGAGT ATGCTGGCGT GGCTGGGACG
GTGGAGATGG ATGAGGCCGC AAACTATGAG TTTATCTCCC GTAAACAGCG CCTTGATACG
ATCTATCCAG TTGGGGATTG CGACCATTTA GGTGGCGGCG GGTTATACCG TCGTTCTGCA
ATCAGAAAAA TTGGTTATCT GACCAACCGC AATCTACACG CGTATGAAGA GGCTGAACTG
GGGCTTCGTT TATTGGAACA TGGCTATAAA TTGCATCGCC TGAATATTCC TTATTTCCGC
CATACGTCGT ACACGTTGCC AACATTCAAA ATGTTGCGCT ACCGCTGGCG GAGTGGTTAT
TACCAGGGTA TGGGCGAAAT ATTACGTAGC GCCTGGGGAA AACCGTATTT TTCAACCGTA
GTGAAAATGG TAAAAAGCGA AGTGGTTTTT TTACTGTATT TGATGCTGTT GGTATGTTCA
GTATTTACGC TGAATATGGA TATTGTCGGC GTTGCGCTGC TGCCGCTACT TGTATTTATC
GTACTTAAAA CTATTAAGAA TCGTTCGCTG GTTAATGGAC TATACAGCGC TATGAATATG
ACCATTCGCG CTGCTGGATT ATTGAAAGGG CTGATGCAAC CGATGCGCGA TCCGATTGTA
CCGCCTGGCA ATAAAATCAT TCATCGTTAA
 
Protein sequence
MNNKPFISVS IKTLNESECI EKTIDSIRQQ LRGYPHKIIV ADSLSTDNTQ QLAGDKGVMV 
VSLTEPGDRC CGVGHQLGYL YSEGDYILLM DGDMELEPGF IDRAVTFLEA NSEYAGVAGT
VEMDEAANYE FISRKQRLDT IYPVGDCDHL GGGGLYRRSA IRKIGYLTNR NLHAYEEAEL
GLRLLEHGYK LHRLNIPYFR HTSYTLPTFK MLRYRWRSGY YQGMGEILRS AWGKPYFSTV
VKMVKSEVVF LLYLMLLVCS VFTLNMDIVG VALLPLLVFI VLKTIKNRSL VNGLYSAMNM
TIRAAGLLKG LMQPMRDPIV PPGNKIIHR