Gene EcolC_2574 details


Gene Information

Locus tag: EcolC_2574
Symbol:
ID: 6065010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2821972
End bp: 2823210
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 47%
IMG OID: 641601981
Product: N-glycosyltransferase
Protein accession: YP_001725532
Protein GI: 170020578
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
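The coordinate and length fields above are mutually consistent, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2821972
END_BP = 2823210


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1239 412
```

Both results match the listed Gene Length (1239 bp) and Protein Length (412 aa).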


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0472996
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAGGT TCGTTTTCTT CTGGCCGTTT TTTATGTCCA TTATGTGGAT TGTTGGCGGC 
GTCTATTTCT GGGTCTATCG TGAACGCCAC TGGCCGTGGG GAGAAAACGC ACCAGCTCCC
CAGTTGAAAG ATAATCCGTC TATCTCCATT ATCATTCCCT GTTTTAATGA GGAGAAAAAC
GTTGAGGAAA CCATACACGC CGCTTTAGCA CAGCGTTATG AGAACATTGA AGTTATTGCC
GTAAATGACG GTTCAACAGA TAAAACCCGT GCCATCCTGG ATCGCATGGC TGCACAAATT
CCCCATTTGC GGGTCATTCA TCTGGCGCAA AACCAGGGGA AAGCCATTGC GCTTAAAACC
GGAGCTGCCG CGGCGAAAAG TGAATATCTG GTGTGCATTG ATGGCGATGC GTTATTAGAC
CGCGATGCGG CGGCATATAT TGTGGAACCG ATGTTGTACA ACCCGCGTGT GGGTGCCGTA
ACCGGTAATC CTCGTATTCG AACACGTTCT ACCCTGGTGG GTAAAATTCA GGTTGGCGAG
TATTCCTCAA TTATTGGTTT GATCAAGCGA ACCCAGCGTA TCTATGGAAA CGTATTTACC
GTTTCCGGTG TTATTGCCGC ATTTCGTCGC AGCGCCCTGG CAGAAGTGGG TTACTGGAGT
GACGATATGA TCACCGAAGA TATTGATATT AGCTGGAAGC TGCAGTTGAA TCAGTGGACG
ATTTTTTACG AGCCACGGGC ACTGTGCTGG ATATTAATGC CTGAAACGTT AAAAGGGCTG
TGGAAACAGC GCCTGCGCTG GGCTCAGGGC GGTGCAGAAG TATTCCTCAA AAATATGACA
AGGTTGTGGC GCAAAGAAAA CTTTCGAATG TGGCCGCTGT TTTTTGAATA CTGCCTGACG
ACAATATGGG CCTTCACCTG CCTGGTCGGT TTCATTATTT ACGCAGTCCA ACTTGCCGGT
GTACCGTTAA ATATTGAATT GACACATATC GCTGCGACAC ATACTGCCGG AATATTATTG
TGTACGTTAT GTTTACTGCA ATTTATTGTC AGCCTGATGA TCGAGAATCG CTATGAGCAT
AATCTGACTT CATCGCTTTT CTGGATTATT TGGTTCCCGG TTATTTTCTG GATGCTGAGC
CTGGCAACGA CATTGGTATC ATTTACACGA GTCATGTTGA TGCCTAAAAA GCAACGCGCC
CGTTGGGTAA GTCCCGATCG CGGGATTCTG AGAGGTTAA
 
Protein sequence
MMRFVFFWPF FMSIMWIVGG VYFWVYRERH WPWGENAPAP QLKDNPSISI IIPCFNEEKN 
VEETIHAALA QRYENIEVIA VNDGSTDKTR AILDRMAAQI PHLRVIHLAQ NQGKAIALKT
GAAAAKSEYL VCIDGDALLD RDAAAYIVEP MLYNPRVGAV TGNPRIRTRS TLVGKIQVGE
YSSIIGLIKR TQRIYGNVFT VSGVIAAFRR SALAEVGYWS DDMITEDIDI SWKLQLNQWT
IFYEPRALCW ILMPETLKGL WKQRLRWAQG GAEVFLKNMT RLWRKENFRM WPLFFEYCLT
TIWAFTCLVG FIIYAVQLAG VPLNIELTHI AATHTAGILL CTLCLLQFIV SLMIENRYEH
NLTSSLFWII WFPVIFWMLS LATTLVSFTR VMLMPKKQRA RWVSPDRGIL RG
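The relationship between the gene sequence and the protein sequence can be sketched in Python. This is a minimal illustration, not the pipeline that produced this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares. Table 11's alternative start codons are not handled here; this gene starts with ATG, so that does not matter for this example.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks and the final nine bases of the gene sequence above.
first_blocks = clean("ATGATGAGGT TCGTTTTCTT CTGGCCGTTT")
print(translate(first_blocks))  # prints: MMRFVFFWPF
print(translate("AGAGGTTAA"))   # prints: RG
```

The two fragments reproduce the first ten and last two residues of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared against the listed GC content of 47%.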