Gene EcolC_3179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3179 
Symbol 
ID6066580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3484858 
End bp3486069 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content50% 
IMG OID641602595 
Productglycosyl transferase group 1 
Protein accessionYP_001726129 
Protein GI170021175 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAT TACTAGTTAA CAAGTTTTTC TTTATTAAAG GAGGTGCGGA AACCGTTTAT 
TTTCAGGAGA GAAACTGGCT GAAAGCAGCG GGTGTCGACG TGGTGGATTT CTCCATGCTG
CATGAGAAAA ACTTTCCGTC CGAGTATGCC GATACTTTTG TCCGTAATGT TGATTACCAC
AAAGAAAGCA CCCTTCTGGG CGAGGTGAAA ACGGCGAGTA ATTTTATTCA CAATGCAGAG
GCTTGCCGAA AAATGCGGAC ATTATTGCAG CGTGAACGCC CAGACATTGT GCACTTTCAT
AATATCTATC ATCAATTAAC CCCGGCCCTC ATTAAGGTGG CCCGCAATTT TGGTTGTAAA
ACGGTTCTCA CCGCTCACGA CTATAAAATC GCCTGTCCGG CCTACTCCAT GCTGAGGGAC
GGTAAGGTGT GCGATGCATG CCTGACCGGG ACGGTATTCA ACGCTTTTCG TTACCGCTGC
CAGGAAGGTT CTGCGACGAA GAGTTTACTG CTTTCTTTGG AAGCGACCTG GCAGTCGATT
GCCAGAAATT ATCACATGCT CGATGTGATA ATTTCGCCGA GTGAATTCCT GAAAGGGATA
TTAAGGCGCA AATTGCCGCA TTCGCGGATT GATGTGATCG TCAACGGTGT CGATGACGAC
CCTGCGACAG ACAAGACCGC CGATAAAGGC TACCTGCTGT ATGTTGGCCG GCTCAGCCGT
GAAAAAGGGG TCGCCACATT GCCTCTGGCA CATCAGAAAA TGCGCAACCG TGCGCCGTTA
AAGGTGGTGG GCCATGGTCC ACTCTACGAC GAGCTGGTGG CCAACTACCC GGATGTCGAA
TTTTTAGGCT ACGTACAGCA GGGGGAGGCG CTTAATACGC TGATTAAAGA GGCGCGCGCG
GTGATCCTTC CTTCCGAATG CTATGAGAAC TGCTCCATGT CGGTGCTGGA GGCAATGTCC
TTCGCTAAAC CGGTCATTGG TTCGCGGATT GGGGGTATTC CGGAACAAAT TCGAGACGGT
ATCGACGGAG TTCTGTTTGA ACCCGGGAAT GTGCAGGATC TCGCCAATGC AATGGATTAC
ATGATTGATT CCCCGGAAAA GGCCCGCGTC ATGGGGCTAT CCGCTCGCGA ACGTCTGCGC
GAAAAATATA CGCTTCAAAA GCATATGGAA ACGCTAACTG CGTTGTATAA GGAAATTCTG
AGCTGGTCCT GA
 
Protein sequence
MKILLVNKFF FIKGGAETVY FQERNWLKAA GVDVVDFSML HEKNFPSEYA DTFVRNVDYH 
KESTLLGEVK TASNFIHNAE ACRKMRTLLQ RERPDIVHFH NIYHQLTPAL IKVARNFGCK
TVLTAHDYKI ACPAYSMLRD GKVCDACLTG TVFNAFRYRC QEGSATKSLL LSLEATWQSI
ARNYHMLDVI ISPSEFLKGI LRRKLPHSRI DVIVNGVDDD PATDKTADKG YLLYVGRLSR
EKGVATLPLA HQKMRNRAPL KVVGHGPLYD ELVANYPDVE FLGYVQQGEA LNTLIKEARA
VILPSECYEN CSMSVLEAMS FAKPVIGSRI GGIPEQIRDG IDGVLFEPGN VQDLANAMDY
MIDSPEKARV MGLSARERLR EKYTLQKHME TLTALYKEIL SWS