Gene EcolC_3180 details

Gene Information

Locus tag: EcolC_3180
Symbol:
ID: 6066582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3486071
End bp: 3487183
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 49%
IMG OID: 641602596
Product: glycosyl transferase group 1
Protein accession: YP_001726130
Protein GI: 170021176
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
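
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the stop codon is part of the gene."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3486071, 3487183)
print(length)                  # 1113
print(protein_length(length))  # 370
```

Under those assumptions the values agree with the Gene Length (1113 bp) and Protein Length (370 aa) fields of this record.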


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACAAA AAATCACGGT TCTCGGTACA CGCGGGATAC CGGATGTCCA GGGTGGTGTG 
GAAACACACT GCCAGAATCT TTACCCGGCT ATAAAAAAGC AGTTTGATAT GGATATCTGC
GTTATCGCTC GCTCTCCCTA CGTCAGCTAT AAACAGACGT ATTATAAAAA TGTTGAAACA
TACTCTCTAT GGGCTCCGAA GAAGCGATCG CTGGAAGCGA TTGTCCATTC CTTCTTAGCC
ACGTTAAGAA CCTGTTTCGA TGGTTCTGAT ATTGTGCACG TTCATGCCAT CGGACCCGGA
CTTCTGGTGC CACTGCTGCG TGTGCTAGGA AAGAAGGTGG TGTTTACCCA CCATGGTCCA
GATTACGATC GCCAGAAATG GGGGCGTCTG GCTAAAAGGG TGCTGCAACT GGGAGAGAAA
GTGGCTGTTA AGTATGCCAA TGAAGTGATC GTTATTTCAG AGGTGATTAA TCAACTGATA
CGCACAAAAC ACTGTCGTGA TGATGCACAC TTGATCTACA ACGGCGTCAA TTTACCGTTG
CCGTTAAAGG AAGAGACTGT GCGCACGGTG TTGGGACGTT ACGCGCTGCA GCCGCAAAAT
TACCTGGTTG TCGTTGGGCG GTTTGTGGAA GAAAAAGGTA TGCATGATGC GATTGCTGCC
CACCGCAAAC TGGGGCTCAC GATGCCGCTG GTATTGGTGG GTGATGCCGA TCATCCCACG
GAATATAGCG TCCGCCTTAA AAAGATGGCT GCAGATACGC CGAACGTCAT CATGACGGGG
TTCCTCAAAG GTGAGGAATT GCAGGCTATC TTTTCTCAGG CGCGGCTGTT TTTGATGCCT
TCATACCATG AAGGGTTACC GATAGCGCTT CTCGAAGCGA TGGCCTATTC ACTGCCCGCC
GTGGTCAGTG ATATTCCTGC GAATCTTGAA GTAAAATTGC CGCCAGAATC GTATTTCGAG
GTCGGCAACG TCGACGCTCT GGCGCAAAAA ATAGCAGCGT TGGTTTCCTC ACAGCGGATT
GACTACAGCG CCTGGCTGAA AAATTACGAC TGGCAGGTGA TCGCGAGAAA AACCGCCAGT
GTCTACCATT CCTTAGCAAA TAAAAAAGGT TAA
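
The GC content field can in principle be recomputed from the gene sequence. A minimal sketch follows; `gc_percent` is a hypothetical helper, and it assumes the sequence is pasted as plain text with the space-separated 10-base blocks used in this record. How the database rounds the value is not stated here, so no result for this gene is asserted.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input, not the gene itself: 6 of 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```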
 
Protein sequence
MSQKITVLGT RGIPDVQGGV ETHCQNLYPA IKKQFDMDIC VIARSPYVSY KQTYYKNVET 
YSLWAPKKRS LEAIVHSFLA TLRTCFDGSD IVHVHAIGPG LLVPLLRVLG KKVVFTHHGP
DYDRQKWGRL AKRVLQLGEK VAVKYANEVI VISEVINQLI RTKHCRDDAH LIYNGVNLPL
PLKEETVRTV LGRYALQPQN YLVVVGRFVE EKGMHDAIAA HRKLGLTMPL VLVGDADHPT
EYSVRLKKMA ADTPNVIMTG FLKGEELQAI FSQARLFLMP SYHEGLPIAL LEAMAYSLPA
VVSDIPANLE VKLPPESYFE VGNVDALAQK IAALVSSQRI DYSAWLKNYD WQVIARKTAS
VYHSLANKKG
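
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to reproduce that by hand, not the database's own pipeline: `translate` is a hypothetical helper, and it relies on table 11 sharing the amino acid assignments of the standard code. Table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG. Ambiguous bases such as N are not handled.

```python
# Codons enumerated in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGTCACAAA AAATCACGGT TCTCGGTACA"))  # MSQKITVLGT
```

The output matches the first ten residues of the protein sequence listed above.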