Gene EcHS_A2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2166 
Symbol 
ID5594718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2143116 
End bp2144231 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content53% 
IMG OID640921299 
Productglycosyl transferase, group 1 family protein 
Protein accessionYP_001458838 
Protein GI157161520 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones76 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGAGTTC TACACGTCTA TAAGACCTAC TATCCCGATA CCTACGGCGG TATTGAGCAG 
GTCATTTATC AGCTAAGTCA GGGCTGCGCC CGCCGGGGAA TCGCAGCCGA TGTTTTTACT
TTTAGCCCGG ACAAAGAGAC AGGTCCTGTC GCCTACGAAG ACCATCGGGT CATTTATAAT
AAGCAGCTTT TTGAAATTGC CTCCACGCCG TTTTCGTTGA AAGCGTTAAA GCGTTTTAAG
CAGATTAAAG ATGATTACGA CATCATCAAC TACCATTTTC CGTTTCCTTT CATGGATATG
TTGCATCTCT CGGCGCGGCC TGACGCAAGA ACGGTGGTGA CCTATCACTC GGATATTGTG
AAACAAAAAC GGTTAATGAA GTTGTACCAG CCGCTGCAGG AGCGATTCCT CGCCAGCGTA
GACTGCATCG TCGCCTCGTC GCCCAACTAC GTGGCCTCCA GCCAGACCCT GAAAAAATAT
CAGGATAAAA CCGTGGTGAT CCCGTTTGGT CTGGAGCAGC ATGACGTGCA GCACGATCCG
CAGCGGGTGG CGCACTGGCG GGAAACCGTC GGCGATAACT TCTTCCTCTT CGTCGGCGCT
TTCCGCTACT ACAAAGGGCT GCACATTCTG CTGGATGCCG CCGAGCGTAG CCGGCTGCCG
GTGGTGATCG TCGGGGGCGG GCCGCTGGAG GCGGAAGTGC GGCGTGAGGC GCAGCAACGC
GGGCTGAGCA ATGTGGTGTT TACCGGCATG CTCAACGACG AAGATAAGTA CATTCTCTTC
CAGCTCTGCC GGGGCGTGGT ATTCCCCTCG CATCTGCGCT CTGAGGCGTT TGGCATTACG
TTATTGGAAG GCGCACGCTT TGCAAGGCCG CTGATCTCTT GCGAGATCGG TACAGGTACC
TCTTTCATTA ACCAGGACAA AGTGAGTGGT TGCGTGATTC CGCCGAATGA TAGCCAGGCG
CTGGTGGAGG CGATGAATGA GCTCTGGAAT AACGAGGAAA CCTCCAACCG CTATGGCGAA
AACTCGCGTC GTCGTTTTGA AGAGATGTTT ACTGCCGACC ATATGATTGA CGCCTATGTC
AATCTCTACA CTACATTGCT GGAAAGCAAA TCCTGA
 
Protein sequence
MRVLHVYKTY YPDTYGGIEQ VIYQLSQGCA RRGIAADVFT FSPDKETGPV AYEDHRVIYN 
KQLFEIASTP FSLKALKRFK QIKDDYDIIN YHFPFPFMDM LHLSARPDAR TVVTYHSDIV
KQKRLMKLYQ PLQERFLASV DCIVASSPNY VASSQTLKKY QDKTVVIPFG LEQHDVQHDP
QRVAHWRETV GDNFFLFVGA FRYYKGLHIL LDAAERSRLP VVIVGGGPLE AEVRREAQQR
GLSNVVFTGM LNDEDKYILF QLCRGVVFPS HLRSEAFGIT LLEGARFARP LISCEIGTGT
SFINQDKVSG CVIPPNDSQA LVEAMNELWN NEETSNRYGE NSRRRFEEMF TADHMIDAYV
NLYTTLLESK S