Gene EcDH1_0083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0083 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp85556 
End bp86815 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content32% 
IMG OID 
ProductO-antigen polymerase 
Protein accessionACX37781 
Protein GI260447359 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.0973021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAACAT CCTTTAAACT TCATTCATTG AAACCTTACA CTCTGAAATC ATCAATGATT 
TTAGAGATAA TAACTTATAT ATTATGTTTT TTTTCAATGA TAATTGCATT CGTCGATAAT
ACTTTCAGCA TAAAAATATA TAATATCACT GCTATAGTTT GCTTATTGTC ACTAATTTTA
CGTGGCAGAC AAGAAAATTA TAATATAAAA AACCTTATTC TTCCCCTTTC TATATTTTTA
ATAGGCTTGC TTGATTTAAT TTGGTATTCT GCGTTTAAAG TAGATAATTC GCCATTTCGT
GCTACTTACC ATAGTTATTT AAATACTGCC AAAATATTTA TATTTGGTTC TTTTATTGTT
TTCTTGACAC TAACTAGCCA GCTAAAATCA AAAAAAGAGA GTGTATTATA CACTTTGTAT
TCTCTGTCAT TTCTAATTGC TGGATATGCA ATGTATATTA ATAGCATTCA TGAAAATGAC
CGCATTTCTT TTGGTGTAGG AACGGCAACA GGAGCAGCAT ATTCAACAAT GCTAATAGGG
ATAGTTAGTG GCGTTGCGAT TCTTTATACT AAGAAAAATC ATCCTTTTTT ATTTTTATTA
AATAGTTGCG CGGTACTTTA TGTTCTGGCG CTAACACAAA CCAGAGCAAC CCTACTCCTG
TTCCCTATAA TTTGTGTTGC TGCATTAATA GCTTATTATA ATAAATCACC CAAGAAATTC
ACTTCCTCTA TTGTTCTACT AATTGCTATA TTAGCTAGCA TTGTTATTAT ATTTAATAAA
CCAATACAGA ATCGCTATAA TGAAGCATTA AATGACTTAA ACAGTTATAC CAATGCTAAT
AGTGTTACTT CCCTAGGTGC AAGACTGGCA ATGTACGAAA TTGGTTTAAA TATATTCATA
AAGTCACCTT TTTCATTTAG ATCAGCAGAG TCACGCGCTG AAAGTATGAA TTTGTTAGTT
GCAGAACACA ATAGGCTAAG AGGGGCATTG GAGTTTTCTA ACGTACATCT ACATAATGAG
ATAATTGAAG CAGGGTCACT GAAAGGTCTG ATGGGAATTT TTTCCACACT TTTCCTCTAT
TTTTCACTAT TTTATATAGC ATATAAAAAA CGAGCTTTGG GTTTGTTGAT ATTAACGCTT
GGCATTGTGG GGATTGGACT CAGTGATGTG ATCATATGGG CACGCAGCAT TCCAATTATC
ATTATATCCG CTATAGTCCT CTTACTCGTC ATTAATAATC GTAACAATAC AATTAATTAA
 
Protein sequence
MLTSFKLHSL KPYTLKSSMI LEIITYILCF FSMIIAFVDN TFSIKIYNIT AIVCLLSLIL 
RGRQENYNIK NLILPLSIFL IGLLDLIWYS AFKVDNSPFR ATYHSYLNTA KIFIFGSFIV
FLTLTSQLKS KKESVLYTLY SLSFLIAGYA MYINSIHEND RISFGVGTAT GAAYSTMLIG
IVSGVAILYT KKNHPFLFLL NSCAVLYVLA LTQTRATLLL FPIICVAALI AYYNKSPKKF
TSSIVLLIAI LASIVIIFNK PIQNRYNEAL NDLNSYTNAN SVTSLGARLA MYEIGLNIFI
KSPFSFRSAE SRAESMNLLV AEHNRLRGAL EFSNVHLHNE IIEAGSLKGL MGIFSTLFLY
FSLFYIAYKK RALGLLILTL GIVGIGLSDV IIWARSIPII IISAIVLLLV INNRNNTIN