Gene EcDH1_1620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1620 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1768471 
End bp1769718 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content35% 
IMG OID 
Productpolysaccharide biosynthesis protein 
Protein accessionACX39285 
Protein GI260448863 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.0681626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACGA ATAAATTATC TTTAAGAAGA AACGTTATAT ATCTGGCTGT CGTTCAAGGT 
AGCAATTATC TTTTACCATT GCTTACATTT CCATATCTTG TAAGAACACT TGGTCCTGAA
AATTTCGGTA TATTCGGTTT TTGCCAAGCG ACTATGCTAT ATATGATAAT GTTTGTTGAA
TATGGTTTCA ATCTCACAGC AACTCAGAGT ATTGCCAAAG CAGCAGATAG TAAAGATAAA
GTAACGTCTA TTTTTTGGGC GGTGATATTT TCAAAAATAG TTCTTATCGT CATTACATTG
ATTTTCTTAA CGTCGATGAC CTTGCTTGTT CCTGAATATA ACAAGCATGC CGTAATTATA
TGGTCGTTTG TTCCTGCATT AGTCGGGAAT TTAATCTACC CTATCTGGCT GTTTCAGGGA
AAAGAAAAAA TGAAATGGCT GACTTTAAGT AGTATTTTAT CCCGCTTGGC TATTATCCCT
CTAACATTTA TTTTTGTGAA CACAAAGTCA GATATAGCAA TTGCCGGTTT TATTCAGTCA
AGTGCAAATC TGGTTGCTGG AATTATTGCA CTAGCTATCG TTGTTCATGA AGGTTGGATT
GGTAAAGTTA CGCTATCATT ACATAATGTG CGTCGATCTT TAGCAGACGG TTTTCATGTT
TTTATTTCCA CATCTGCTAT TAGTTTATAT TCTACGGGAA TAGTTATTAT CCTGGGATTT
ATATCTGGAC CAACGTCCGT AGGGAATTTT AATGCGGCCA ATACTATAAG AAACGCGCTT
CAAGGGCTAT TAAATCCTAT CACCCAAGCA ATATACCCAA GAATATCAAG TACGCTTGTT
CTTAATCGTG TGAAGGGTGT GATTTTAATT AAAAAATCAT TGACCTGCTT GAGTTTGATT
GGTGGTGCTT TTTCATTAAT TCTGCTCTTG GGTGCATCTA TACTAGTAAA AATAAGTATA
GGGCCGGGAT ATGATAATGC AGTGATTGTG CTAATGATTA TATCGCCTCT GCCTTTTCTT
ATTTCATTAA GTAATGTCTA TGGCATTCAA GTTATGCTGA CCCATAATTA TAAGAAAGAA
TTCAGTAAGA TTTTAATCGC TGCGGGTTTG TTGAGTTTGT TGTTGATTTT TCCGCTAACA
ACTCTTTTTA AAGAGATTGG TGCAGCAATA ACATTGCTTG CAACAGAGTG CTTAGTTACG
TCACTCATGC TGATGTTCGT AAGAAATAAT AAATTACTGG TTTGCTGA
 
Protein sequence
MNTNKLSLRR NVIYLAVVQG SNYLLPLLTF PYLVRTLGPE NFGIFGFCQA TMLYMIMFVE 
YGFNLTATQS IAKAADSKDK VTSIFWAVIF SKIVLIVITL IFLTSMTLLV PEYNKHAVII
WSFVPALVGN LIYPIWLFQG KEKMKWLTLS SILSRLAIIP LTFIFVNTKS DIAIAGFIQS
SANLVAGIIA LAIVVHEGWI GKVTLSLHNV RRSLADGFHV FISTSAISLY STGIVIILGF
ISGPTSVGNF NAANTIRNAL QGLLNPITQA IYPRISSTLV LNRVKGVILI KKSLTCLSLI
GGAFSLILLL GASILVKISI GPGYDNAVIV LMIISPLPFL ISLSNVYGIQ VMLTHNYKKE
FSKILIAAGL LSLLLIFPLT TLFKEIGAAI TLLATECLVT SLMLMFVRNN KLLVC