Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_1620 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 1768471 |
End bp | 1769718 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | polysaccharide biosynthesis protein |
Protein accession | ACX39285 |
Protein GI | 260448863 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.0681626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACGA ATAAATTATC TTTAAGAAGA AACGTTATAT ATCTGGCTGT CGTTCAAGGT AGCAATTATC TTTTACCATT GCTTACATTT CCATATCTTG TAAGAACACT TGGTCCTGAA AATTTCGGTA TATTCGGTTT TTGCCAAGCG ACTATGCTAT ATATGATAAT GTTTGTTGAA TATGGTTTCA ATCTCACAGC AACTCAGAGT ATTGCCAAAG CAGCAGATAG TAAAGATAAA GTAACGTCTA TTTTTTGGGC GGTGATATTT TCAAAAATAG TTCTTATCGT CATTACATTG ATTTTCTTAA CGTCGATGAC CTTGCTTGTT CCTGAATATA ACAAGCATGC CGTAATTATA TGGTCGTTTG TTCCTGCATT AGTCGGGAAT TTAATCTACC CTATCTGGCT GTTTCAGGGA AAAGAAAAAA TGAAATGGCT GACTTTAAGT AGTATTTTAT CCCGCTTGGC TATTATCCCT CTAACATTTA TTTTTGTGAA CACAAAGTCA GATATAGCAA TTGCCGGTTT TATTCAGTCA AGTGCAAATC TGGTTGCTGG AATTATTGCA CTAGCTATCG TTGTTCATGA AGGTTGGATT GGTAAAGTTA CGCTATCATT ACATAATGTG CGTCGATCTT TAGCAGACGG TTTTCATGTT TTTATTTCCA CATCTGCTAT TAGTTTATAT TCTACGGGAA TAGTTATTAT CCTGGGATTT ATATCTGGAC CAACGTCCGT AGGGAATTTT AATGCGGCCA ATACTATAAG AAACGCGCTT CAAGGGCTAT TAAATCCTAT CACCCAAGCA ATATACCCAA GAATATCAAG TACGCTTGTT CTTAATCGTG TGAAGGGTGT GATTTTAATT AAAAAATCAT TGACCTGCTT GAGTTTGATT GGTGGTGCTT TTTCATTAAT TCTGCTCTTG GGTGCATCTA TACTAGTAAA AATAAGTATA GGGCCGGGAT ATGATAATGC AGTGATTGTG CTAATGATTA TATCGCCTCT GCCTTTTCTT ATTTCATTAA GTAATGTCTA TGGCATTCAA GTTATGCTGA CCCATAATTA TAAGAAAGAA TTCAGTAAGA TTTTAATCGC TGCGGGTTTG TTGAGTTTGT TGTTGATTTT TCCGCTAACA ACTCTTTTTA AAGAGATTGG TGCAGCAATA ACATTGCTTG CAACAGAGTG CTTAGTTACG TCACTCATGC TGATGTTCGT AAGAAATAAT AAATTACTGG TTTGCTGA
|
Protein sequence | MNTNKLSLRR NVIYLAVVQG SNYLLPLLTF PYLVRTLGPE NFGIFGFCQA TMLYMIMFVE YGFNLTATQS IAKAADSKDK VTSIFWAVIF SKIVLIVITL IFLTSMTLLV PEYNKHAVII WSFVPALVGN LIYPIWLFQG KEKMKWLTLS SILSRLAIIP LTFIFVNTKS DIAIAGFIQS SANLVAGIIA LAIVVHEGWI GKVTLSLHNV RRSLADGFHV FISTSAISLY STGIVIILGF ISGPTSVGNF NAANTIRNAL QGLLNPITQA IYPRISSTLV LNRVKGVILI KKSLTCLSLI GGAFSLILLL GASILVKISI GPGYDNAVIV LMIISPLPFL ISLSNVYGIQ VMLTHNYKKE FSKILIAAGL LSLLLIFPLT TLFKEIGAAI TLLATECLVT SLMLMFVRNN KLLVC
|
| |