Gene Information |
Locus tag | EcDH1_3774 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4072590 |
End bp | 4073933 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | protein of unknown function DUF21 |
Protein accession | ACX41379 |
Protein GI | 260450957 |
COG category | |
COG ID | |
TIGRFAM ID | |
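The coordinates and lengths listed in this record are mutually consistent, and a short check makes that explicit. The sketch below is in Python. It assumes inclusive 1-based coordinates and a complete CDS ending in a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length_bp(4072590, 4073933)
print(length, protein_length_aa(length))
```

Under these assumptions the results can be compared directly with the Gene Length and Protein Length rows.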
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.767387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTTAAACA GTATTTTAGT CATACTCTGC TTGATCGCTG TAAGTGCGTT CTTCTCGATG TCCGAGATCT CGCTTGCCGC CTCACGCAAA ATCAAACTTA AACTGCTGGC TGATGAAGGC AATATAAATG CCCAACGCGT TCTGAATATG CAGGAAAATC CCGGCATGTT CTTTACCGTG GTCCAAATCG GTCTGAACGC AGTGGCGATT CTCGGCGGTA TCGTCGGTGA TGCGGCATTT TCTCCAGCTT TTCACAGCCT GTTCTCCCGC TATATGTCGG CAGAGCTCTC TGAGCAACTG AGCTTTATTC TCTCTTTCTC GTTAGTGACT GGCATGTTTA TCCTGTTTGC GGATTTAACC CCGAAACGCA TCGGTATGAT TGCGCCAGAA GCTGTGGCTT TGCGTATCAT CAACCCGATG CGCTTCTGCC TGTACGTTTG CACCCCGCTG GTGTGGTTCT TCAACGGCCT GGCGAACATA ATCTTCCGTA TTTTCAAACT GCCAATGGTA CGTAAAGATG ACATCACTTC TGATGACATC TACGCGGTAG TGGAAGCCGG TGCGCTGGCG GGCGTGTTAC GTAAACAGGA ACACGAGCTG ATTGAAAACG TCTTTGAGCT GGAATCCCGT ACCGTTCCGT CTTCAATGAC ACCGCGTGAA AACGTGATTT GGTTTGATCT CCACGAAGAT GAGCAAAGCC TGAAGAATAA GGTGGCGGAA CATCCGCACT CTAAGTTCCT CGTCTGTAAT GAAGATATTG ACCACATCAT CGGTTATGTC GATTCTAAAG ACCTGCTGAA CCGCGTGCTG GCTAACCAAA GCCTGGCACT GAACAGCGGC GTACAAATTC GCAACACGCT GATTGTGCCG GATACGTTAA CCCTTTCAGA GGCGTTGGAA AGTTTTAAAA CCGCAGGTGA AGACTTCGCG GTGATCATGA ACGAGTACGC GCTGGTGGTG GGGATCATCA CCCTCAATGA CGTGATGACC ACGCTGATGG GCGATCTGGT CGGTCAGGGG CTGGAAGAGC AGATTGTCGC CCGTGATGAG AACTCATGGC TGATTGACGG CGGCACCCCA ATTGACGACG TCATGCGCGT GCTGGATATT GACGAGTTCC CGCAGTCGGG CAACTACGAA ACCATCGGCG GCTTTATGAT GTTTATGCTG CGTAAGATCC CGAAACGCAC CGATTCGGTG AAATTCGCCG GCTACAAATT TGAAGTGGTG GATATCGATA ACTACCGCAT CGACCAGCTG CTGGTGACCC GGATCGACAG CAAGGCCACC GCCCTTTCGC CAAAACTGCC TGACGCTAAA GATAAAGAAG AAAGCGTCGC GTAA
Protein sequence | MLNSILVILC LIAVSAFFSM SEISLAASRK IKLKLLADEG NINAQRVLNM QENPGMFFTV VQIGLNAVAI LGGIVGDAAF SPAFHSLFSR YMSAELSEQL SFILSFSLVT GMFILFADLT PKRIGMIAPE AVALRIINPM RFCLYVCTPL VWFFNGLANI IFRIFKLPMV RKDDITSDDI YAVVEAGALA GVLRKQEHEL IENVFELESR TVPSSMTPRE NVIWFDLHED EQSLKNKVAE HPHSKFLVCN EDIDHIIGYV DSKDLLNRVL ANQSLALNSG VQIRNTLIVP DTLTLSEALE SFKTAGEDFA VIMNEYALVV GIITLNDVMT TLMGDLVGQG LEEQIVARDE NSWLIDGGTP IDDVMRVLDI DEFPQSGNYE TIGGFMMFML RKIPKRTDSV KFAGYKFEVV DIDNYRIDQL LVTRIDSKAT ALSPKLPDAK DKEESVA
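The sequences in this record are displayed in space-separated blocks of ten characters. The following is a minimal Python sketch of how such a field might be cleaned before analysis. The function names are hypothetical. The GC value here is a plain G+C fraction, which is an assumption and may not match how the database derives its rounded GC content figure.

```python
def clean_sequence(field: str) -> str:
    """Remove the display whitespace between blocks and normalise to upper case."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two display blocks of the gene sequence, used only as a small demo input.
first_blocks = "ATGTTAAACA GTATTTTAGT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```

Passing the full Gene sequence field through `clean_sequence` would give a string whose length can be checked against the Gene Length row.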