Gene Information |
Locus tag | EcDH1_2821 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 3022058 |
End bp | 3023323 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | |
Product | protein of unknown function DUF1479 |
Protein accession | ACX40454 |
Protein GI | 260450032 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
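The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the last codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3022058
end_bp = 3023323

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be a stop codon,
# which does not add an amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1266 0 421
```

Under these assumptions the result matches the Gene Length (1266 bp) and Protein Length (421 aa) fields.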
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0231817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCTA CTTTTACCAG CGACACATTG CCTGCCGATC ACAAAGCAGC TATCCGTCAG ATGAAGCACG CGCTGCGGGC GCAGCTTGGC GACGTCCAGC AGATCTTTAA TCAGCTAAGC GATGACATTG CCACGCGAGT GGCTGAAATC AACGCACTCA AAGCACAGGG CGATGCCGTC TGGCCGGTGC TGTCTTATGC CGATATCAAA GCAGGTCATG TTACTGCAGA GCAGCGCGAA CAGATTAAAC GTCGCGGTTG TGCGGTGATA AAAGGCCATT TCCCCCGCGA ACAAGCGCTA GGCTGGGATC AGTCGATGCT GGACTATCTG GACCGCAACC GCTTTGACGA GGTCTACAAA GGCCCCGGCG ATAATTTCTT CGGGACGCTC AGCGCTTCAC GTCCCGAGAT TTACCCCATC TACTGGTCGC AGGCGCAAAT GCAGGCCCGC CAGAGTGAAG AAATGGCGAA TGCGCAGTCG TTTCTCAATC GTCTGTGGAC ATTTGAAAGT GATGGAAAGC AATGGTTTAA CCCGGATGTG AGCGTCATCT ACCCTGACCG TATCCGCCGC CGTCCGCCCG GAACGACCTC CAAAGGTCTT GGAGCGCATA CCGACTCCGG GGCACTGGAA CGCTGGCTGC TTCCAGCGTA TCAGCGCGTT TTCGCCAACG TCTTTAATGG CAATCTGGCG CAATATGATC CCTGGCATGC GGCACATCGT ACGGAAGTTG AAGAGTACAC GGTGGACAAC ACCACCAAAT GTTCCGTGTT TCGGACATTC CAGGGCTGGA CAGCGCTCTC TGATATGCTG CCTGGTCAGG GGCTGCTGCA CGTCGTGCCC ATTCCTGAAG CTATGGCGTA CGTACTGTTA CGTCCGCTGC TTGATGATGT GCCGGAGGAT GAACTGTGCG GCGTAGCGCC CGGAAGAGTA TTGCCGGTAT CAGAGCAATG GCATCCACTG TTGATTGAGG CGTTAACCAG CATTCCAAAA CTCGAAGCCG GAGACTCCGT CTGGTGGCAC TGCGACGTCA TCCATTCCGT TGCCCCCGTT GAAAATCAAC AAGGTTGGGG CAACGTGATG TACATTCCTG CGGCACCGAT GTGCGAGAAA AATCTTGCCT ACGCGCACAA GGTGAAGGCC GCACTGGAAA AAGGCGCATC GCCGGGCGAC TTCCCGCGCG AGGACTATGA AACAAACTGG GAAGGACGCT TTACGCTTGC CGACCTCAAC ATTCACGGTA AGCGAGCGTT GGGCATGGAT GTTTGA
|
Protein sequence | MASTFTSDTL PADHKAAIRQ MKHALRAQLG DVQQIFNQLS DDIATRVAEI NALKAQGDAV WPVLSYADIK AGHVTAEQRE QIKRRGCAVI KGHFPREQAL GWDQSMLDYL DRNRFDEVYK GPGDNFFGTL SASRPEIYPI YWSQAQMQAR QSEEMANAQS FLNRLWTFES DGKQWFNPDV SVIYPDRIRR RPPGTTSKGL GAHTDSGALE RWLLPAYQRV FANVFNGNLA QYDPWHAAHR TEVEEYTVDN TTKCSVFRTF QGWTALSDML PGQGLLHVVP IPEAMAYVLL RPLLDDVPED ELCGVAPGRV LPVSEQWHPL LIEALTSIPK LEAGDSVWWH CDVIHSVAPV ENQQGWGNVM YIPAAPMCEK NLAYAHKVKA ALEKGASPGD FPREDYETNW EGRFTLADLN IHGKRALGMD V
|
| |
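As a rough consistency check between the two sequences, the sketch below translates the first 30 and last 6 nucleotides of the gene sequence and compares them with the ends of the protein sequence. It builds the standard codon-to-amino-acid mapping by hand; translation table 11 uses the same amino acid assignments and differs only in the start codons it permits. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tooling.

```python
# Standard codon table, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First three 10-base blocks and the final 6-base block of the gene sequence.
first_30 = "ATGGCTTCTA CTTTTACCAG CGACACATTG"
last_6 = "GTTTGA"

print(translate(first_30))  # MASTFTSDTL
print(translate(last_6))    # V*  ('*' marks the stop codon)
```

The first ten residues match the start of the protein sequence, and the final `V` followed by a stop codon matches its last residue. The last block is in frame because the 1260 bases before it are a multiple of three.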