Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_1062 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 1133702 |
End bp | 1134943 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | protein of unknown function DUF21 |
Protein accession | ACX38736 |
Protein GI | 260448314 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000000790091 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGGTCA TTTCAGCCTA TTTTTCCGGG TCCGAAACCG GAATGATGAC CCTCAACCGC TATCGTCTGC GACATATGGC GAAACAGGGT AATCGCTCGG CCAAACGCGT CGAAAAATTG CTGCGTAAGC CAGACCGCCT GATAAGCCTG GTGTTAATCG GCAATAACCT GGTCAATATT CTTGCCTCCG CGCTCGGCAC TATTGTTGGG ATGCGTTTGT ACGGCGATGC GGGCGTGGCA ATTGCGACTG GTGTGCTGAC TTTTGTCGTA CTGGTATTTG CTGAGGTATT GCCGAAAACC ATTGCCGCGC TGTACCCGGA AAAAGTCGCT TATCCGAGTA GTTTTCTGCT GGCTCCGCTG CAAATTTTGA TGATGCCGCT GGTCTGGTTG CTGAATGCTA TCACCCGTAT GCTGATGCGC ATGATGGGTA TCAAAACCGA TATCGTGGTT AGCGGCTCTT TGAGCAAAGA AGAGTTGCGC ACTATCGTGC ACGAATCACG CTCACAAATT TCCCGTCGCA ATCAGGATAT GCTGCTGTCG GTGCTCGATC TGGAAAAAAT GACCGTTGAT GACATCATGG TGCCGCGCAG TGAAATTATC GGTATTGATA TCAACGATGA CTGGAAATCG ATTCTGCGCC AACTCTCCCA CTCACCTCAC GGGCGCATCG TGCTCTACCG TGATTCGCTG GACGACGCCA TCAGTATGCT GCGAGTACGT GAAGCCTGGC GGTTGATGTC GGAGAAAAAA GAGTTCACCA AAGAAACCAT GCTGCGCGCC GCGGACGAGA TCTATTTTGT GCCGGAAGGT ACGCCGCTCA GCACGCAGTT GGTAAAGTTT CAGCGCAACA AAAAGAAAGT CGGCCTGGTC GTCAACGAGT ATGGAGACAT TCAGGGGCTG GTGACGGTTG AAGATATTCT GGAAGAGATT GTCGGCGATT TCACTACGTC GATGTCGCCA ACACTTGCCG AAGAGGTCAC GCCGCAAAAC GACGGTTCGG TGATTATCGA TGGCACCGCC AACGTGCGGG AAATCAACAA AGCCTTTAAC TGGCATCTAC CGGAAGATGA TGCCCGCACG GTTAATGGCG TCATTCTTGA GGCACTGGAA GAGATCCCTG TCGCAGGCAC CCGCGTGCGT ATTGGCGAGT ACGATATCGA TATTCTCGAC GTACAGGACA ATATGATTAA GCAGGTAAAA GTTTTTCCTG TGAAACCGCT GCGCGAGAGT GTGGCGGAGT AA
|
Protein sequence | MVVISAYFSG SETGMMTLNR YRLRHMAKQG NRSAKRVEKL LRKPDRLISL VLIGNNLVNI LASALGTIVG MRLYGDAGVA IATGVLTFVV LVFAEVLPKT IAALYPEKVA YPSSFLLAPL QILMMPLVWL LNAITRMLMR MMGIKTDIVV SGSLSKEELR TIVHESRSQI SRRNQDMLLS VLDLEKMTVD DIMVPRSEII GIDINDDWKS ILRQLSHSPH GRIVLYRDSL DDAISMLRVR EAWRLMSEKK EFTKETMLRA ADEIYFVPEG TPLSTQLVKF QRNKKKVGLV VNEYGDIQGL VTVEDILEEI VGDFTTSMSP TLAEEVTPQN DGSVIIDGTA NVREINKAFN WHLPEDDART VNGVILEALE EIPVAGTRVR IGEYDIDILD VQDNMIKQVK VFPVKPLRES VAE
|
| |