Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_2211 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 2369736 |
End bp | 2371697 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | peptidase U32 |
Protein accession | ACX39861 |
Protein GI | 260449439 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.722535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGTAT CTTCTCATCG ACTTGAACTG TTAAGCCCGG CACGCGATGC CGCCATTGCC CGCGAAGCTA TTTTGCACGG TGCCGATGCT GTTTATATCG GCGGCCCTGG TTTTGGTGCC CGTCATAATG CCAGTAATAG CTTGAAAGAT ATTGCCGAGC TGGTGCCGTT TGCCCATCGT TATGGTGCAA AAATTTTCGT CACGCTTAAC ACCATTTTGC ATGATGATGA GCTGGAACCC GCGCAACGGC TGATTACTGA CCTCTACCAG ACCGGTGTCG ATGCGCTGAT TGTTCAGGAT ATGGGGATTC TGGAACTTGA TATTCCGCCG ATTGAACTGC ACGCCAGTAC GCAGTGCGAC ATTCGTACAG TTGAAAAAGC GAAGTTCCTC TCTGATGTTG GCTTCACGCA GATTGTGCTG GCGCGAGAGC TGAATCTTGA TCAGATCCGC GCGATTCACC AGGCTACGGA CGCGACCATT GAATTCTTTA TTCATGGGGC ACTGTGCGTG GCCTATTCGG GTCAGTGCTA CATTTCTCAT GCGCAAACAG GGCGTAGCGC CAACCGTGGC GATTGCTCGC AGGCGTGCCG TTTGCCATAC ACATTGAAAG ACGATCAGGG GCGGGTGGTT TCCTATGAAA AACATCTGCT GTCGATGAAA GATAACGATC AGACTGCCAA CCTCGGCGCG CTGATTGATG CTGGTGTACG CTCCTTCAAG ATTGAAGGGC GTTACAAAGA TATGAGCTAC GTGAAGAATA TCACCGCCCA TTATCGCCAG ATGCTTGATG CCATTATTGA AGAACGTGGC GATCTGGCGC GCGCTTCATC AGGTCGTACT GAACATTTCT TTGTTCCATC GACGGAAAAG ACTTTCCACC GTGGTAGCAC AGATTATTTT GTGAATGCCC GTAAAGGCGA TATTGGCGCG TTCGATTCGC CGAAATTTAT CGGCCTGCCG GTAGGCGAAG TAGTGAAAGT GGCGAAAGAT CATCTCGATG TTGCCGTTAC CGAGCCACTG GCAAATGGCG ATGGCCTGAA CGTGTTGATT AAACGTGAAG TCGTCGGTTT TCGTGCCAAT ACGGTCGAGA AAACCGGAGA AAATCAGTAC CGCGTCTGGC CCAATGAAAT GCCAGCAGAT TTGCACAAAA TTCGTCCACA TCACCCACTA AACCGTAATC TTGATCATAA CTGGCAGCAG GCACTGACAA AAACCTCCAG CGAACGTCGG GTGGCGGTAG ACATTGAACT GGGCGGCTGG CAGGAACAAC TGATTCTGAC CCTCACCAGT GAAGAGGGTG TCAGCATCAC GCATACGCTG GACGGGCAGT TCGACGAAGC CAATAACGCC GAAAAAGCAA TGAACAATCT GAAGGATGGT CTGGCAAAAC TGGGGCAAAC CCTCTATTAC GCCCGCGATG TGCAAATTAA TTTGCCGGGG GCGCTGTTTG TACCAAACAG TCTGTTAAAC CAGTTCCGCC GTGAAGCTGC TGACATGCTG GATGCTGCGC GTCTTGCCAG TTACCAGCGC GGCAGCCGTA AACCGGTTGC TGATCCTGCG CCGGTTTATC CGCAAACGCA TCTGAGTTTC CTCGCGAACG TATACAACCA GAAAGCGCGT GAATTTTATC ATCGCTATGG TGTGCAGCTG ATTGACGCGG CGTATGAAGC ACATGAAGAG AAGGGCGAAG TCCCGGTGAT GATCACCAAG CATTGTCTGC GCTTTGCCTT TAATCTGTGC CCGAAACAGG CGAAAGGCAA TATCAAAAGC TGGAAGGCGA CGCCAATGCA ACTGGTTAAC GGCGATGAAG TATTAACGCT AAAGTTTGAT TGCCGCCCAT GCGAGATGCA CGTCATTGGC AAAATCAAAA ATCACATACT GAAAATGCCG TTACCGGGAA GCGTAGTGGC ATCCGTAAGT CCGGATGAGC TGCTGAAAAC ATTGCCGAAG CGAAAAGGGT AA
|
Protein sequence | MTVSSHRLEL LSPARDAAIA REAILHGADA VYIGGPGFGA RHNASNSLKD IAELVPFAHR YGAKIFVTLN TILHDDELEP AQRLITDLYQ TGVDALIVQD MGILELDIPP IELHASTQCD IRTVEKAKFL SDVGFTQIVL ARELNLDQIR AIHQATDATI EFFIHGALCV AYSGQCYISH AQTGRSANRG DCSQACRLPY TLKDDQGRVV SYEKHLLSMK DNDQTANLGA LIDAGVRSFK IEGRYKDMSY VKNITAHYRQ MLDAIIEERG DLARASSGRT EHFFVPSTEK TFHRGSTDYF VNARKGDIGA FDSPKFIGLP VGEVVKVAKD HLDVAVTEPL ANGDGLNVLI KREVVGFRAN TVEKTGENQY RVWPNEMPAD LHKIRPHHPL NRNLDHNWQQ ALTKTSSERR VAVDIELGGW QEQLILTLTS EEGVSITHTL DGQFDEANNA EKAMNNLKDG LAKLGQTLYY ARDVQINLPG ALFVPNSLLN QFRREAADML DAARLASYQR GSRKPVADPA PVYPQTHLSF LANVYNQKAR EFYHRYGVQL IDAAYEAHEE KGEVPVMITK HCLRFAFNLC PKQAKGNIKS WKATPMQLVN GDEVLTLKFD CRPCEMHVIG KIKNHILKMP LPGSVVASVS PDELLKTLPK RKG
|
| |