Gene EcDH1_2211 details


Gene Information

Locus tag: EcDH1_2211
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2369736
End bp: 2371697
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: peptidase U32
Protein accession: ACX39861
Protein GI: 260449439
COG category:
COG ID:
TIGRFAM ID:
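
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2369736
end_bp = 2371697

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1962
print(protein_length)   # 653
```

Both results match the "Gene Length" and "Protein Length" fields of the record.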


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.722535
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCGTAT CTTCTCATCG ACTTGAACTG TTAAGCCCGG CACGCGATGC CGCCATTGCC 
CGCGAAGCTA TTTTGCACGG TGCCGATGCT GTTTATATCG GCGGCCCTGG TTTTGGTGCC
CGTCATAATG CCAGTAATAG CTTGAAAGAT ATTGCCGAGC TGGTGCCGTT TGCCCATCGT
TATGGTGCAA AAATTTTCGT CACGCTTAAC ACCATTTTGC ATGATGATGA GCTGGAACCC
GCGCAACGGC TGATTACTGA CCTCTACCAG ACCGGTGTCG ATGCGCTGAT TGTTCAGGAT
ATGGGGATTC TGGAACTTGA TATTCCGCCG ATTGAACTGC ACGCCAGTAC GCAGTGCGAC
ATTCGTACAG TTGAAAAAGC GAAGTTCCTC TCTGATGTTG GCTTCACGCA GATTGTGCTG
GCGCGAGAGC TGAATCTTGA TCAGATCCGC GCGATTCACC AGGCTACGGA CGCGACCATT
GAATTCTTTA TTCATGGGGC ACTGTGCGTG GCCTATTCGG GTCAGTGCTA CATTTCTCAT
GCGCAAACAG GGCGTAGCGC CAACCGTGGC GATTGCTCGC AGGCGTGCCG TTTGCCATAC
ACATTGAAAG ACGATCAGGG GCGGGTGGTT TCCTATGAAA AACATCTGCT GTCGATGAAA
GATAACGATC AGACTGCCAA CCTCGGCGCG CTGATTGATG CTGGTGTACG CTCCTTCAAG
ATTGAAGGGC GTTACAAAGA TATGAGCTAC GTGAAGAATA TCACCGCCCA TTATCGCCAG
ATGCTTGATG CCATTATTGA AGAACGTGGC GATCTGGCGC GCGCTTCATC AGGTCGTACT
GAACATTTCT TTGTTCCATC GACGGAAAAG ACTTTCCACC GTGGTAGCAC AGATTATTTT
GTGAATGCCC GTAAAGGCGA TATTGGCGCG TTCGATTCGC CGAAATTTAT CGGCCTGCCG
GTAGGCGAAG TAGTGAAAGT GGCGAAAGAT CATCTCGATG TTGCCGTTAC CGAGCCACTG
GCAAATGGCG ATGGCCTGAA CGTGTTGATT AAACGTGAAG TCGTCGGTTT TCGTGCCAAT
ACGGTCGAGA AAACCGGAGA AAATCAGTAC CGCGTCTGGC CCAATGAAAT GCCAGCAGAT
TTGCACAAAA TTCGTCCACA TCACCCACTA AACCGTAATC TTGATCATAA CTGGCAGCAG
GCACTGACAA AAACCTCCAG CGAACGTCGG GTGGCGGTAG ACATTGAACT GGGCGGCTGG
CAGGAACAAC TGATTCTGAC CCTCACCAGT GAAGAGGGTG TCAGCATCAC GCATACGCTG
GACGGGCAGT TCGACGAAGC CAATAACGCC GAAAAAGCAA TGAACAATCT GAAGGATGGT
CTGGCAAAAC TGGGGCAAAC CCTCTATTAC GCCCGCGATG TGCAAATTAA TTTGCCGGGG
GCGCTGTTTG TACCAAACAG TCTGTTAAAC CAGTTCCGCC GTGAAGCTGC TGACATGCTG
GATGCTGCGC GTCTTGCCAG TTACCAGCGC GGCAGCCGTA AACCGGTTGC TGATCCTGCG
CCGGTTTATC CGCAAACGCA TCTGAGTTTC CTCGCGAACG TATACAACCA GAAAGCGCGT
GAATTTTATC ATCGCTATGG TGTGCAGCTG ATTGACGCGG CGTATGAAGC ACATGAAGAG
AAGGGCGAAG TCCCGGTGAT GATCACCAAG CATTGTCTGC GCTTTGCCTT TAATCTGTGC
CCGAAACAGG CGAAAGGCAA TATCAAAAGC TGGAAGGCGA CGCCAATGCA ACTGGTTAAC
GGCGATGAAG TATTAACGCT AAAGTTTGAT TGCCGCCCAT GCGAGATGCA CGTCATTGGC
AAAATCAAAA ATCACATACT GAAAATGCCG TTACCGGGAA GCGTAGTGGC ATCCGTAAGT
CCGGATGAGC TGCTGAAAAC ATTGCCGAAG CGAAAAGGGT AA
 
Protein sequence
MTVSSHRLEL LSPARDAAIA REAILHGADA VYIGGPGFGA RHNASNSLKD IAELVPFAHR 
YGAKIFVTLN TILHDDELEP AQRLITDLYQ TGVDALIVQD MGILELDIPP IELHASTQCD
IRTVEKAKFL SDVGFTQIVL ARELNLDQIR AIHQATDATI EFFIHGALCV AYSGQCYISH
AQTGRSANRG DCSQACRLPY TLKDDQGRVV SYEKHLLSMK DNDQTANLGA LIDAGVRSFK
IEGRYKDMSY VKNITAHYRQ MLDAIIEERG DLARASSGRT EHFFVPSTEK TFHRGSTDYF
VNARKGDIGA FDSPKFIGLP VGEVVKVAKD HLDVAVTEPL ANGDGLNVLI KREVVGFRAN
TVEKTGENQY RVWPNEMPAD LHKIRPHHPL NRNLDHNWQQ ALTKTSSERR VAVDIELGGW
QEQLILTLTS EEGVSITHTL DGQFDEANNA EKAMNNLKDG LAKLGQTLYY ARDVQINLPG
ALFVPNSLLN QFRREAADML DAARLASYQR GSRKPVADPA PVYPQTHLSF LANVYNQKAR
EFYHRYGVQL IDAAYEAHEE KGEVPVMITK HCLRFAFNLC PKQAKGNIKS WKATPMQLVN
GDEVLTLKFD CRPCEMHVIG KIKNHILKMP LPGSVVASVS PDELLKTLPK RKG
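
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own method. It assumes translation table 11, whose amino acid assignments are the same as the standard code, and it treats the table's alternative start codons (such as the GTG that opens this gene) as methionine when they appear in the first position. The function name `translate` and the constants are hypothetical names chosen for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons assumed for table 11; at position 0 they are read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna):
    """Translate a CDS, ignoring whitespace, until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_block = "GTGACCGTAT CTTCTCATCG ACTTGAACTG"
print(translate(first_block))  # MTVSSHRLEL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should yield the full protein under the stated assumptions.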