Gene EcDH1_1797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1797 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1948440 
End bp1950500 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content50% 
IMG OID 
ProductOligopeptidase B 
Protein accessionACX39456 
Protein GI260449034 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTACCAA AAGCCGCCCG CATTCCCCAC GCCATGACGC TTCATGGCGA TACGCGCATC 
GATAATTACT ACTGGCTGCG GGACGATACG CGTTCTCAGC CAGAAGTCCT GGACTACCTG
CAACAAGAAA ATAGTTACGG TCATCGGGTG ATGGCCTCAC AACAAGCCTT GCAGGATCGC
ATCTTAAAGG AAATCATCGA CCGCATTCCG CAACGAGAAG TTTCTGCGCC CTACATCAAA
AATGGCTACC GCTATCGGCA TATTTATGAA CCAGGCTGTG AATATGCTAT CTACCAGCGT
CAATCGGCAT TCAGTGAAGA GTGGGATGAG TGGGAAACAT TGCTCGATGC CAATAAGCGC
GCAGCTCATA GTGAGTTTTA TTCGATGGGC GGAATGGCGA TTACGCCCGA TAACACCATT
ATGGCGCTGG CAGAAGATTT TCTTTCCCGA CGCCAGTACG GCATTCGTTT TCGTAATCTG
GAAACTGGTA ACTGGTACCC GGAACTGCTG GATAACGTTG AACCCAGCTT TGTCTGGGCA
AATGACTCCT GGATTTTCTA CTATGTTCGC AAGCATCCGG TGACGCTGCT GCCTTATCAG
GTCTGGCGTC ACGCCATCGG TACGCCAGCA TCGCAAGATA AACTGATCTA CGAAGAAAAA
GACGATACCT ATTACGTCAG CCTGCATAAA ACGACCTCGA AGCACTATGT AGTCATTCAT
TTGGCCAGCG CCACCACCAG TGAAGTTCGC CTGCTGGACG CGGAAATGGC CGATGCCGAG
CCGTTTGTTT TTCTGCCGCG CCGCAAAGAT CACGAATACA GCCTTGATCA CTACCAGCAT
CGTTTTTATC TGCGTTCCAA CCGCCACGGC AAAAACTTTG GCTTATACCG TACCCGTATG
CGTGATGAGC AACAGTGGGA AGAGTTAATT CCGCCACGCG AAAACATCAT GCTGGAAGGG
TTTACGCTGT TTACCGACTG GCTGGTGGTT GAAGAGCGTC AGCGCGGGTT AACCAGTTTG
CGCCAAATTA ACCGCAAGAC CCGGGAAGTC ATTGGTATTG CCTTTGATGA TCCGGCCTAT
GTGACCTGGA TTGCCTACAA TCCAGAACCT GAAACCGCGC GATTGCGTTA TGGTTATTCT
TCCATGACTA CACCAGACAC TTTGTTTGAA CTGGATATGG ATACCGGTGA GCGTCGTGTA
TTAAAACAAA CGGAAGTTCC TGGTTTTTAT GCGGCGAATT ACCGCAGTGA ACACCTGTGG
ATAGTCGCCC GTGATGGCGT CGAAGTTCCG GTTTCGTTGG TCTACCATCG CAAACATTTT
CGCAAAGGAC ACAACCCGTT GCTGGTGTAT GGCTATGGTT CTTACGGCGC AAGTATTGAT
GCCGATTTCA GTTTTAGCCG CTTGAGTTTG TTAGATCGTG GCTTTGTCTA CGCCATTGTC
CATGTTCGCG GCGGTGGTGA GCTGGGGCAA CAATGGTACG AAGACGGAAA ATTTCTGAAG
AAGAAAAATA CGTTTAATGA TTATCTTGAT GCCTGCGATG CATTGTTAAA ACTGGGCTAT
GGCTCTCCTT CGCTTTGTTA TGCGATGGGC GGGAGTGCGG GGGGCATGTT GATGGGCGTT
GCAATTAATC AACGCCCGGA ATTATTCCAC GGCGTTATCG CCCAGGTACC GTTTGTTGAT
GTTGTAACAA CGATGCTTGA TGAATCAATT CCTCTTACCA CTGGTGAGTT TGAAGAGTGG
GGTAACCCGC AGGATCCGCA ATATTACGAG TACATGAAAA GCTACAGCCC GTATGACAAC
GTCACCGCAC AGGCTTATCC GCATTTACTG GTAACGACCG GTTTGCACGA TTCTCAGGTG
CAATATTGGG AACCGGCAAA ATGGGTCGCT AAATTGCGCG AGCTGAAAAC CGATGACCAT
CTTTTATTGC TCTGTACCGA CATGGACTCA GGCCATGGCG GTAAATCTGG TCGCTTTAAA
TCGTACGAAG GCGTAGCGAT GGAATATGCT TTTCTGGTCG CGCTGGCGCA GGGAACATTA
CCCGCTACGC CTGCGGACTA A
 
Protein sequence
MLPKAARIPH AMTLHGDTRI DNYYWLRDDT RSQPEVLDYL QQENSYGHRV MASQQALQDR 
ILKEIIDRIP QREVSAPYIK NGYRYRHIYE PGCEYAIYQR QSAFSEEWDE WETLLDANKR
AAHSEFYSMG GMAITPDNTI MALAEDFLSR RQYGIRFRNL ETGNWYPELL DNVEPSFVWA
NDSWIFYYVR KHPVTLLPYQ VWRHAIGTPA SQDKLIYEEK DDTYYVSLHK TTSKHYVVIH
LASATTSEVR LLDAEMADAE PFVFLPRRKD HEYSLDHYQH RFYLRSNRHG KNFGLYRTRM
RDEQQWEELI PPRENIMLEG FTLFTDWLVV EERQRGLTSL RQINRKTREV IGIAFDDPAY
VTWIAYNPEP ETARLRYGYS SMTTPDTLFE LDMDTGERRV LKQTEVPGFY AANYRSEHLW
IVARDGVEVP VSLVYHRKHF RKGHNPLLVY GYGSYGASID ADFSFSRLSL LDRGFVYAIV
HVRGGGELGQ QWYEDGKFLK KKNTFNDYLD ACDALLKLGY GSPSLCYAMG GSAGGMLMGV
AINQRPELFH GVIAQVPFVD VVTTMLDESI PLTTGEFEEW GNPQDPQYYE YMKSYSPYDN
VTAQAYPHLL VTTGLHDSQV QYWEPAKWVA KLRELKTDDH LLLLCTDMDS GHGGKSGRFK
SYEGVAMEYA FLVALAQGTL PATPAD