Gene ECD_02366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02366 
SymbolypfI 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2458333 
End bp2460348 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content55% 
IMG OID 
Productpredicted hydrolase 
Protein accessionACT44187 
Protein GI253978517 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAAC TGACTGCGCT TCACACATTA ACAGCGCAAA TGAAACGTGA AGGGATCCGC 
CGCTTGCTGG TGTTGAGCGG GGAAGAGGGT TGGTGTTTTG AGCATACTCT TAAGTTGCGT
GATGCCTTAC CTGGCGACTG GCTGTGGATT TCGCCGCGGC CAGATGCTGA AAACCACTGT
TCTCCCTCGG CACTACAAAC TTTACTTGGG CGCGAGTTCC GGCATGCGGT ATTCGACGCC
CGCCACGGCT TTGATGCCGC TGCCTTTGCC GCACTTAGCG GAACGTTGAA AGCGGGAAGC
TGGCTGGTTT TGTTACTCCC TGTATGGGAA GAGTGGGAAA ACCAACCTGA TGCCGACTCG
CTGCGCTGGA GTGATTGCCC TGACCCTATT GCGACGCCGC ATTTTGTCCA GCATCTCAAA
CGCGTACTTA CGGCGGATAA CGAGGCTATC CTCTGGCGGC AAAACCAGCC ATTCTCGTTG
GCGCATTTTA CTCCCCGTAC TGACTGGTAC CCCGCGACTG GCGCACCACA ACCAGAACAA
CAGCAACTCT TAAAGCAGCT AATGACCATG CCGCCGGGCG TGGCAGCGGT AACGGCTGCG
CGTGGGCGCG GTAAGTCGGC GTTGGCAGGG CAACTCATTT CTCGTATTGC GGGCAGAGCG
ATTGTCACCG CGCCCGCAAA AGCGTCAACG GATGTACTGG CACAATTTGC GGGCGAGAAG
TTTCGCTTTA TTGCGCCGGA TGCCTTGTTA GCCAGCGATG AGCAAGCCGA CTGGCTGGTG
GTCGATGAAG CCGCAGCCAT ACCTGCGCCA TTGTTGCATC AACTGGTATC GCGTTTTCCT
CGAACGTTGT TAACCACTAC GGTGCAGGGC TACGAAGGCA CCGGACGTGG TTTTTTGCTG
AAATTTTGCG CTCGCTTTCC GCATTTACAC CGTTTTGAAC TGCAACAGCC GATCCGCTGG
GCGCAGGGAT GCCCGCTGGA AAAAATGGTC AGCGAGGCAC TGGTTTTTGA CGATGAAAAC
TTCACCCATA CACCACAAGG CAATATTGTC ATTTCCGCAT TTGAACAGAC GTTATGGCAA
AGCGATCCAG AAACGCCGTT AAAGGTTTAT CAGCTCTTGT CTGGTGCGCA CTATCGGACT
TCGCCGCTGG ATTTACGCCG GATGATGGAT GCACCAGGGC AACATTTTTT ACAGGCGGCT
GGCGAAAACG AGATTGCCGG GGCGCTGTGG CTGGTGGATG AGGGTGGATT ATCTCAACAA
CTCAGTCAGG CGGTATGGGC AGGTTTTCGT CGCCCGCGGG GTAATCTGGT GGCCCAGTCG
CTGGCGGCGC ACGGCAACAA TCCACTGGCG GCGACATTGC GTGGACGGCG GGTCAGCCGG
ATAGCAGTTC ATCCGGCTCG TCAGCGGGAA GGCACAGGGC GGCAACTTAT TGCTGGTGCT
TTGCAATATA CGCAAGACCT CGACTATCTT TCGGTGAGTT TTGGTTACAC CGGGGAGTTA
TGGCGTTTCT GGCAACGCTG CGGTTTTGTG CTGGTGCGGA TGGGTAATCA TCGGGAAGCC
AGCAGCGGTT GCTATACGGC GATGGCGCTG TTACCGATGA GTGATGCGGG TAAACAGCTG
GCTGAACGTG AGCATTACCG TTTACGTCGC GATGCGCAAG CTCTCGCGCA GTGGAATGGC
GAAACGCTTC CTGTTGATCC ACTAAACGAT GCCGTCCTTT CTGACGACGA CTGGCTTGAA
CTGGCCGGTT TTGCTTTCGC TCATCGTCCG CTATTAACGT CGTTAGGTTG CTTATTGCGT
CTGTTACAAA CCAGTGAACT GGCATTACCG GCGCTGCGTG GGCGTTTACA GAAAAACGCC
AGTGATGCGC AGTTATGTAC CACACTTAAA CTTTCAGGCC GCAAGATGTT ACTGGTCCGT
CAGCGGGAAG AGGCCGCGCA GGCGCTGTTC GCACTTAATG ATGTTCGCAC TGAGCGTCTG
CGCGATCGCA TAACGCAATG GCAATTATTT CACTGA
 
Protein sequence
MAELTALHTL TAQMKREGIR RLLVLSGEEG WCFEHTLKLR DALPGDWLWI SPRPDAENHC 
SPSALQTLLG REFRHAVFDA RHGFDAAAFA ALSGTLKAGS WLVLLLPVWE EWENQPDADS
LRWSDCPDPI ATPHFVQHLK RVLTADNEAI LWRQNQPFSL AHFTPRTDWY PATGAPQPEQ
QQLLKQLMTM PPGVAAVTAA RGRGKSALAG QLISRIAGRA IVTAPAKAST DVLAQFAGEK
FRFIAPDALL ASDEQADWLV VDEAAAIPAP LLHQLVSRFP RTLLTTTVQG YEGTGRGFLL
KFCARFPHLH RFELQQPIRW AQGCPLEKMV SEALVFDDEN FTHTPQGNIV ISAFEQTLWQ
SDPETPLKVY QLLSGAHYRT SPLDLRRMMD APGQHFLQAA GENEIAGALW LVDEGGLSQQ
LSQAVWAGFR RPRGNLVAQS LAAHGNNPLA ATLRGRRVSR IAVHPARQRE GTGRQLIAGA
LQYTQDLDYL SVSFGYTGEL WRFWQRCGFV LVRMGNHREA SSGCYTAMAL LPMSDAGKQL
AEREHYRLRR DAQALAQWNG ETLPVDPLND AVLSDDDWLE LAGFAFAHRP LLTSLGCLLR
LLQTSELALP ALRGRLQKNA SDAQLCTTLK LSGRKMLLVR QREEAAQALF ALNDVRTERL
RDRITQWQLF H