Gene CHU_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_2040 
Symbol 
ID4186701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp2370956 
End bp2373964 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content44% 
IMG OID638072040 
Productesterase-like protein 
Protein accessionYP_678645 
Protein GI110638436 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3509] Poly(3-hydroxybutyrate) depolymerase 
TIGRFAM ID[TIGR01840] esterase, PHB depolymerase family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.400928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGT ACGTTGTTCT ATTCAAGTAT GCGTTTGCAG GAATCCATTA TCATGTTACA 
TTTTTAATGA TATGTTTATG GATGCTTTCG TTTCATTCTA TGGCACAATT AACAACAATT
ACTGTAGGGA ATGCTACTCG AAGTATGTTG GTATATGCAC CTGCAGGAAT TCAGCAGAAC
AGACCTTTAC TGATTTCTAT GCATGGTTTA AATCAAGATC CGAATTATCA AAAGTCACAA
ACAAAATGGG AACTCGTAGC TGATACTGCG AAGTTTATTG TTGTATATCC TGCAGGTATT
AATAATTCAT GGGATTTGTC AGGTAATACG GATACTGATT TTATTTTGAA AATCATTGAT
GCTATGTATA CCAGTTATGG GATTGATCGT TCCCGGGTAT ATCTATCCGG CTTTTCAATG
GGAGGTATGA TGACTTATGT AGCAGCAACT AAAATTGCTG ACAAAATTGC CGCATTCGCA
CCTGTTTCAG GATATCCATT ATCTTCAAGT AATTTTAATA GTTCACGTGT TGTTCCGTTT
ATTCATATTC ATGGTGATGC TGACAATGTT GTTATTTATG ATAATAAGCT TTTGACTTAT
TTACAAGGTT GGAGAACTAA AAATGGATGT TCTTCAACAG CGGTAGTTAC TAAACCATAT
CCATCAAATA TTTCTAATTC AGTTGCTACT AAGAGCTCCT GGACGAATTG TGGTTGCGGT
ACTGAATTTG TTCTGATGAC CCTGGCTGGA AAAGGCCACT GGCATTCTTT AGATGCTACT
TTTAATTCAA CGGTCGAGAT ATGGAATTTC GTAAGAAAAT ATAAAAATAC GTGTGGTAAT
GTAGGACCGG TAGTATCGCT TACAGCACCT GTAAATAATA CAGTTTATAC GGAAGGTGAT
AATATAACGA TCAATGCCAC GGCAACGATC ACAAGCGGGA GCATTTCCAA AGTAGAATTT
TATAACGGAA CAACGTTGTT AGGTACAGAT GCAAGTTCAC CATACAGCTA TACAATCACA
GCTGCAGCAG CAGGAACATA TCCGATCACT GCCAAAGCAA CGAGTGCAGC CAATGCAGTA
ACAACGAGCA CGGCAATAAA CATTCAGGTA GCAAAACCTA TTTACCAGAC CGGTTCTGCA
CCCACAATCG ATGGAACCGT TGACGGCTTG TGGAGCAGTT TTCCATCCAC AGGTATCACA
AAAAATAATA CCGGTACGAT CAGCTCAGGT ACAGATCTGT CGGGTAACTG GAAAGCGATG
TGGGATGCGT CTAATCTGTA TGTATTGGTT CAGGTAACCG ATGATGTGAA GCGCAACGAT
GGGGGAACGG ATGTATACAA CGACGATGGC GTTGAAGTAT ACATTGATCT GGGCAACACC
AAAGCAACGA CATACGGCAC CAACGACCAG CAGTACACGT TCCGCTGGAA CGATGTTACA
GCGGCCTATG AGATCAACGG ACATCCGGTA ACAGGAATAA CCAAAGGCAT CAGCAATACA
GCAACCGGTT ATATTGTGGA GGTGAGCATC CCGTGGAGCA CCATTGGCGG CACTGCTTCA
TTAAATTCAT TCCAGGGCTT TGAAGTCATG ATCAATGATG ACGATGACGG AGGAGCAAGA
GAAGGTAAGC TTGCCTGGGT TGCGTCTACA GATGATACGT GGAGCAATCC GGCTTTAATG
GGAACAGTTG TATTAAAAGG ATTGAATTGT ACGGTACCGG CAGCAGCGAT AACAGCAAGC
ACGGCAACCA CATTCTGCTC CGGAGGCAGT GTAGTATTGA ATGCAGGTAC AGGCACCGGA
TACAGCTATG TATGGAAGAA CGGTACAGCA ACAATAGCAG GAGCGACAAA TTCAGGTTAT
ACAGCCACCG CATCGGGCAG TTATACGGTA ACAGTAACAA ACCCGGGCGG CTGTTCAGCA
ACCTCAGCAG GGACTACGGT GACGGTAAAT GCCTTACCGG TTTTAACGCA GTATGCACAG
GTAGATGGCG GAACCTGGAA CCAGGTATCA GGCGCAACGG TGTGTGCTGG CTCTTCGGTT
GTACTGGGTC CTCAGCCGAC AGTAAATACA GGCTGGAGCT GGACAGGTCC GAACGGTTAC
AGTGCATCGA CCAGAGAGAT TACGTTAACT GGAGTTACAC CAACACAGGG CGGTATTTAT
ACGGCAAGTT ATACAGATGG AAATATGTGT AAATCAACTT CTGCATTTAC GTTAACAGTA
ACTGCACTGC CAACTGCTAC AATCACAGCA ACTGGTTCAA CAACGATTCC TCAGGGCGGA
AGTGTAGTAT TACAGGCGAA TGCAGGTTCA GCTTTGACCT ACAAATGGTT CAACGGCACG
GTCACAATCA CAGGAGCAAC CGCACAGACC TATACCGCAA CGAACGCGGG AAGCTATACC
GTTGAAGTAA CAAATGCGGG TAACTGCAAA GCAACTTCAG CAGCAGCAAC AGTAAGCGTG
GTTGCAAATC AGCCATCTGT TATTACAATT ACTTCACCGG CACCGAATGC TGCAGTAACA
GGAGCGATCG ATATTTCGGT GAATATCACA GATGCGGATG GTAGTATAAC CCTTGTAGAG
TTTTTAGCAG GCGATGATGT AATCGGCACA GCAGCAGCAG CGCCGTATAC GTACACATGG
GACACTCCAA CGGCAGGATC TCATACGATT ACGGTTCGAG TAACAGACAG TAACGGAGGC
GTCACAACTT CTGGACCGGT AACAGTTACA TCGGAATCCA TCACAACAGG CGTGCAGGCA
TTGAATACAT TGAATGCAGC TGTATATCCG AATCCATCAA ACGGCATCGT ATTTATTGAT
ACAGATGCAG ACTTATCAGA TGCAAGTTTT ACACTGATAG ATGTGTTGGG TAAAGAAGGA
ACTGTTTCTT CAACAGCAAC CGGCAACGGA GCGATGATAG ATGTAAGCAG TCTGGCGGGT
GGCACTTATA TGCTTATCAT ACATAATGGC AATGCAACTC TTAGAAAGAA ATTTAACGTT
GTAAAATAA
 
Protein sequence
MNKYVVLFKY AFAGIHYHVT FLMICLWMLS FHSMAQLTTI TVGNATRSML VYAPAGIQQN 
RPLLISMHGL NQDPNYQKSQ TKWELVADTA KFIVVYPAGI NNSWDLSGNT DTDFILKIID
AMYTSYGIDR SRVYLSGFSM GGMMTYVAAT KIADKIAAFA PVSGYPLSSS NFNSSRVVPF
IHIHGDADNV VIYDNKLLTY LQGWRTKNGC SSTAVVTKPY PSNISNSVAT KSSWTNCGCG
TEFVLMTLAG KGHWHSLDAT FNSTVEIWNF VRKYKNTCGN VGPVVSLTAP VNNTVYTEGD
NITINATATI TSGSISKVEF YNGTTLLGTD ASSPYSYTIT AAAAGTYPIT AKATSAANAV
TTSTAINIQV AKPIYQTGSA PTIDGTVDGL WSSFPSTGIT KNNTGTISSG TDLSGNWKAM
WDASNLYVLV QVTDDVKRND GGTDVYNDDG VEVYIDLGNT KATTYGTNDQ QYTFRWNDVT
AAYEINGHPV TGITKGISNT ATGYIVEVSI PWSTIGGTAS LNSFQGFEVM INDDDDGGAR
EGKLAWVAST DDTWSNPALM GTVVLKGLNC TVPAAAITAS TATTFCSGGS VVLNAGTGTG
YSYVWKNGTA TIAGATNSGY TATASGSYTV TVTNPGGCSA TSAGTTVTVN ALPVLTQYAQ
VDGGTWNQVS GATVCAGSSV VLGPQPTVNT GWSWTGPNGY SASTREITLT GVTPTQGGIY
TASYTDGNMC KSTSAFTLTV TALPTATITA TGSTTIPQGG SVVLQANAGS ALTYKWFNGT
VTITGATAQT YTATNAGSYT VEVTNAGNCK ATSAAATVSV VANQPSVITI TSPAPNAAVT
GAIDISVNIT DADGSITLVE FLAGDDVIGT AAAAPYTYTW DTPTAGSHTI TVRVTDSNGG
VTTSGPVTVT SESITTGVQA LNTLNAAVYP NPSNGIVFID TDADLSDASF TLIDVLGKEG
TVSSTATGNG AMIDVSSLAG GTYMLIIHNG NATLRKKFNV VK