Gene Dfer_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_2037 
Symbol 
ID8225609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp2481840 
End bp2483810 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content51% 
IMG OID644929874 
ProductNaringenin-chalcone synthase 
Protein accessionYP_003086425 
Protein GI255035804 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3424] Predicted naringenin-chalcone synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.629301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.879643 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCAT TGATAGATCG CAACACCATT AAACTGCTCC GGATTCCGTT CTCGTTTTTC 
CTGGCGCCGC TGTTTTTGTT CGCTTACAGT CAGGCCGAGA CGGTTGTGCA CCGGCAGGCT
TTGTGGAGTT TCCTGATCAT CCATTTGCTC GTGTATCCCG CCAGCAACGG CTACAACAGC
TACATCGACC GCGATGAGGA AAGCATTGGC GGCCTTGAAA AACCACCGCT TCCTACCATT
CGGCTTTTTT ACCTTACACT TTTTCTGGAT GCTGCCGCAA CCATTCTGGC GCTCGTATTC
GTAAATACGT TGTTCGCGCT CTGCCTGGTG CTGTACATCG GGGCTTCGCG GGCGTACAGT
TCGCGGCTGA TCAGGCTGAA AAAATACCCG GTGATCGGAT TTCTGACGGT CGTGTTTTTT
CAGGGCGCAT TCACCTACTA TATGTCGATC GCCGGCATAT CGGGCGGCGC ATTGCGGCTT
GATACGGCCA ATATTTTCGT GTTGCTGGGC TGCTCGCTCC AGATCGCAGG CGCGTACCCG
CTCACGCAGG TGTACCAGCA CGAGCAGGAC CTGCGCGACG GCATCGTTAC GTTGAGCTAC
AAATTGGGCC ATATCGGCAC GTTCGTGTTC ACGACGGCCA TGTTTGTGCT CTGCAATGTG
TTTTATTATT TGTACTTTAC GACCAGGGAC CTTGGAATGA TTTTTTACAT TGTCCAGGTG
TTCTTCATCC CCATTGTGAT TTGGTTCGGG GTCTGGTTTT CGCTGGTTTG GAAGGATCGC
GCACAGGCCA ATTTCCGTAA TACCATGCGG ATGAACTGGG TGGCGGCGAT TTGCATGAAT
AGTTGTTTCA TCGTTTTAAT AATTATCAAC AGAATTCCAT TGAGTTATCT ATCCTCTATT
GAAACAGCCG TTCCCGAATA TGGTTATGCA CAGGAAACGC TTACCGATTT CTACCTGCGT
TCGACCGACG ACCTGAGTAC CCGCCGAAAA ATCAAGATCG TAGCCAGCAA AACGGGGATC
GAAAAACGTT ACTCGGTAAT TCCCGACTTC GATAAAAATC CCGATCAATA CACGTTTTTC
AACCGGAATG CCGCATTATT GCCCGAGCCC ACCCTTTCTC AGCGCATGCA GCTTTACCAG
CAGCACGCCA CGGCGCTTTC CAGAAAGGCG ATTGAGCAGA TCCGGGATTT TGATATGATT
AAAAAAGACA TCACGCATTT GATTACCGTG ACCTGTACCG GGCTTTTTGC GCCGGGCCTG
GATGTGGAAC TGATGCGGGA ATTGAAGTTG AATCCTTCGA TACAACGCAG CAGCGTCAAC
TTTATGGGCT GCAATGCGGC GATTCTGGCA TTGAAAAATG CGGATGCCAT CTGCAAAAGC
AATGCAAATG CCAAAGTGCT GGTCGTGTGT ACGGAGCTGT GCACGATCCA TTTTCAGAAG
CGCTATAATG ATGATTATCT GCTTTCCAAT ATGCTTTTCG GCGATGGCGC GGCGGCATTG
CTCGTGTCGT CCCAGCCGGA CGATCACTAT CTGCACGCGG TGAAAGTCGA TAGTTTCAAT
TCAATGGTGC TGCACAATGG CTACTCGGAT ATGGCCTGGC AGCTGTCGGA GACGGGTTTT
ATCATGAATT TGTCGTCGTA CGTGCCTGAT CTGATCCGGG AAAATATCCG GCCCATGCTG
AAATCGGTCG GGTCGAGGTC GGACGATTAT GGCCATTGGG CAGTGCATCC CGGCGGAAAA
CGCATTGTGG ATGACTTTGC GGCGGCATTG GAACTGGACA GATGCATGCT CTCCCCAACG
TACGATGTGT TGCGGAATTT TGGAAATATG TCCTCGCCAA CCGTGCTGTT TGTCCTGAAA
AATGTCCTGG AAAAAACGAA GCCAGAGCAC CTGAACGACC GCATCTTTGC GGCCGCATTT
GGCCCGGGGC TCAGCATCGA AACCATGCAA TTACGGTATG TTCGGGCATA G
 
Protein sequence
MPALIDRNTI KLLRIPFSFF LAPLFLFAYS QAETVVHRQA LWSFLIIHLL VYPASNGYNS 
YIDRDEESIG GLEKPPLPTI RLFYLTLFLD AAATILALVF VNTLFALCLV LYIGASRAYS
SRLIRLKKYP VIGFLTVVFF QGAFTYYMSI AGISGGALRL DTANIFVLLG CSLQIAGAYP
LTQVYQHEQD LRDGIVTLSY KLGHIGTFVF TTAMFVLCNV FYYLYFTTRD LGMIFYIVQV
FFIPIVIWFG VWFSLVWKDR AQANFRNTMR MNWVAAICMN SCFIVLIIIN RIPLSYLSSI
ETAVPEYGYA QETLTDFYLR STDDLSTRRK IKIVASKTGI EKRYSVIPDF DKNPDQYTFF
NRNAALLPEP TLSQRMQLYQ QHATALSRKA IEQIRDFDMI KKDITHLITV TCTGLFAPGL
DVELMRELKL NPSIQRSSVN FMGCNAAILA LKNADAICKS NANAKVLVVC TELCTIHFQK
RYNDDYLLSN MLFGDGAAAL LVSSQPDDHY LHAVKVDSFN SMVLHNGYSD MAWQLSETGF
IMNLSSYVPD LIRENIRPML KSVGSRSDDY GHWAVHPGGK RIVDDFAAAL ELDRCMLSPT
YDVLRNFGNM SSPTVLFVLK NVLEKTKPEH LNDRIFAAAF GPGLSIETMQ LRYVRA