Gene Dfer_0289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_0289 
Symbol 
ID8223855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp327578 
End bp328714 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content57% 
IMG OID644928167 
Productglycoside hydrolase family 5 
Protein accessionYP_003084724 
Protein GI255034103 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.014441 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCGCA GGACATTCAT ACAAAATACA TCCATCGCAC TGGCCGGGGC CGCATTGGCT 
CCCGGCCTTG CCGTTTCGGG CCAGGCAGCG CAGAACAAGC TGCCCAAATG GAAAGGCTTC
AATCTACTCG ATTTCTTTTC GCCAGACCCC GCCAAAGGCC GTAAGCCCAC CACCGAGGAA
CAGCTCAAAT GGATGAGCGA CTGGGGCTTC GATTTCATTC GCATTCCGAT GGCCTACCCG
GCTTACCTTA AATTCGATCG CAGCAAAAAC ATCACGCCGG AAGAAGTGTA CCAGATCGAC
GAGCGGGCCG TGGAACGGAT CGATAAGCTC GTGGCCGCGG CGCACAAATA CAACATGCAC
GTGAGCCTGA ACCTCCACCG GGCGCCGGGT TACTGCATTA ATGCGGGTTT TAACGAACCC
TACAACCTCT GGACCGACCA GAAAGCCCTC GATGCATTCT GCTTCCACTG GAATATGTGG
GCTAAACAAT ATAAAAATGT GAGCTCTGCA CGGATCAGCT TCGACCTGCT GAACGAGCCG
AGCATGCGCG CGGATATGAA CGACCAGCAT TCGAAACGCT CATCGGTGCC TGGTGACGTT
TACCGCAAAC TCGCGATTGC CGCGTCGGAA GCGATCCGGA AGGAAAACCC GGGACACCTG
ATCATCGCCG ACGGCAACGA CGTAGGTACA TCGGTCATCC CCGAGCTGGC CGACCTCGAC
ATTGCACAAA GCTGCCGCGG CTACCACCCG GGCATTATTT CGCATTACAA AGCGCCCTGG
GCCACGAAAG ATCCCGACAA TGTGCCGGAA CCAAAATGGC CCGGGCAGGT AGGCGACCAA
TACCTCAGCC GGGCCATGCT GGAAAAGTTT TACAAGCCGT GGATTGAGCT CGTCAAAAAG
GGCGTGGGCG TGCATTGCGG CGAATGCGGC TGCTGGAATA AAACGCCGCA CGCGGTTTTT
CTGGCCTGGT TTAACGACGT GCTCGACATC CTGTCATCGA ACGGCATCGG CTTTTCGCTA
TGGGAATTCG CCGGCGACTT CGGCGTGCTC GACTCCCGCC GCGATGATGT TGCGTACGAA
GACTGGTACG GCCACAAGCT GGACCGCAAG TTGCTCACGC TCCTGATGAA ATACTGA
 
Protein sequence
MHRRTFIQNT SIALAGAALA PGLAVSGQAA QNKLPKWKGF NLLDFFSPDP AKGRKPTTEE 
QLKWMSDWGF DFIRIPMAYP AYLKFDRSKN ITPEEVYQID ERAVERIDKL VAAAHKYNMH
VSLNLHRAPG YCINAGFNEP YNLWTDQKAL DAFCFHWNMW AKQYKNVSSA RISFDLLNEP
SMRADMNDQH SKRSSVPGDV YRKLAIAASE AIRKENPGHL IIADGNDVGT SVIPELADLD
IAQSCRGYHP GIISHYKAPW ATKDPDNVPE PKWPGQVGDQ YLSRAMLEKF YKPWIELVKK
GVGVHCGECG CWNKTPHAVF LAWFNDVLDI LSSNGIGFSL WEFAGDFGVL DSRRDDVAYE
DWYGHKLDRK LLTLLMKY