Gene Dhaf_1089 details

Gene Information

Locus tag: Dhaf_1089
Symbol:
ID: 7258058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfitobacterium hafniense DCB-2
Kingdom: Bacteria
Replicon accession: NC_011830
Strand:
Start bp: 1181250
End bp: 1182530
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 48%
IMG OID: 643561004
Product: glycoside hydrolase family 3 domain protein
Protein accession: YP_002457585
Protein GI: 219667150
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
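
The length fields in this record agree with each other if the start and end positions are read as inclusive coordinates and the last codon is a stop codon. The sketch below checks that arithmetic. The function names are illustrative, and the inclusive-coordinate and terminal-stop conventions are assumptions, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that contributes no residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons
```

With the values listed above, `gene_length(1181250, 1182530)` gives 1281 and `protein_length(1281)` gives 426, matching the Gene Length and Protein Length fields.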


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0000284177
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCAAAA GAACGATAGC AATATTCGTA ATCATTATTG CCGCTTTGGT TTTAGGATTT 
CTCTTTTTAT CCCAAAGGTT GAATCCCCAG GAGGGTAACC TTTCCATACC AGGTCAACAC
GATCAGCAGG GGAACAATAA TCCGGCTGAA CCCTCAGAAA AGCCCCCCGA GATTGACCCC
CTGCAGGAAC GGATCCAAGC TATGACCTTG GAAGAGAAGG TCGGGCAGCT GGTGATGGTG
GGAGTGGACG GGTATGAAAT AAACGCTAAT GCTCAGCAGT TGATTGAAAA TTATCATGTG
GGCGGCTTTG TTCTCTTAAA GAAAAATGTC AGGGATAGCG GGCAGATGTT AAACCTGATC
AACACCTTGA AAGAGACCAA TGGAGTCAAT AGAATTCCCT TATTCCTGGC CCTTGATGAA
GAGGGGGGCA GGATATCCAG GATGCCTGCT GAATTCAAGA AGATGCCTTC CAGTCAGCAG
GTCGGAGCCC AGAATAGTGG TGCTTTAGGG AAGAAGATGG GAGAAATCCT GGGCCGGGAA
GTCAAGGGAT TTGGGATGAA TGTGAATTTC GCTCCGGTTC TCGATATTTT CAGCAACCCC
AAGAATAAAG TGATCGGTGA TCGTGCTTTC GGCAGCAACC CCGAGCTTGT CAGCAAAGTG
GGAATTCAGA CTATGAGGGG AATCCAGGAG CAGGGCATTA TCTCTGTGGT TAAACATTTT
CCCGGTCATG GGGATACTTC GGTGGATTCC CATGTGGGAT TGCCCCGCGT TGATTATGAT
CTGGAACGAT TAAGGAATTT TGAGCTGAGG CCATTTGCAG AAGCCATTGC CAATGATGTG
GATGCCATTA TGCTGGCCCA CATCCTGCTG CCGAAGCTTG ACCCGGATTA TCCGGCATCC
TTTTCAGAAG TTCTTATCCG CGATATCCTG CGCAAAGAGA TGGACTATAA CGGGGTCGTG
ATTACGGATG ATATGACTAT GGGGGCTATT GTGGAGAATT ATAATATCGG TGAGGCCGCG
GTGAAATCCA TCCTGGCCGG CAGCGATATT GTCCTGGTCT GCCATGATTT CGCGAAAGAA
GAGGCTGTTC TCAAGGAGAT CCTTCATGCT GCAGAGACAG GGAAAATTCC CGTGGACCGG
ATCGATGAAA GTGTTTATCG TGTCTTAAAG CTGAAAGAGA AGTATGCTCT GGCCGACCGG
CAGAAAGAAT CGGTGGATGT ACAAGGTATC AATGCCGAAA TCGAGCAGTT TTATAAGGAT
TACCCGGCTT TAAAAGGGTA G
 
Protein sequence
MGKRTIAIFV IIIAALVLGF LFLSQRLNPQ EGNLSIPGQH DQQGNNNPAE PSEKPPEIDP 
LQERIQAMTL EEKVGQLVMV GVDGYEINAN AQQLIENYHV GGFVLLKKNV RDSGQMLNLI
NTLKETNGVN RIPLFLALDE EGGRISRMPA EFKKMPSSQQ VGAQNSGALG KKMGEILGRE
VKGFGMNVNF APVLDIFSNP KNKVIGDRAF GSNPELVSKV GIQTMRGIQE QGIISVVKHF
PGHGDTSVDS HVGLPRVDYD LERLRNFELR PFAEAIANDV DAIMLAHILL PKLDPDYPAS
FSEVLIRDIL RKEMDYNGVV ITDDMTMGAI VENYNIGEAA VKSILAGSDI VLVCHDFAKE
EAVLKEILHA AETGKIPVDR IDESVYRVLK LKEKYALADR QKESVDVQGI NAEIEQFYKD
YPALKG
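
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below strips that layout, computes GC content, and translates the DNA so the result can be compared with the listed protein. The helper names are illustrative. The codon-to-residue mapping is the standard one, which translation table 11 shares with the standard code (the two differ in permitted start codons). The sketch assumes an unambiguous A/C/G/T sequence read in frame from its first base.

```python
# Amino-acid assignments in TCAG codon order. Translation table 11 uses the
# same codon-to-residue mapping as the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)
```

Applied to the first line of the gene sequence, `translate(clean_sequence(...))` yields `MGKRTIAIFVIIIAALVLGF`, the first twenty residues of the protein sequence. Applied to the last line, it yields `YPALKG` and stops at the terminal TAG codon. Running `gc_percent` on the full cleaned gene sequence gives a value to compare against the GC content field.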