Gene Dhaf_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhaf_0072 
Symbol 
ID7257021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfitobacterium hafniense DCB-2 
KingdomBacteria 
Replicon accessionNC_011830 
Strand
Start bp72606 
End bp73760 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content47% 
IMG OID643559975 
Productglycoside hydrolase family 18 
Protein accessionYP_002456577 
Protein GI219666142 
COG category[R] General function prediction only 
COG ID[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000514122 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGGT TTCTGACTTT AAATTTAATA GCTGTTTTGG TATTGAGTTT TAGCTTGGTT 
GGGTGTAATA CAGCGCAGCA GCAAAACACC CAAGCACCGA ATCAGATCGC CACGGAAAGC
GGCGAACCTC CAAAAGCTGA GGCAGAGACT CGTCAGGATG TTCTAGGGGA AGAGAAACGG
GTGGTCATGG GTTTCTATAC GGACCCTGAA GGAGAGATCC CCGGCTCCAA GGAATCAATG
ATGAAGAACA TCAAATTGAT GGATGAGGTT TCCTTCTTCT GGTATAGTTT TGATGCCAAT
GGAAAAATTC TCACCACAGG GAAAAAAGAT CTCAGCATTA AAGAAGCAGC GCAAAAGAAT
GGAGCTAAAG CCTACGCTTT AATTCATAAT ATGCGCGGCG GCCTCTTCGA TGCCAACCTG
GCCCACAGTG TGTTCGCCAA TCCTCAGACC CGCTCTAAGT TTATCAACAA TATTGTGCAA
CTGGTTATCA ATGAGAAATG GGATGGTGTG GCCATTGATA TTGAAAAGAC ACCACCCGCT
GACCGCAACA ACTTCACAGC CTTCTTAGGT GAGCTTCACG GTGCCTTAAA GGCTAAAGAC
AAGGTGCTCA ACGTCTCCAT TCCGGCTAAG TTTATCGATT ACCCATCCGA CCTTTGGTCC
GGGGCTTATG ATTATGCTTC CATCGGTAAA AATGCCGACC AAATCGTGCT GATGACCTAT
GACGAGCATG GACTGGGAAC CACCCATGGA CCCATATCCT CCCACGCCTG GGTCAATAAA
GTCATCTCCT ATGCAGTGAC CAAAATCCCC AGGGAAAAAA TCGTCTTAGG ACTTCCTGTC
TACTCCTTTG ACTGGGGTTC CAACAAGCCC ACCATGCCCG ACTATCTCTC TTATGAGCAA
AGCATGGCCC GTGCCAAAAA ACATGGGGTG GAAGTTGGCT ATGATGAAGA GCATAAAGTT
CCCTGGTATA CCTACACAGC CAATGGTGTC CGTCATGAAG TATACTTTGA AAACAAGCAA
AGCCTGCAGC CCAAGATGGA ATATGCCCGG GAGCATAAGC TTCATGGCGT AGCTATCTGG
AGATTGGGGA TGGAAGATCC CTCCATCTGG GACAGCTTGG TCAAGACTTA CGGAACCAAT
AAAAATAAGA AATAA
 
Protein sequence
MKRFLTLNLI AVLVLSFSLV GCNTAQQQNT QAPNQIATES GEPPKAEAET RQDVLGEEKR 
VVMGFYTDPE GEIPGSKESM MKNIKLMDEV SFFWYSFDAN GKILTTGKKD LSIKEAAQKN
GAKAYALIHN MRGGLFDANL AHSVFANPQT RSKFINNIVQ LVINEKWDGV AIDIEKTPPA
DRNNFTAFLG ELHGALKAKD KVLNVSIPAK FIDYPSDLWS GAYDYASIGK NADQIVLMTY
DEHGLGTTHG PISSHAWVNK VISYAVTKIP REKIVLGLPV YSFDWGSNKP TMPDYLSYEQ
SMARAKKHGV EVGYDEEHKV PWYTYTANGV RHEVYFENKQ SLQPKMEYAR EHKLHGVAIW
RLGMEDPSIW DSLVKTYGTN KNKK