Gene Dole_0295 details

Gene Information

Locus tag: Dole_0295
Symbol:
ID: 5693114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 339512
End bp: 340543
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 62%
IMG OID: 641262876
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_001528182
Protein GI: 158520312
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
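
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source record.

```python
# Values copied from the record above.
START_BP = 339512
END_BP = 340543
GENE_LENGTH_BP = 1032
PROTEIN_LENGTH_AA = 343


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # prints: 1032 343
```

Under these assumptions both computed values agree with the listed Gene Length and Protein Length.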


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGCAA CCGACAAACG CCCTTTTTCC CTCTCCGGCT TTTCCGATGA CATGCTGGCT 
GGCCAGCGGC TTGTGGCCGG ATTTGAGGGA ACAACCCTCA ACGACGACCT CAAATACCTG
ATCGACACCC TCAAGGTGGG GGGTATTATT CTGTTTGCCG TGAACCTGGA GCACCCCGAT
CAGATTCGCG ACCTGTGCGC TTCGGCCCAG GCCCATGCCG CGGCCTGCCG CCTGCCGCCC
CTGTTTGTGG CCATCGACCA GGAAGGGGGA CAAGTGGCCC GGCTCAAACC GCCCTTTACC
CGGTTTGAGG GCAACCCTTC AATCACCACG GATGACCAGG CCCGGCACTT CGCCCGAATC
ACCGCGTCCG AGCTGGCCGG CATCGGGGTG AATATGAACA TGGCGCCGGT GCTGGACGTG
GCCGACGGGG TTACCGACAG CGTCATGGCC GGCCGGGCCT TTGCCGGCGG CCCCCGGGAA
GTGGCCCGGC TGGGCGGCGT GGTCATTGAA GAGATGCAGA AAAACGGCAT AATGGCCGTG
GGCAAGCACT TTCCCGGCAT CGGCCGCACC ACGGCCGACT CCCACATCGA CCAGCCCTGG
CTGGCGGCCG ATCCCGCAGA AATGGAAACC ACCGACCTGG TGCCGTTTAA GACGGCCATC
GAACGGGACG TGGCCGGCAT CATGCTTTCC CATATCCGCT ACACCGCCCT GGATCCGGAC
CTGCCGGCCA GCATGTCGAC ACCTATTGCA AAAACCCTGC TGCGGGAAAA ATTGGGATAT
GAAGGCCTGG TGATGACCGA CGATCTGGAC ATGGGCGCCA TTCGCAACCA CCATGTTATG
GATCAGGTGG TGCGATGCGC GGACCGGGCC GGCATCGACA TGGTGCTGGT CTGCCACAAG
GGGCCGGACA GAAAAAAGGC GGTTGAGTCC TTCAGGGAAC TGCTGGAAAC ATCGGACACG
CACAGAAAAC AGGCCCTGTG CTCAGTGGAG CGGATTCTGC GGGCCAAGGC CCGTTATCTT
TGCCGCATTT GA
 
Protein sequence
MAATDKRPFS LSGFSDDMLA GQRLVAGFEG TTLNDDLKYL IDTLKVGGII LFAVNLEHPD 
QIRDLCASAQ AHAAACRLPP LFVAIDQEGG QVARLKPPFT RFEGNPSITT DDQARHFARI
TASELAGIGV NMNMAPVLDV ADGVTDSVMA GRAFAGGPRE VARLGGVVIE EMQKNGIMAV
GKHFPGIGRT TADSHIDQPW LAADPAEMET TDLVPFKTAI ERDVAGIMLS HIRYTALDPD
LPASMSTPIA KTLLREKLGY EGLVMTDDLD MGAIRNHHVM DQVVRCADRA GIDMVLVCHK
GPDRKKAVES FRELLETSDT HRKQALCSVE RILRAKARYL CRI
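
The sequences above are displayed in space-separated groups of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are illustrative, and only the first displayed line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a display-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First displayed line of the gene sequence, copied from the record above.
first_line = (
    "ATGGCAGCAA CCGACAAACG CCCTTTTTCC CTCTCCGGCT TTTCCGATGA CATGCTGGCT"
)

fragment = clean_sequence(first_line)
print(len(fragment))           # 60 bases in this fragment
print(fragment.startswith("ATG"))  # the CDS begins with an ATG start codon
print(round(gc_fraction(fragment), 2))
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_fraction` would give a figure to compare against the listed GC content.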