Gene Dole_0991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0991 
Symbol 
ID5693826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp1163735 
End bp1165018 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content60% 
IMG OID641263588 
Productpeptidase U32 
Protein accessionYP_001528878 
Protein GI158521008 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAATG AACTGCGGAA CACCATGAGC CACCCGTTAA AAAAAGTGGA GCTGCTGGCC 
CCTGCCGGTA CCCCGGAAAA ACTGGAGATC GCCATTCACT ACGGCGCGGA CGCCGTCTAC
CTGGCGGACA GCCGCTTTTC CCTGCGCAAT TTCGCGGGCA ACTTCACCTG CGACCAGTTA
ACGGCGGCGG CCGGCCTTGC CCGAAAACAC GGGGTGAAAC TGTATGTGGC CTGCAACATC
TATGCAAGGA CCGATGAAAC CGAGGCCCTG TGCGAATACT TTCACCGGCT CTCCGCCATC
GGCCCGGACG GCATCATCAT AGCGGATCCC GGCGTGCTGA AACTGGCCAG GGCCACCATT
CCCCATATTC CTGTTCACCT CAGCACCCAG GCCAACACCA CCAGCCTGGA GGCGGTCCGC
TTCTGGGAAC AGCAGGGCGT GTCCCGTATC AACCTGGCCC GGGAGCTCAC CCTGACCGAG
CTTGCCCAAA TCGCCTCTCA AACATCGGTC CAGATCGAAA CCTTTGTCCA TGGGTCCATG
TGCATGGCCT ATTCCGGCCG GTGCCTGCTC AGCGGTTTTC TCACCGGCCG GGAGAGCAAC
CGGGGCCTGT GCAGCCAGCC GTGCCGGTGG CAATATTCCC TGGCCGAGGA GACCCGGCCC
GGGGTCTGGA TGCCGGTGTT TGAAGATGAC CGGGGGGCCT ATGTGTTTAA CGCAAAAGAC
CTGTGCATGA TCGAACACAT CGACAGCCTG ATCAACGCCG GCATCGCCGC CTTAAAAATT
GAAGGCCGCA TGAAAAGCAT TCATTATCTG GCCGCCACTG TAAAGGTCTA TCGCGAGGCC
ATTGATGCTT ATTACGAAAA ACCGGAAAAA TATCGTGTGC AGACCGCCTG GATCGAAGAA
CTTGAAGCCG TCAACAACCG GGGGTTTTCC ACCGGCTTCT ACTTCGGTCC CCCGGAAAGC
GGGGGCATCA ACCGAACCGG TGCCCGGCCC GGCACAGCAT ACCGCTTCCT GGCGAGGATC
CTTCGGGCCC GGCCATCGGG CCGGGTCACG GCCGAGGTGA AAAACAAGCT GTGCGAAGGC
GACGCCGTTG AAATTTTGAC CGCCGGAGGA CCGGTTCGGC CAGGCACGGT GCTGAACATT
TTTGATGCCG ACGGCAACCC AATGGAAGCG GCCATGCCCA ACAGCACGGC CACCCTGGTC
CTTTCCGCCA CCTGCGGACC CAACGACCTG ATCCGGTGCC GGGAAACCCC GCCTGCGACA
CAGGGGGGGG AACACCTGCG ATAA
 
Protein sequence
MYNELRNTMS HPLKKVELLA PAGTPEKLEI AIHYGADAVY LADSRFSLRN FAGNFTCDQL 
TAAAGLARKH GVKLYVACNI YARTDETEAL CEYFHRLSAI GPDGIIIADP GVLKLARATI
PHIPVHLSTQ ANTTSLEAVR FWEQQGVSRI NLARELTLTE LAQIASQTSV QIETFVHGSM
CMAYSGRCLL SGFLTGRESN RGLCSQPCRW QYSLAEETRP GVWMPVFEDD RGAYVFNAKD
LCMIEHIDSL INAGIAALKI EGRMKSIHYL AATVKVYREA IDAYYEKPEK YRVQTAWIEE
LEAVNNRGFS TGFYFGPPES GGINRTGARP GTAYRFLARI LRARPSGRVT AEVKNKLCEG
DAVEILTAGG PVRPGTVLNI FDADGNPMEA AMPNSTATLV LSATCGPNDL IRCRETPPAT
QGGEHLR