Gene Dole_1030 details


Gene Information

Locus tag: Dole_1030
Symbol:
ID: 5693865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1215813
End bp: 1217783
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 63%
IMG OID: 641263627
Product: peptidase U32
Protein accession: YP_001528917
Protein GI: 158521047
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
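
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the coding sequence ends with a single untranslated stop codon; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 1215813
end_bp = 1217783

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1971 656
```

Under these assumptions the result agrees with the listed Gene Length (1971 bp) and Protein Length (656 aa).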


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGACA CAAAAAACCA TAAACCCCAG ATTCTTGCCC CGGCCGGGGG AAAGGCATCG 
TTTCTGGCGG CCCTGGCTGC CGGCGCCGAC GTGATCTATT GCGGCCTGAA AAGTTTTTCC
GCCCGCATGG CGGCAGAAAA CTTTGCGCCC GGTGAACTTC GCGCGCTGAC GGAGCTGGCC
CATAAAAAAG GGGTGAAGGT CTTTGTGGCG CTTAATACCC TGGTCCGTCC GGGGGAGATT
CCCCAGGTCC GGCAGCTGGT CCACATCCTT GGCAGAGAGG TGGGCGCCGA CGCGCTGATC
GTTCAGGATT TGAGTGTTGT GGAACTGGCG AAACAGGCCG GTTTTAAAGG AGAACTGCAC
CTTTCCACCC TGGGGGCGGT CACCTTTTCA AAGGCGCTGG GCCTGATTTC CAGTGCCCTG
GGTGTCAGCC GGGTGGTGCT GCCCAGGGAG TTTCACATCG ATGAGATCAA GCAGATGGCC
CAGTCCTGTC CCCCGGGCAT GAGCCTGGAG GTGTTCGTTC ACGGGGCCCT GTGCTACGGG
GTGTCGGGCC GGTGCTACTG GAGCAGTTAC ATGGGCGGCA AAAGCGGGCT GCGGGGTCGG
TGCGTGCAGC CCTGCCGCCG GACCTACACC CAGGGCCGGT CCGAGGGCCG CTGGTTTTCC
TGTCTTGATT TTTCCGTGGA CGTGCTCACC AAGACCCTGC TGCCCCTGCC GCAGATCACC
GCCTGGAAGA TAGAGGGCCG TAAAAAAGGG CCCCACTATG TTTACTACAC CACCACTGCC
TATAAAATGC TGCGGGACCA CGGCAACGAC CCGAAGATTA AAAAGGATGC CATGGGCTAT
CTTGAACAGG CCCTGGGCAG AAAAACCACC CACTACAACT TTCTGCCCCA GCGGCCCTAT
CCGCCGTCAG GCCAGGAGGA GCAGACCGGG TCCGGGCTTC TGGCCGGACG GGTGAAGAGC
GACGGCGGCA GGCCGGCCCT CTCCCCCCGC ATGGGGTTGA TCAAGGGGGA CCTGCTGCGT
ATTGGTTACG AGGACAAGCC CGGCCACAGC CTGCTTCGGG TGCCGGCCGC CGTACCGGCC
AGGGGCCGGC TGGTGTTAAA GGTCCGTGGC GCTGTGCCCG CGGCCGGAAC ACCGGTTTTT
CTCATTGACC GCATGGAAGA CGCCCTGGAA ACCATGATCG GGGATCTGGC GGCTGAACTG
ACTGACGCCC CCTGCCGGGA AACAACCTCG GCCGCACCGG ACCGGGCGGC CCGGCAGCGA
CGGCCGGCAT CCAGGTCGCC GGAAGAGATG ACGGTTTACC GGTCCCTGCC AAGGGGAAGG
CAGGCCCACA GTGTGGGCTT CTGGCTCTCT CTTGAGGGAG CCAGAGGTGC CGCCAGGCTG
AATGCCCAAC AGTGGCTCTG GCTGCCGCCG GTGGTGTGGC AGGAGGACGC AGACCGGTGG
CAGGCGCTGG TGAACCGTAT GGTCAAACAG GGTGCCCGGC GGTTCGTGCT GAACGCGCCC
TGGCAGATAT CCCTGTTTGA ACGGACAAGA AATCTGGACC TGTGGGCCGG GCCGTTCTGC
AACCAGGCAA ACGGGGTCTC GATTCAGGTA CTGGCAGGGA TGGGTTTTTC CGGTGTTATC
GTGAGCCCCG AGCTGGGGAA AGAAGATTAC GCGGTCATTC CGGGCCAGAG CCCGGTCCCC
TTGGGAGTGG TTGTTTCGGG CAGCCTGCCC CTGTGCGTGG CCCGTACCCT GCCGGGACCG
GTTCGGGAGA AAAAACTGTT TTCAAGCCCC AGGAAGGAAA ATGCCTGGGC CGAGAAGCAC
AGCGGCCTTG TGTGGTTGTA TCCCGACTGG ATGGTGGATC TGCGGCCCCG GCAGAAAATC
CTGGAACAGT ACGGGTATGC GCTCTTTATT CATCTCCACC ACAGCCCGCC GCCGGGCGTG
AAAATCAGGC AGCGGCCGGG GATGTGGAAC TGGGATGTCG GCCTGCCGTG A
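
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before its length or GC content can be computed. The sketch below shows one way to do that; the helper names `normalize_sequence` and `gc_fraction` are hypothetical, and the sample is only the first three blocks of the sequence, not the full gene.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above (illustrative sample).
sample = "ATGATCGACA CAAAAAACCA TAAACCCCAG"
seq = normalize_sequence(sample)

print(len(seq), round(gc_fraction(seq), 2))  # prints: 30 0.4
```

Applied to the full sequence, the same two calls give the values to compare against the listed Gene Length and GC content fields.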
 
Protein sequence
MIDTKNHKPQ ILAPAGGKAS FLAALAAGAD VIYCGLKSFS ARMAAENFAP GELRALTELA 
HKKGVKVFVA LNTLVRPGEI PQVRQLVHIL GREVGADALI VQDLSVVELA KQAGFKGELH
LSTLGAVTFS KALGLISSAL GVSRVVLPRE FHIDEIKQMA QSCPPGMSLE VFVHGALCYG
VSGRCYWSSY MGGKSGLRGR CVQPCRRTYT QGRSEGRWFS CLDFSVDVLT KTLLPLPQIT
AWKIEGRKKG PHYVYYTTTA YKMLRDHGND PKIKKDAMGY LEQALGRKTT HYNFLPQRPY
PPSGQEEQTG SGLLAGRVKS DGGRPALSPR MGLIKGDLLR IGYEDKPGHS LLRVPAAVPA
RGRLVLKVRG AVPAAGTPVF LIDRMEDALE TMIGDLAAEL TDAPCRETTS AAPDRAARQR
RPASRSPEEM TVYRSLPRGR QAHSVGFWLS LEGARGAARL NAQQWLWLPP VVWQEDADRW
QALVNRMVKQ GARRFVLNAP WQISLFERTR NLDLWAGPFC NQANGVSIQV LAGMGFSGVI
VSPELGKEDY AVIPGQSPVP LGVVVSGSLP LCVARTLPGP VREKKLFSSP RKENAWAEKH
SGLVWLYPDW MVDLRPRQKI LEQYGYALFI HLHHSPPPGV KIRQRPGMWN WDVGLP
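
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the codon assignments of NCBI translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons), and it ignores the special handling of alternative start codons, which does not matter here because the gene begins with ATG. The `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same ones.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last twelve bases of the gene sequence above.
print(translate("ATGATCGACA CAAAAAACCA TAAACCCCAG"))  # prints: MIDTKNHKPQ
print(translate("GGCCTGCCGTGA"))                      # prints: GLP
```

Both outputs match the corresponding ends of the listed protein sequence (MIDTKNHKPQ at the start, GLP at the end).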