Gene Dole_2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2014 
Symbol 
ID5694854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2440159 
End bp2443143 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content59% 
IMG OID641264612 
Productpeptidase C25 gingipain 
Protein accessionYP_001529895 
Protein GI158522025 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0011841 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA CAGGTTTTCG GCAGTTGGTT TTCGGGCTGG TTCTGAGTTT TTGGCTTCTG 
GGACCGGCCC ATGCCGGGTG GATTGCCGCG GGCAGCCACA AGGCCGGGCC ACCGGCCCTT
GAAACTGTCC GGTCAGATTC TTCCGGAATG GTGATCGATC TTGATATTCC CGGTCTCCAT
ATAACGGAAA CCCTGCGCGA CGGCATGGTG TACCACGGCA TATCAGTGCC GGGCGGGGGC
CGTCTTTCCG GTATCGGCAA ACCGTCCCTG CCTTTTGTCA GCCGGTATGT GGCGGTGCCG
CAAGGCGCAA CCGCTTCTGT CAGGGTGATG GATGCCCGGT TTGAGGAGAT GACCGGGTAT
AACGTGGTTC CGGCCCAGGC CCCACTGCCG GAAAGCAACA CCGCAAAGGG CCCCCCTTTT
GAAAAGGACC GGGTCGCCTA CGGAGAAAAC GGGTTTTTCC CACGGCAGGT GGCTCAACTG
GAGGGGCCTG TCTCCATTCG GGGGTGTGAA ACATCCCTGC TGCAGCTCTT CCCTGTGCAG
TTCAATCCGG TCTCCCAAAC CCTGAGGGTC TATTCCCGCA TCACCGTTCA GTTGACTTTT
GACGGGGGAA CCCGTCGCTT TATCGACCGG CGGAAGCACT CCCGGTCCCT GGCCCCGGTT
TTTGAGGGCC TGTTTTTAAA TGCCCCCCTG GAGATGGACA GCCCACCCCT CACAAAAGCC
GATGACCGGT CAGTAACCGC CAAAAGCGCC ACCGACGATG CCGCGCTTCT GTTGATTATC
TCTCCGCCGG AACTGGTCGC CGCAGCCGAC CGGCTGGCCA ACTGGAAACG GGCCCAGGGT
ATTTTGACCG AGGTGCGGAC CACCGGCCAG ACCGGGACCA CCGCAGCAGA AATACAGGCG
TTTATTCAGG ATGCCTACGA TACATGGTCC ATACCCCCCT CTTATGTGCT GCTGGTCGGG
GATGTGGAGT TCATTCCCAC TCACCGGGGG GATGGGTGCG GCACCGATCT GTATTATGCC
ACGGTGGACG GGGACGACTA TTTTCCGGAT TTGAGCCTGG GCCGGCTCTC CGTGGACACA
CTGGATCAGG CCATCAAGCG GGTTGAGGAT ATTATCCGGT ATGAACTTTC TCCGCCTGCC
GGGGAGGGGT TTTATCAGAA CGCGGCCATT GTGGCCTATT TCCAGGATAC CAGCCCTCCT
TACAATTATG CGGACCGCCG GTTTCTCCAG TCCGCCGAGG ATATGGCCCT CTTTTTTTCT
GACCCGGCCT ACCTGAACGC CTACGATGTG GATCGGATTT ACTATACCGG ATACCGATCT
CCCCAGAACT GGAATAACGA TTCCTGGAAT TTCGGCACTA CGGGCGTGCT TTCCGGCGGC
CCCGGCGATT CGATACCGTT CTATCTGTTG GAGAGCAACG GCTTTGCCTG GGACGGCGAT
GCCGTTGACA TATCAACGGC CGTAAACGCG GGCCGTTTTC TGGTAACCTA CAGGGGTCAT
GGCCAGACCA GCCGGTGGGA CAGTGTCACC TATACCACCT CCGATGTTTC CGGCCTGCTA
AATCAGGACC TGCTGCCCGT GGTATTCAGC GTCACCTGCC TGTCCGGCAA GTTTGACATG
GAAAGTGTGG GTAACAACAC CCCCTGTTTT TCCGAATCAT GGGAGCGCAA CCCGGACGGC
GGGGCCGTCG GTGTTGTGGC CGCCTCTGAA ACCACCTACA GCGGCCACAA CGACCGCCTG
TTCTGGGGGT GGATGGAGGC ACTGTGGCCC AACTTTCCGG AGGACTACCA TCCTTCAGAC
ACGCCGTTTG ACCAGCCTGC ATGGGAAATG GGCCCGGTGT TCAACTACGG CAAATACTAC
TATGCCACCT GGTATGAAGA AGAACGTTAC CGAAAACTGG AGTTTGAGCG CTTTCACTGG
TTCGGCGACC CCACCATGCG GCTCTGGACA GGGGTTCCTC AGGATTTGAC GGTTTCCGAT
TATGCCATTG ACGCCGGGTC TGGGGGCCTG GAGATCACCC TGGGCCAGGC CGGAGCCGTG
ATCTGCGTTT CGCGGGACGG CGTTATCCTG GGCAAGGGCG TCTCTTCCGG CGGCACTGAT
CTTGTGGCCT GTGCCCCGCC CCTGGCAGCC GATGATGCCA TCCTTGTGGT TGTTACCAAA
CCGAATTTCC GGCCCTTTGT GGCTGTGACC GAAGAAGACG CGGATGCGCT GCCCACGCTG
CTGGAAATCC TCACCGGCAC CAGCCCCCAT GACGGAGACA CCGATGACGA TGGCATTGCC
GACGATGTGG AGGATGGCAA TTTAAACGGC CTGGTGGATG AAGATGAGAC CGACCCCCGG
AACATCGACA CCGACGGCGA CGGGATCCAG GACGGCACGG AGAAGGGGAA AACCTTGGCT
AATATTCCCG ATGATACAAA CAGGGATGTA TTTGTGCCTG ACCTTGACGA CACCACCACC
ACCGATCCGC TCAATTCTGA CACCGACGGA GACTGCGCGT TCGACGGCCA GGAAGATGCC
AACGCCAACG GCCGGTTGGA CGATGATGAA ACCGATCCGA ATGTTTACGA TAACCTGGCT
CCGCCGGTGG CCAATGCCGG TGCCGCTCAG TCCGTTCGGG AGGGGACAAC CGTCCGGCTG
GACGGGGCCG GCTCATACGA TGCCTGTCAG ACGCTGCTCT CTTTTTTCTG GGAGCAGGTG
TCCGGGCCGT CAGTAACCCT TTCCGACTCC GCAACGTCCC GTCCCACCTT TACCGCGCCG
ACGGTTGGTT CAGCCGGGAC TGCGCTGGTG TTCCGTCTGA CCGTGGATGA TGGTGATTTT
TTCGATACCG ATACCTGCCA GGTAACGGTT ACCGACACAC CGCCGCCGTC TGAGGATGAT
GACGACAGCG ATGACGATGA TGATAATGAT GATAGCGATC CGCCCCCCGC GGAGGGCGGT
GGGGGTGGTG GCTGTTTTGT CGAAACCGTG CTGTCGTTGA AGTAA
 
Protein sequence
MKKTGFRQLV FGLVLSFWLL GPAHAGWIAA GSHKAGPPAL ETVRSDSSGM VIDLDIPGLH 
ITETLRDGMV YHGISVPGGG RLSGIGKPSL PFVSRYVAVP QGATASVRVM DARFEEMTGY
NVVPAQAPLP ESNTAKGPPF EKDRVAYGEN GFFPRQVAQL EGPVSIRGCE TSLLQLFPVQ
FNPVSQTLRV YSRITVQLTF DGGTRRFIDR RKHSRSLAPV FEGLFLNAPL EMDSPPLTKA
DDRSVTAKSA TDDAALLLII SPPELVAAAD RLANWKRAQG ILTEVRTTGQ TGTTAAEIQA
FIQDAYDTWS IPPSYVLLVG DVEFIPTHRG DGCGTDLYYA TVDGDDYFPD LSLGRLSVDT
LDQAIKRVED IIRYELSPPA GEGFYQNAAI VAYFQDTSPP YNYADRRFLQ SAEDMALFFS
DPAYLNAYDV DRIYYTGYRS PQNWNNDSWN FGTTGVLSGG PGDSIPFYLL ESNGFAWDGD
AVDISTAVNA GRFLVTYRGH GQTSRWDSVT YTTSDVSGLL NQDLLPVVFS VTCLSGKFDM
ESVGNNTPCF SESWERNPDG GAVGVVAASE TTYSGHNDRL FWGWMEALWP NFPEDYHPSD
TPFDQPAWEM GPVFNYGKYY YATWYEEERY RKLEFERFHW FGDPTMRLWT GVPQDLTVSD
YAIDAGSGGL EITLGQAGAV ICVSRDGVIL GKGVSSGGTD LVACAPPLAA DDAILVVVTK
PNFRPFVAVT EEDADALPTL LEILTGTSPH DGDTDDDGIA DDVEDGNLNG LVDEDETDPR
NIDTDGDGIQ DGTEKGKTLA NIPDDTNRDV FVPDLDDTTT TDPLNSDTDG DCAFDGQEDA
NANGRLDDDE TDPNVYDNLA PPVANAGAAQ SVREGTTVRL DGAGSYDACQ TLLSFFWEQV
SGPSVTLSDS ATSRPTFTAP TVGSAGTALV FRLTVDDGDF FDTDTCQVTV TDTPPPSEDD
DDSDDDDDND DSDPPPAEGG GGGGCFVETV LSLK