Gene Information |
Locus tag | Dole_2014 |
Symbol | |
ID | 5694854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 2440159 |
End bp | 2443143 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641264612 |
Product | peptidase C25 gingipain |
Protein accession | YP_001529895 |
Protein GI | 158522025 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
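The coordinate and length fields above are consistent with one another, and the relationship can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(2440159, 2443143)
print(length)                  # 2985
print(protein_length(length))  # 994
```

Both results match the Gene Length (2985 bp) and Protein Length (994 aa) fields of this record.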
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0011841 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CAGGTTTTCG GCAGTTGGTT TTCGGGCTGG TTCTGAGTTT TTGGCTTCTG GGACCGGCCC ATGCCGGGTG GATTGCCGCG GGCAGCCACA AGGCCGGGCC ACCGGCCCTT GAAACTGTCC GGTCAGATTC TTCCGGAATG GTGATCGATC TTGATATTCC CGGTCTCCAT ATAACGGAAA CCCTGCGCGA CGGCATGGTG TACCACGGCA TATCAGTGCC GGGCGGGGGC CGTCTTTCCG GTATCGGCAA ACCGTCCCTG CCTTTTGTCA GCCGGTATGT GGCGGTGCCG CAAGGCGCAA CCGCTTCTGT CAGGGTGATG GATGCCCGGT TTGAGGAGAT GACCGGGTAT AACGTGGTTC CGGCCCAGGC CCCACTGCCG GAAAGCAACA CCGCAAAGGG CCCCCCTTTT GAAAAGGACC GGGTCGCCTA CGGAGAAAAC GGGTTTTTCC CACGGCAGGT GGCTCAACTG GAGGGGCCTG TCTCCATTCG GGGGTGTGAA ACATCCCTGC TGCAGCTCTT CCCTGTGCAG TTCAATCCGG TCTCCCAAAC CCTGAGGGTC TATTCCCGCA TCACCGTTCA GTTGACTTTT GACGGGGGAA CCCGTCGCTT TATCGACCGG CGGAAGCACT CCCGGTCCCT GGCCCCGGTT TTTGAGGGCC TGTTTTTAAA TGCCCCCCTG GAGATGGACA GCCCACCCCT CACAAAAGCC GATGACCGGT CAGTAACCGC CAAAAGCGCC ACCGACGATG CCGCGCTTCT GTTGATTATC TCTCCGCCGG AACTGGTCGC CGCAGCCGAC CGGCTGGCCA ACTGGAAACG GGCCCAGGGT ATTTTGACCG AGGTGCGGAC CACCGGCCAG ACCGGGACCA CCGCAGCAGA AATACAGGCG TTTATTCAGG ATGCCTACGA TACATGGTCC ATACCCCCCT CTTATGTGCT GCTGGTCGGG GATGTGGAGT TCATTCCCAC TCACCGGGGG GATGGGTGCG GCACCGATCT GTATTATGCC ACGGTGGACG GGGACGACTA TTTTCCGGAT TTGAGCCTGG GCCGGCTCTC CGTGGACACA CTGGATCAGG CCATCAAGCG GGTTGAGGAT ATTATCCGGT ATGAACTTTC TCCGCCTGCC GGGGAGGGGT TTTATCAGAA CGCGGCCATT GTGGCCTATT TCCAGGATAC CAGCCCTCCT TACAATTATG CGGACCGCCG GTTTCTCCAG TCCGCCGAGG ATATGGCCCT CTTTTTTTCT GACCCGGCCT ACCTGAACGC CTACGATGTG GATCGGATTT ACTATACCGG ATACCGATCT CCCCAGAACT GGAATAACGA TTCCTGGAAT TTCGGCACTA CGGGCGTGCT TTCCGGCGGC CCCGGCGATT CGATACCGTT CTATCTGTTG GAGAGCAACG GCTTTGCCTG GGACGGCGAT GCCGTTGACA TATCAACGGC CGTAAACGCG GGCCGTTTTC TGGTAACCTA CAGGGGTCAT GGCCAGACCA GCCGGTGGGA CAGTGTCACC TATACCACCT CCGATGTTTC CGGCCTGCTA AATCAGGACC TGCTGCCCGT GGTATTCAGC GTCACCTGCC TGTCCGGCAA GTTTGACATG GAAAGTGTGG GTAACAACAC CCCCTGTTTT TCCGAATCAT GGGAGCGCAA CCCGGACGGC GGGGCCGTCG GTGTTGTGGC CGCCTCTGAA ACCACCTACA GCGGCCACAA CGACCGCCTG TTCTGGGGGT GGATGGAGGC ACTGTGGCCC AACTTTCCGG AGGACTACCA TCCTTCAGAC 
ACGCCGTTTG ACCAGCCTGC ATGGGAAATG GGCCCGGTGT TCAACTACGG CAAATACTAC TATGCCACCT GGTATGAAGA AGAACGTTAC CGAAAACTGG AGTTTGAGCG CTTTCACTGG TTCGGCGACC CCACCATGCG GCTCTGGACA GGGGTTCCTC AGGATTTGAC GGTTTCCGAT TATGCCATTG ACGCCGGGTC TGGGGGCCTG GAGATCACCC TGGGCCAGGC CGGAGCCGTG ATCTGCGTTT CGCGGGACGG CGTTATCCTG GGCAAGGGCG TCTCTTCCGG CGGCACTGAT CTTGTGGCCT GTGCCCCGCC CCTGGCAGCC GATGATGCCA TCCTTGTGGT TGTTACCAAA CCGAATTTCC GGCCCTTTGT GGCTGTGACC GAAGAAGACG CGGATGCGCT GCCCACGCTG CTGGAAATCC TCACCGGCAC CAGCCCCCAT GACGGAGACA CCGATGACGA TGGCATTGCC GACGATGTGG AGGATGGCAA TTTAAACGGC CTGGTGGATG AAGATGAGAC CGACCCCCGG AACATCGACA CCGACGGCGA CGGGATCCAG GACGGCACGG AGAAGGGGAA AACCTTGGCT AATATTCCCG ATGATACAAA CAGGGATGTA TTTGTGCCTG ACCTTGACGA CACCACCACC ACCGATCCGC TCAATTCTGA CACCGACGGA GACTGCGCGT TCGACGGCCA GGAAGATGCC AACGCCAACG GCCGGTTGGA CGATGATGAA ACCGATCCGA ATGTTTACGA TAACCTGGCT CCGCCGGTGG CCAATGCCGG TGCCGCTCAG TCCGTTCGGG AGGGGACAAC CGTCCGGCTG GACGGGGCCG GCTCATACGA TGCCTGTCAG ACGCTGCTCT CTTTTTTCTG GGAGCAGGTG TCCGGGCCGT CAGTAACCCT TTCCGACTCC GCAACGTCCC GTCCCACCTT TACCGCGCCG ACGGTTGGTT CAGCCGGGAC TGCGCTGGTG TTCCGTCTGA CCGTGGATGA TGGTGATTTT TTCGATACCG ATACCTGCCA GGTAACGGTT ACCGACACAC CGCCGCCGTC TGAGGATGAT GACGACAGCG ATGACGATGA TGATAATGAT GATAGCGATC CGCCCCCCGC GGAGGGCGGT GGGGGTGGTG GCTGTTTTGT CGAAACCGTG CTGTCGTTGA AGTAA
|
Protein sequence | MKKTGFRQLV FGLVLSFWLL GPAHAGWIAA GSHKAGPPAL ETVRSDSSGM VIDLDIPGLH ITETLRDGMV YHGISVPGGG RLSGIGKPSL PFVSRYVAVP QGATASVRVM DARFEEMTGY NVVPAQAPLP ESNTAKGPPF EKDRVAYGEN GFFPRQVAQL EGPVSIRGCE TSLLQLFPVQ FNPVSQTLRV YSRITVQLTF DGGTRRFIDR RKHSRSLAPV FEGLFLNAPL EMDSPPLTKA DDRSVTAKSA TDDAALLLII SPPELVAAAD RLANWKRAQG ILTEVRTTGQ TGTTAAEIQA FIQDAYDTWS IPPSYVLLVG DVEFIPTHRG DGCGTDLYYA TVDGDDYFPD LSLGRLSVDT LDQAIKRVED IIRYELSPPA GEGFYQNAAI VAYFQDTSPP YNYADRRFLQ SAEDMALFFS DPAYLNAYDV DRIYYTGYRS PQNWNNDSWN FGTTGVLSGG PGDSIPFYLL ESNGFAWDGD AVDISTAVNA GRFLVTYRGH GQTSRWDSVT YTTSDVSGLL NQDLLPVVFS VTCLSGKFDM ESVGNNTPCF SESWERNPDG GAVGVVAASE TTYSGHNDRL FWGWMEALWP NFPEDYHPSD TPFDQPAWEM GPVFNYGKYY YATWYEEERY RKLEFERFHW FGDPTMRLWT GVPQDLTVSD YAIDAGSGGL EITLGQAGAV ICVSRDGVIL GKGVSSGGTD LVACAPPLAA DDAILVVVTK PNFRPFVAVT EEDADALPTL LEILTGTSPH DGDTDDDGIA DDVEDGNLNG LVDEDETDPR NIDTDGDGIQ DGTEKGKTLA NIPDDTNRDV FVPDLDDTTT TDPLNSDTDG DCAFDGQEDA NANGRLDDDE TDPNVYDNLA PPVANAGAAQ SVREGTTVRL DGAGSYDACQ TLLSFFWEQV SGPSVTLSDS ATSRPTFTAP TVGSAGTALV FRLTVDDGDF FDTDTCQVTV TDTPPPSEDD DDSDDDDDND DSDPPPAEGG GGGGCFVETV LSLK
|
| |
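The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, assuming the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the standard amino-acid assignments apply, which translation table 11 shares with the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Alternative start codons permitted by table 11 are not handled, which does not matter for a gene that starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGAAAAAAA CAGGTTTTCG GCAGTTGGTT"))  # MKKTGFRQLV
```

Applied to the full gene sequence, `translate` would be expected to reproduce the protein sequence listed above, and `gc_fraction` to give a value close to the reported GC content; neither result has been computed here.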