Gene Information |
Locus tag | DvMF_3084 |
Symbol | |
ID | 7175030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 3895393 |
End bp | 3898089 |
Gene Length | 2697 bp |
Protein Length | 898 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643541621 |
Product | ATP-dependent protease La |
Protein accession | YP_002437489 |
Protein GI | 218888168 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0466] ATP-dependent Lon protease, bacterial type |
TIGRFAM ID | [TIGR00763] ATP-dependent protease La |
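
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in Protein Length. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the record above.
start_bp = 3895393
end_bp = 3898089
gene_length_bp = 2697
protein_length_aa = 898


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both computed values should match the fields listed in the record.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions the listed Gene Length (2697 bp) and Protein Length (898 aa) are consistent with the listed coordinates.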
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 0.981667 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACG ACATATACGA CGACGGCCCC CCCGACGAAT CCGGCCAAAC CACCGGCAAG CCGCAAAAGA AGCAGGACTC GGCGTCCGCA GGCCCTGCCG CCGATGCGGC CTCCGGTCCG GCCCCCGACG CCGCGCCCGA CGCCGCAAAT GGCGTCACCG TCATCGCCGT CCAGACCGGC GACCAGGCTG ACCCAGAACA GGTTGCACCG GATCAGCCCG CTCCGGGCAA GGCTGCCGGC ATCGAGGTTG CCGTGTCCAC GCAGCCGCGC GCCGGTGACG ATGCCCGCGC CCTGGATTTC GACGGCCTGA GCGCCCCCGA CGCGGACAAT CCTGGCGGCT CCGAATCTTC GGAGACGCCC GAGGGTCCGG AACTGCCCGT GCTGCCAGAC GAACTGCCCG TGCTGCCCGT GCGCGACGTG GTGGTGTTCA ACTACATGAT CCTGCCCCTG TTCGTGGGCC GCGAAAAGTC GGTGCAGGCC GTGGACGCCG CCCTCAACGG CAGCCGCTAC CTGATGATCT GCACCCAGCG CGACGAATCG GTGGACGACC CCGCGCCGGA AGACCTGCAC CCCACCGGCA CGGTTGTCAT GATCATGCGC ATGCTGAAGA TGCCCGACAA CCGGCTGAAG GTGCTGGTGC AGGGCATCAG CCGCGCCCGC GTGGAATCCT TCGGCGCGGG CGAGGGCTAC CTTACCGCCC GCGTGGAAAC CCTGCCCGAG CCGGAACTGG GCCCGCCCAC GGTGGAGCAG GAAGCCATGA TGCGCGCCGC CCGCGAACAG AGCGAAAAGA TCCTTTCGCT GCGCGGCATC GCCACCTCGG ACATCATGGC CGTGCTCAAT TCCGTGGACG ATCCGGGCCG CCTTGCCGAC CTCATCGCCG CCAACCTGCG CATGAAGGTT TCCGACGCGC AGGCCATCCT GGAATGCACC GACCCCGACG CCCGGCTGCG CCTCGTCAAC GAGCAGCTGG TCAAGGAAGT GGAAGTGGCG TCCATGCAGG CCAAGATACA GAGCATGGCG CGCGAGGGCA TGGACAAGGC CCAGAAGGAC TACTTCCTGC GCGAGCAGAT GAAGGCCATC CGCCGCGAAC TGGGCGAATC GGGCAACGAG GACGAGGAAC TGGAAGACCT CACCCGCTCT CTGGAACGTT CCGGATTGCC GCGTGAAGTC CGCAAGGAGG CCGACAAGCA GTTGCGCCGC CTGGCCTCCA TGCACCCCGA TTCGTCCGAG GCCACCGTGG TGCGCACCTA CCTGGAATGG CTGGCGGAGC TGCCGTGGGC CAAGCTTTCG CGCGACCGGC TGGACATCAA CAAGGCCAAG GTGATCCTGG ACGAGGACCA TCTCGGCCTC GCCAAGGTCA AGGACCGCAT TCTGGAATAC CTCAGCGTGC GCAAGCTGAA CCCCAAGTCC AAGGGGCCCA TCCTGTGCTT TGCCGGGCCT CCCGGCGTGG GCAAGACCTC GCTGGGCCGC TCCATCGCCC GCGCCATGGG TCGCAAGTTC CAGCGCATCT CGCTGGGCGG CATGCGCGAC GAGGCGGAAA TCCGCGGCCA CCGGCGCACC TACATCGGCG CCATGCCGGG GCGCATCGTG CAGAGCCTGA AGCAGCTGGG CACGCGCAAC CCCGTGCTGA TGCTGGATGA AATCGACAAG ATCGGCTCCG ACTTCCGGGG CGACCCGTCA TCCGCGCTGC TGGAGGTGCT GGACCCGGAA CAGAACTTCT CGTTCAGCGA CCACTACCTG AACGTGCCCT TCGACCTGTC CAAGGTCATG TTCATCTGCA CCGCCAACCA GCTGGACACC 
ATTCCCCCGC CCCTGCGCGA CCGCATGGAG GTCATCTCCA TTCCCGGCTA CACCATGCAG GAAAAGCTGG CCATCGCCCG CCGCTACCTG CTGCCGCGCC AGGCCAGGGA AAACGGCCTG TCCCCGCGCG AGGTGACCGT GCCCGACGCG CTCATCGAGC GCATCATCAC CGGCTACACC CGCGAGGCAG GGCTGCGGAA CCTGGAGCGC GAAATCGGCT CGCTGTGCCG CAAGGTGGCC CGCCGCAAGG CAGAGGGCGA AAAAGGCCCC TTCCGGGTCA CCCCGCGCAT GCTGGAAAAG CTGCTGGGCG CGCCCCGCTT CATCGACGAG GAAAAGGAAG CCGAACTGCT GCCCGGCGTG GCCCTGGGCC TGGCCTGGAC CCCCTACGGC GGCGAGGTGC TGCACGTGGA AGTCACCCCC ATGAAGGGCA AGGGCGGCGT GACCATGACC GGCCAGCTCG GCGACGTGAT GAAGGAAAGC GCACAGGCCG CCATCTCTTA CGCGCGCAGC CGCGCCGAAC AGCTGGGCAT CGAGCCCGAC TTCTCGGAAA AGCTCGACCT GCACATCCAC GTGCCCGCGG GCGCCACCCC CAAGGATGGC CCCTCCGCCG GGGTGACCAT GGTCACCGCG CTGCTCTCCG CCATCACCGG CAAGTCCGTG CGCAGCGACC TGTGCATGAC CGGCGAGATC ACCCTGCGAG GCCGCGTGCT GCCCGTGGGC GGCATCAAGG AAAAGATCCT GGCCGGGGTG GCGCGCGGCA TGCAGCACGT CATTATCCCG CGCCAGAACG TCAAGGACCT GGAAGACATT CCCGCAGACC TGCTGCGCCG CATCCAGGTT CACCCCGTGG CCCACATCGA CGATCTGCTG CCTCTGGCGT TTCCGAAAGG CGAATAA
|
Protein sequence | MTDDIYDDGP PDESGQTTGK PQKKQDSASA GPAADAASGP APDAAPDAAN GVTVIAVQTG DQADPEQVAP DQPAPGKAAG IEVAVSTQPR AGDDARALDF DGLSAPDADN PGGSESSETP EGPELPVLPD ELPVLPVRDV VVFNYMILPL FVGREKSVQA VDAALNGSRY LMICTQRDES VDDPAPEDLH PTGTVVMIMR MLKMPDNRLK VLVQGISRAR VESFGAGEGY LTARVETLPE PELGPPTVEQ EAMMRAAREQ SEKILSLRGI ATSDIMAVLN SVDDPGRLAD LIAANLRMKV SDAQAILECT DPDARLRLVN EQLVKEVEVA SMQAKIQSMA REGMDKAQKD YFLREQMKAI RRELGESGNE DEELEDLTRS LERSGLPREV RKEADKQLRR LASMHPDSSE ATVVRTYLEW LAELPWAKLS RDRLDINKAK VILDEDHLGL AKVKDRILEY LSVRKLNPKS KGPILCFAGP PGVGKTSLGR SIARAMGRKF QRISLGGMRD EAEIRGHRRT YIGAMPGRIV QSLKQLGTRN PVLMLDEIDK IGSDFRGDPS SALLEVLDPE QNFSFSDHYL NVPFDLSKVM FICTANQLDT IPPPLRDRME VISIPGYTMQ EKLAIARRYL LPRQARENGL SPREVTVPDA LIERIITGYT REAGLRNLER EIGSLCRKVA RRKAEGEKGP FRVTPRMLEK LLGAPRFIDE EKEAELLPGV ALGLAWTPYG GEVLHVEVTP MKGKGGVTMT GQLGDVMKES AQAAISYARS RAEQLGIEPD FSEKLDLHIH VPAGATPKDG PSAGVTMVTA LLSAITGKSV RSDLCMTGEI TLRGRVLPVG GIKEKILAGV ARGMQHVIIP RQNVKDLEDI PADLLRRIQV HPVAHIDDLL PLAFPKGE