Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_2077 |
Symbol | |
ID | 4663841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 2414482 |
End bp | 2417430 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639820320 |
Product | hypothetical protein |
Protein accession | YP_967520 |
Protein GI | 120603120 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG5459] Predicted rRNA methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.183603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAC TCTTCAAACC GCTCGATGCC GAGCTTCGCA AGGTGATGGA CGCCCTGCCC GAAGCCATCG ACGCAGTCAT GCCGCTGCGC AGCGCGCATC GCCGCGACCT GCCCAACGCC ATACAGGAAC TTTCGGCTTC GCTCACCTCC GAGCGCGGCA ATCTCTCGGA ATCGTACTGG GCCTCGCCCC GCCTCTTCTC GGCCTATGTT CGGCACTTCC TGCCTTGGAA CGTGTTCCGG CTGGGCCGCC TGCTACCCTC GCTCGACCTG CCCACCCCCG AACAATGCGC CGGGGGGCAT GTGCTGGACG TGGGCAGCGG CCCCTTCACC CTGCCCATCG CCCTGTGGAT TGCGCGCCCC GAATTGCGCA CTGTCGCGTT GACGCTCACC TGCGGCGACA TCTCGCCCCA CGTGCTTGAT GTCGGACGCG CCCTGTTCCG CAAGATTGCG GGCGAAGAAT GCCCTTGGAC GCTGCACACG CTGCGGGGCA CCGTCGAGAA CGCCCTGCGC GAGAACTATG GCAAGAACAT CCTCGTCATG GGCGGCAACG TCCTTAACGA ACTGCAACCC CGCGCCGACC AGCTGCTCGA AGACAGGCTC TTCGAAGTGG CCGAGGCACT CCGCGACAGT CTCGCCCCCG GCGGCAAGGC TCTGTTCGTC GAACCGGGGT CCCGCCTTGG CGGCAAGTTG GTGACCATGC TCCGTCAGGC AGCCATCGAG AACGGACTCG CGCCCGAAGC GCCCTGCCCG CATCATGATG AATGCCCCAT GCTCGCCGCC AAAAGGTCGA GCTGGTGTCA CTTCACCTTC AGCGTGCACG ACGCGCCGCA GTGGCTCACT GACATTTCGC GTGAAGCGGG CTTCGAACGC GACACCGCGA GCCTCAGCTT CGTGTTCCTG AGTCATGCCA CCACCCCGCC CGCAGGAGAC ATGGTGCGTG TCATCTCTGC CGTCTTCCCC ATTCCGGGAC GCCGCGAGCC CGTACGCTAC GGTTGCAGCG CAGAAGGCCT GACCCTGCTG TACGATGCGC GCCCGCTCAT CTCCGGCGCG CTGGTCAAGG CCGCATGGCC TGAAGAACCG CGCACCGATG GCAAGTCCGG CGGACTTTTC GCGGGTTGGG TCGATGACGA AGGTGTCGTT CAGGGACTTG TCCCGAACGC GGCGAACGCC GAGGACTCTG ATGACGAGGG CTTCGCAGAC GGCGAGCGCG TCGGCTTTTC GAGAACTCCC AAGAACTTCG CCCGCAACGA CGAAGGACGC TCCGACCGGC GCGGAGGCGG CGAACGCAGG AACTTCGGCG ACAGGCCGCA GGGCGGCTTC CGCCGTGAAG GCCGTGACGA CCGTGACCGT GGCGGTGAGG GTGGGTTCAG GGACCGCAAG CCCGCCTTCA GACGCGAGGG TGGCGAAGGG TTCGACCGTA AGCCCGCCTT CCGCAGGGAC GGTGACGACT TCGGACGCAA GCCCCGCGAA TTCGGCGACA GGCCGCAGGG CGGTTTCCGC CGTGAGGGCC GTGACGACCG TGACCGCGGC GGCGAGGGTG GGTTCAGGGA CCGCAAGCCC GCCTTCAGGC GCGAGGGCGG CGAAGGGTTC GACCGCAAGC CTGCCTTCCG CAGGGACGGT GACGACTTCG GACGCAAGCC CCGGGAATTC GGCGACAGGC CGCAGGGCGG TTTCCGCCGT GAGGGCCGTG ACGACCGTGA CCGCGGCGGC GAAGGTGGGT TCAGGGACCG CAAGCCCGCC TTCAGGCGCG AGGGCGGCGA AGGGTTCGAC CGTAAGCCCG CCTTCCGCAG GGACGGTGAC GACTTCGGGC GCAAGCCCCG CGAATTCGGC GACAGGCCGC AGGGCGGCTT CCGCCGTGAG GGCCGCGACG ACCGCGGCGA CCGCGGCGGC GAGGGTGGGT TCAGGGACCG CAAGCCCGCC TTCAGGCGCG AGAGTGGCGA AGGGTTCGAC CGTAAGCCCG CCTTCCGCAG GGACGGTGAC GACTTCGGGC GCAAGCCCCG GGAATTCGGC GACAGGCCGC AGGGCGGCTT CCGCCGTGAG GGCCGTGACG ACCGTGACCG CGGCGGCGAG GGTGGGTTCA GGGACCGCAA GCCCGCCTTC AGGCGCGAGG GTGGCGAAGG GTTCGACCGT AAGCCCGCCT TCCGCAGGGA CGGTGACGAC TTCGGGCGCA AGCCCCGGGA ATTCGGCGAC AGGCCGCAAG GCGGCTTCCG CCGTGAGGGC CGTGACGACC GTGACCGCGG CGGCGAAGGT GGGTTCAGGG ACCGCAAGCC CGCCTTCAGG CGCGAGGGCG GCGAAGGTTT CGACCGTAAG CCCGCTTTCC GCAGGGACGG TGACGACTTC GGACGCAAGC CCCGCGAATT CGGCGACAGG CCGCAGGGCG GCTTCCGCCG TGAAGGCCGT GACGACCGTG ACCGCGGCGG CGAGGGTGGG TTCAGGGACC GCAAGCCCGC CTTCAGGCGC GAGGGTGGCG AAGGGTTCGA TCGCAAGCCA GCCTTCCGCA GGGACGGTGA CGACTTCGGA CGCAAGCCCC GTGAATTCGG CGACAGGCCG CAGGGCGGCT TCCGCCGTGA AGGCCGTGAC CGCGGCGGCG AAGGTGGGTT CAGGGACCGC AAGCCCGCCT GCAGGCGCGA GGGTGATGCC GAAGGTGTGC GCAAGCCCCC GTTCAGGCAT GACCCGCTTG GTTCCGGCTT CGGGCGCAAG CCCCGCGACG AGGGGGGCAA CGGTCCCCAG CGCGACCGGG ACGGTGGCAA GGGTGGCTTC GGTGACCGCA AGCCCCCGTT CCGTAAAGAG GGCTTCGGCG GCGACAGGCA CGGCGGGCCT CGCTTCTCGC AGCATCAGCC CAGACCCCAC CGTAAGGGCG AAGGCCCTGC CGGTGCTCCT CACCGACGCA TGAAACGCGA CGATGAGGGC GGCAACGGTG GTAACGGCGA AGGCCATGCT GACGATTGA
|
Protein sequence | MNRLFKPLDA ELRKVMDALP EAIDAVMPLR SAHRRDLPNA IQELSASLTS ERGNLSESYW ASPRLFSAYV RHFLPWNVFR LGRLLPSLDL PTPEQCAGGH VLDVGSGPFT LPIALWIARP ELRTVALTLT CGDISPHVLD VGRALFRKIA GEECPWTLHT LRGTVENALR ENYGKNILVM GGNVLNELQP RADQLLEDRL FEVAEALRDS LAPGGKALFV EPGSRLGGKL VTMLRQAAIE NGLAPEAPCP HHDECPMLAA KRSSWCHFTF SVHDAPQWLT DISREAGFER DTASLSFVFL SHATTPPAGD MVRVISAVFP IPGRREPVRY GCSAEGLTLL YDARPLISGA LVKAAWPEEP RTDGKSGGLF AGWVDDEGVV QGLVPNAANA EDSDDEGFAD GERVGFSRTP KNFARNDEGR SDRRGGGERR NFGDRPQGGF RREGRDDRDR GGEGGFRDRK PAFRREGGEG FDRKPAFRRD GDDFGRKPRE FGDRPQGGFR REGRDDRDRG GEGGFRDRKP AFRREGGEGF DRKPAFRRDG DDFGRKPREF GDRPQGGFRR EGRDDRDRGG EGGFRDRKPA FRREGGEGFD RKPAFRRDGD DFGRKPREFG DRPQGGFRRE GRDDRGDRGG EGGFRDRKPA FRRESGEGFD RKPAFRRDGD DFGRKPREFG DRPQGGFRRE GRDDRDRGGE GGFRDRKPAF RREGGEGFDR KPAFRRDGDD FGRKPREFGD RPQGGFRREG RDDRDRGGEG GFRDRKPAFR REGGEGFDRK PAFRRDGDDF GRKPREFGDR PQGGFRREGR DDRDRGGEGG FRDRKPAFRR EGGEGFDRKP AFRRDGDDFG RKPREFGDRP QGGFRREGRD RGGEGGFRDR KPACRREGDA EGVRKPPFRH DPLGSGFGRK PRDEGGNGPQ RDRDGGKGGF GDRKPPFRKE GFGGDRHGGP RFSQHQPRPH RKGEGPAGAP HRRMKRDDEG GNGGNGEGHA DD
|
| |