Gene Dvul_2077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2077 
Symbol 
ID4663841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2414482 
End bp2417430 
Gene Length2949 bp 
Protein Length982 aa 
Translation table11 
GC content69% 
IMG OID639820320 
Producthypothetical protein 
Protein accessionYP_967520 
Protein GI120603120 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG5459] Predicted rRNA methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.183603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGAC TCTTCAAACC GCTCGATGCC GAGCTTCGCA AGGTGATGGA CGCCCTGCCC 
GAAGCCATCG ACGCAGTCAT GCCGCTGCGC AGCGCGCATC GCCGCGACCT GCCCAACGCC
ATACAGGAAC TTTCGGCTTC GCTCACCTCC GAGCGCGGCA ATCTCTCGGA ATCGTACTGG
GCCTCGCCCC GCCTCTTCTC GGCCTATGTT CGGCACTTCC TGCCTTGGAA CGTGTTCCGG
CTGGGCCGCC TGCTACCCTC GCTCGACCTG CCCACCCCCG AACAATGCGC CGGGGGGCAT
GTGCTGGACG TGGGCAGCGG CCCCTTCACC CTGCCCATCG CCCTGTGGAT TGCGCGCCCC
GAATTGCGCA CTGTCGCGTT GACGCTCACC TGCGGCGACA TCTCGCCCCA CGTGCTTGAT
GTCGGACGCG CCCTGTTCCG CAAGATTGCG GGCGAAGAAT GCCCTTGGAC GCTGCACACG
CTGCGGGGCA CCGTCGAGAA CGCCCTGCGC GAGAACTATG GCAAGAACAT CCTCGTCATG
GGCGGCAACG TCCTTAACGA ACTGCAACCC CGCGCCGACC AGCTGCTCGA AGACAGGCTC
TTCGAAGTGG CCGAGGCACT CCGCGACAGT CTCGCCCCCG GCGGCAAGGC TCTGTTCGTC
GAACCGGGGT CCCGCCTTGG CGGCAAGTTG GTGACCATGC TCCGTCAGGC AGCCATCGAG
AACGGACTCG CGCCCGAAGC GCCCTGCCCG CATCATGATG AATGCCCCAT GCTCGCCGCC
AAAAGGTCGA GCTGGTGTCA CTTCACCTTC AGCGTGCACG ACGCGCCGCA GTGGCTCACT
GACATTTCGC GTGAAGCGGG CTTCGAACGC GACACCGCGA GCCTCAGCTT CGTGTTCCTG
AGTCATGCCA CCACCCCGCC CGCAGGAGAC ATGGTGCGTG TCATCTCTGC CGTCTTCCCC
ATTCCGGGAC GCCGCGAGCC CGTACGCTAC GGTTGCAGCG CAGAAGGCCT GACCCTGCTG
TACGATGCGC GCCCGCTCAT CTCCGGCGCG CTGGTCAAGG CCGCATGGCC TGAAGAACCG
CGCACCGATG GCAAGTCCGG CGGACTTTTC GCGGGTTGGG TCGATGACGA AGGTGTCGTT
CAGGGACTTG TCCCGAACGC GGCGAACGCC GAGGACTCTG ATGACGAGGG CTTCGCAGAC
GGCGAGCGCG TCGGCTTTTC GAGAACTCCC AAGAACTTCG CCCGCAACGA CGAAGGACGC
TCCGACCGGC GCGGAGGCGG CGAACGCAGG AACTTCGGCG ACAGGCCGCA GGGCGGCTTC
CGCCGTGAAG GCCGTGACGA CCGTGACCGT GGCGGTGAGG GTGGGTTCAG GGACCGCAAG
CCCGCCTTCA GACGCGAGGG TGGCGAAGGG TTCGACCGTA AGCCCGCCTT CCGCAGGGAC
GGTGACGACT TCGGACGCAA GCCCCGCGAA TTCGGCGACA GGCCGCAGGG CGGTTTCCGC
CGTGAGGGCC GTGACGACCG TGACCGCGGC GGCGAGGGTG GGTTCAGGGA CCGCAAGCCC
GCCTTCAGGC GCGAGGGCGG CGAAGGGTTC GACCGCAAGC CTGCCTTCCG CAGGGACGGT
GACGACTTCG GACGCAAGCC CCGGGAATTC GGCGACAGGC CGCAGGGCGG TTTCCGCCGT
GAGGGCCGTG ACGACCGTGA CCGCGGCGGC GAAGGTGGGT TCAGGGACCG CAAGCCCGCC
TTCAGGCGCG AGGGCGGCGA AGGGTTCGAC CGTAAGCCCG CCTTCCGCAG GGACGGTGAC
GACTTCGGGC GCAAGCCCCG CGAATTCGGC GACAGGCCGC AGGGCGGCTT CCGCCGTGAG
GGCCGCGACG ACCGCGGCGA CCGCGGCGGC GAGGGTGGGT TCAGGGACCG CAAGCCCGCC
TTCAGGCGCG AGAGTGGCGA AGGGTTCGAC CGTAAGCCCG CCTTCCGCAG GGACGGTGAC
GACTTCGGGC GCAAGCCCCG GGAATTCGGC GACAGGCCGC AGGGCGGCTT CCGCCGTGAG
GGCCGTGACG ACCGTGACCG CGGCGGCGAG GGTGGGTTCA GGGACCGCAA GCCCGCCTTC
AGGCGCGAGG GTGGCGAAGG GTTCGACCGT AAGCCCGCCT TCCGCAGGGA CGGTGACGAC
TTCGGGCGCA AGCCCCGGGA ATTCGGCGAC AGGCCGCAAG GCGGCTTCCG CCGTGAGGGC
CGTGACGACC GTGACCGCGG CGGCGAAGGT GGGTTCAGGG ACCGCAAGCC CGCCTTCAGG
CGCGAGGGCG GCGAAGGTTT CGACCGTAAG CCCGCTTTCC GCAGGGACGG TGACGACTTC
GGACGCAAGC CCCGCGAATT CGGCGACAGG CCGCAGGGCG GCTTCCGCCG TGAAGGCCGT
GACGACCGTG ACCGCGGCGG CGAGGGTGGG TTCAGGGACC GCAAGCCCGC CTTCAGGCGC
GAGGGTGGCG AAGGGTTCGA TCGCAAGCCA GCCTTCCGCA GGGACGGTGA CGACTTCGGA
CGCAAGCCCC GTGAATTCGG CGACAGGCCG CAGGGCGGCT TCCGCCGTGA AGGCCGTGAC
CGCGGCGGCG AAGGTGGGTT CAGGGACCGC AAGCCCGCCT GCAGGCGCGA GGGTGATGCC
GAAGGTGTGC GCAAGCCCCC GTTCAGGCAT GACCCGCTTG GTTCCGGCTT CGGGCGCAAG
CCCCGCGACG AGGGGGGCAA CGGTCCCCAG CGCGACCGGG ACGGTGGCAA GGGTGGCTTC
GGTGACCGCA AGCCCCCGTT CCGTAAAGAG GGCTTCGGCG GCGACAGGCA CGGCGGGCCT
CGCTTCTCGC AGCATCAGCC CAGACCCCAC CGTAAGGGCG AAGGCCCTGC CGGTGCTCCT
CACCGACGCA TGAAACGCGA CGATGAGGGC GGCAACGGTG GTAACGGCGA AGGCCATGCT
GACGATTGA
 
Protein sequence
MNRLFKPLDA ELRKVMDALP EAIDAVMPLR SAHRRDLPNA IQELSASLTS ERGNLSESYW 
ASPRLFSAYV RHFLPWNVFR LGRLLPSLDL PTPEQCAGGH VLDVGSGPFT LPIALWIARP
ELRTVALTLT CGDISPHVLD VGRALFRKIA GEECPWTLHT LRGTVENALR ENYGKNILVM
GGNVLNELQP RADQLLEDRL FEVAEALRDS LAPGGKALFV EPGSRLGGKL VTMLRQAAIE
NGLAPEAPCP HHDECPMLAA KRSSWCHFTF SVHDAPQWLT DISREAGFER DTASLSFVFL
SHATTPPAGD MVRVISAVFP IPGRREPVRY GCSAEGLTLL YDARPLISGA LVKAAWPEEP
RTDGKSGGLF AGWVDDEGVV QGLVPNAANA EDSDDEGFAD GERVGFSRTP KNFARNDEGR
SDRRGGGERR NFGDRPQGGF RREGRDDRDR GGEGGFRDRK PAFRREGGEG FDRKPAFRRD
GDDFGRKPRE FGDRPQGGFR REGRDDRDRG GEGGFRDRKP AFRREGGEGF DRKPAFRRDG
DDFGRKPREF GDRPQGGFRR EGRDDRDRGG EGGFRDRKPA FRREGGEGFD RKPAFRRDGD
DFGRKPREFG DRPQGGFRRE GRDDRGDRGG EGGFRDRKPA FRRESGEGFD RKPAFRRDGD
DFGRKPREFG DRPQGGFRRE GRDDRDRGGE GGFRDRKPAF RREGGEGFDR KPAFRRDGDD
FGRKPREFGD RPQGGFRREG RDDRDRGGEG GFRDRKPAFR REGGEGFDRK PAFRRDGDDF
GRKPREFGDR PQGGFRREGR DDRDRGGEGG FRDRKPAFRR EGGEGFDRKP AFRRDGDDFG
RKPREFGDRP QGGFRREGRD RGGEGGFRDR KPACRREGDA EGVRKPPFRH DPLGSGFGRK
PRDEGGNGPQ RDRDGGKGGF GDRKPPFRKE GFGGDRHGGP RFSQHQPRPH RKGEGPAGAP
HRRMKRDDEG GNGGNGEGHA DD