Gene Dvul_0487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0487 
Symbol 
ID4664902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp619203 
End bp620519 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content49% 
IMG OID639818697 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_965937 
Protein GI120601537 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.700794 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0187002 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTTC CAGCGTATCC CGAATACAAG GACAGCGGTG TGGAGTGGCT GGGGAAAATC 
CCAAGCCATT GGAGTGTTAC GTCTCTGTAT AGCTTAGCCT CTGAGTGTGA TTTTCCCAAC
AAAGACATGC TTGAAAGTAA TCTACTATCA TTGAGTTACG GTCGCATCAT CAGAAAAGAC
ATAAACTCAA ATGATGGCCT TCTCCCAGAA TCTTTCGAGA CATATCAAAT TGTAGATCAT
GGCGACATTG TGCTCAGACT GACAGATCTT CAAAATGACC AACGAAGCTT GAGATCAGGC
CTAGTTAAAG AAAGAGGAAT AATAACTTCA GCATACACAG CCATACGTCC AACAGCCTCA
CACTATTCAT ATCTCGCATA TTTGCTGCGA GCATACGACA CGCTAAAAAT CTTTTACTCA
ATGGGTGGCG GCCTCAGACA ATCAATGAAA TTCTCCGACT TACGTCGTTT ACCAATACTT
AAGCCAGCAT ACAGTGAACA ATCCGCCATC GCCGTCTTCC TCGATCATGA GACTGCCAAG
ATTGATGCCC TGATTACCGA GCAAGAGAAG CTGATTGAAC TCCTGAAGGA GAAACGTCAG
GCAGTCATCT CCCATGCTGT TACCAAGGGG CTTGCCCCCA ATGTGCCCAT GAAGGACAGC
GGCGTGGAGT GGCTAGGAGA AGTGCCGGAG CATTGGAAAG TGGCAAAGCT TCGGCGTTTT
GTCCGGGCCG TGCAAACTGG CAGTACGCCA TCGGCTTCTC CTCCAAACAC CGATATCGAG
GATGGAACCT ACTGGTTCAC TCCCGGCGAT TTTTCAGGTC CGATCCGTCT TGGAAGCTCA
TCCAAAAAAG TACCTCCTGA AGCGATTAAG CAAGGAGAGG TAAAAGTTTT CCCTGCGGGT
GCTGTCTTCG TTGTTAGCAT CGGTGCTACG CTTGGCAAGA TCGGCTATCT CCTGACTCTG
GCCTCCGCGA ACCAGCAAAT CAACGCAATA ATACCAAATG CGGATGTCGA AGGGCTGTTT
TTGGCATACT CACTATCGTC TAAAACTTCT GAAATGATGA ACCTGTCGAA CGCGTCAACT
ATTGGGATCA TGAATCAAGA GAAGACAAAG GAAATATGGC TCACAGTTCC TCCTCTCTGC
GAGCAGGAGA GAATCACTAA ATTCCTTGAT GAAGACTGCG TAACTTCCGA TGCCCTCGTC
AACGAGTCAC AACGCGCCAT CGACCTCCTC AAAGAACGCC GTTCAGCACT CATTTCCGCC
GCCGTTACAG GCAAGATAGA CGTGCGGGGC TTCGCCCCCG TTTCGGAGGC TGTATGA
 
Protein sequence
MTFPAYPEYK DSGVEWLGKI PSHWSVTSLY SLASECDFPN KDMLESNLLS LSYGRIIRKD 
INSNDGLLPE SFETYQIVDH GDIVLRLTDL QNDQRSLRSG LVKERGIITS AYTAIRPTAS
HYSYLAYLLR AYDTLKIFYS MGGGLRQSMK FSDLRRLPIL KPAYSEQSAI AVFLDHETAK
IDALITEQEK LIELLKEKRQ AVISHAVTKG LAPNVPMKDS GVEWLGEVPE HWKVAKLRRF
VRAVQTGSTP SASPPNTDIE DGTYWFTPGD FSGPIRLGSS SKKVPPEAIK QGEVKVFPAG
AVFVVSIGAT LGKIGYLLTL ASANQQINAI IPNADVEGLF LAYSLSSKTS EMMNLSNAST
IGIMNQEKTK EIWLTVPPLC EQERITKFLD EDCVTSDALV NESQRAIDLL KERRSALISA
AVTGKIDVRG FAPVSEAV