Gene Dvul_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1002 
Symbol 
ID4663064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1232805 
End bp1234457 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content68% 
IMG OID639819226 
Producthydantoinase/oxoprolinase 
Protein accessionYP_966450 
Protein GI120602050 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.184636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCTCG GCATAGACGT GGGCGGCACC CACACCGACG CCGTCGTCAT GGACGGGCGC 
AGGGTGGTGT GCAGTTGCAA GGTGCCCACC GACCACCACG ACCTGCTGTC GTCGGTGCGG
CAGGCCATGC GAACCTTGCT GGAGACGGTG GAACCTGCCG CCGTGACACG CATCAACCTC
AGCACCACCC TCTCCACCAA CGCCATCGTC GAGGGACGCA CCGAGGAGGT GGGCGTGGTG
GTGGCTGGCG GCCCGGGCAT CGACCCGGAG CATGTGCGCG TGGGGCGCTT CTACCAGTCC
GTACCCGGTT CCATCGACCA TCGGGGTGTC GAGATTGCCG GCATCGACGC CGATTCGCTG
GACGCAGTGG CTGGTCGCTA CTGTCGCGAA GGCGTGCGCG TCTACGCAGC CGTGGGCAAG
TTCTCCACCC GCAACCCCGC GCACGAACAG GCCATCGGGG GCAAGCTCAT GGAGTGCGGC
GACTTCGTCT CCATGGGGCA CCGGCTTTCG GGGCAGCTCA ACTTTCCGCG CCGCGTGGCC
ACGGCGTACT ACAACTCCGC CGTGTGGCGG GTGTACAACG CCTTCGCGGA CGCCATCGAG
AACGCCGCGC GTGAGATGGG GCTTGCCGCG CCCATCCATG TGCTCAAGGC CGATGGCGGC
ACCATGCCTC TCGCCGTCTC GCGCGAGGTG CCTGTCGAAT CCATCCTCTC CGGCCCGGCC
GCCAGTGTCA TGGGCATCGT GGCCCTGTGC GACATCCGTG AGGACTGCGT CATCCTCGAC
GTGGGCGGCA CGACCACAGA CATCGCGGTG TTCGCCTCCG GCGCCCCGGT CATCGAACGC
GACGGCATCG ACGTGGGCAG CTACGCCACG CTGGTGCGTG CCTTGCGCAC CCATTCCATC
GGTGTGGGGG GCGACTCGCG CCTTCATGTG CAGGCCGGGG CGGTGCGTGT GGGACCCGAA
CGCATGGGTC CCAGCATGGC ACTGGGCGGC ACGCAGCCGA CCCTCATCGA CGCGCTCAAT
TACATGGGCA AGGCCCACGT GGGCGAGGTG GAGGCCTCCC GGCGCGGCAT CCGTGACCTG
GCGGCACTGT GGGACATGTT CCCCGAACGT CTTGCGTCCG AAGCCGTGAC CACCGCCGTA
CGCCGCATCG CGGAAGCGGT GACGGAACTC GTCGACGCCA TCAACGCCCG CCCGGTGTAC
ACCATCATGG AACTCTTGCA GGGGCGTCGG GTCGACCCCG TGCGCGCCTA TGTCATGGGC
GGCCCCGCCG AGGTGCTGCG CCCCCTGCTG GCTGATGCAC TGGGCCGCCG GGTGGAGGTG
CCAGAACAAT ACGCCGTGGC GAACGCCATT GGTGCCGCGC TGACCCGCAC GACGGCTGAA
CTCGAACTTT TCGCCGATAC CGAGAAGGGT GTGCTCTTCA TCCCCACCCT CGACGTACGC
CGCGAAGTGG GGGCCCGCTA CGACATCGAA GCCGCCAAAC GTGATGCCGG GCAGGCGCTG
CTTGAGCATC TCGCCTCGCT TGGGGTGGAT GACGACGAGG CGGCTGTGGA GGTGGTGGAA
GCCACGTCGT TCAACATGGT GGACGACTAT GGCACGGCGG GGCGCAACAT CCGGGTCAAA
TGCCAGGTGC GGCCCGGCAT TCTGGGGGCG TGA
 
Protein sequence
MFLGIDVGGT HTDAVVMDGR RVVCSCKVPT DHHDLLSSVR QAMRTLLETV EPAAVTRINL 
STTLSTNAIV EGRTEEVGVV VAGGPGIDPE HVRVGRFYQS VPGSIDHRGV EIAGIDADSL
DAVAGRYCRE GVRVYAAVGK FSTRNPAHEQ AIGGKLMECG DFVSMGHRLS GQLNFPRRVA
TAYYNSAVWR VYNAFADAIE NAAREMGLAA PIHVLKADGG TMPLAVSREV PVESILSGPA
ASVMGIVALC DIREDCVILD VGGTTTDIAV FASGAPVIER DGIDVGSYAT LVRALRTHSI
GVGGDSRLHV QAGAVRVGPE RMGPSMALGG TQPTLIDALN YMGKAHVGEV EASRRGIRDL
AALWDMFPER LASEAVTTAV RRIAEAVTEL VDAINARPVY TIMELLQGRR VDPVRAYVMG
GPAEVLRPLL ADALGRRVEV PEQYAVANAI GAALTRTTAE LELFADTEKG VLFIPTLDVR
REVGARYDIE AAKRDAGQAL LEHLASLGVD DDEAAVEVVE ATSFNMVDDY GTAGRNIRVK
CQVRPGILGA