Gene Dvul_1975 details


Gene Information

Locus tag: Dvul_1975
Symbol:
ID: 4663419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 2294274
End bp: 2295494
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 61%
IMG OID: 639820216
Product: hypothetical protein
Protein accession: YP_967418
Protein GI: 120603018
COG category: [S] Function unknown
COG ID: [COG3672] Predicted periplasmic protein
TIGRFAM ID:
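
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2294274, 2295494)
print(length)                  # 1221
print(protein_length(length))  # 406
```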


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.495543
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAGG CGCGGGAGAG ACAGTGGCAG GAGGGGCCGT ATGCGCGGTT CTCCTTTTGT 
GCGGCGGTGG TCGTACTGCT CTGTCTCTTC GTGCCCACGT TCTTTCTGGG GCTTCCGGGA
GATACCGAAG CTGGGGGTGC CAGTCCGAAA GCAGCCCCCT ATGAGCCGCG CGATGGCGAG
GCTGTAACGC GGGGCGAGTC TGAACAGCGA GCGTCTTCCC GTCATGTCAT CGGACAGGCC
CCAGACCCGC AGCAGGCTCC AGATGCACGC GTCATTGCAT CGGAAGCGCC TGCTTCACCA
GCCGATACAG ATGTCCGGGG GGAGCGGAAG GTCCCGGGCA CCACTGGGGA GGACGCTTCC
AGCAACCGGC CTGAAGTCTC ACCGGTGCTT GAGGTCGCAG CTGACGGCCC GCTGCGTCAT
GTACGCCCGG AGGACATGCG AGGCGGTGCG CAGCAGTCCC CGGTGCAGCC TTCGGGACAG
CGGGGGCACC GGGCAGGTGC GACTTCGGAT GGGACACCGG ATGGTGGGGA ACATTCCGGG
AATGCCGCGA CTGCCGAAGA TGGAGAAGGG CAGGGAGTGG AGGCGGCCCG GCCGTCGAGG
GACGCTCCTG CCAGAGGGCA GTCCTCTGCG TCCTCTACAG CGGCGACCGG GGTGCGACTG
TTCGGAACCA TAGAATTCAG GGGCCAGTTG AAGGCTCTTC CGAAATGGTC GCGGGTGGTC
GAGACGGAAC GCAAGAAACC CGGACTGTAT CTGGACAGGG CTCTTGGCGG CAAGGGCGGG
CAGGTCTGGC GGGAGTTGCG TGGCGAATGG CAGGGCTTGC CGCTGATGGA GAGGTTGAAG
AAGGTCAACA CGTTTTTCAA CCAGTGGCCG TACAGGCTTG ATAGTGAAAA CTACGGATTG
CCGGACTATT GGGCGACGCC CGACGAATTC CTCAGAAAGT CCGGTGACTG TGAGGACTAT
AGCATCATCA AGTATTTCGC CCTGAAGCAG CTAGGGGTTT CTGCCGATTC GATGCGCATA
GTCGTTCTGC TTGACAAGAT CAGGGGCATT GCCCACGCAG TGCTGGCCGT CTACGATGGT
AATACGGCGT ATATACTTGA TAACCTTTCC GGACTCGTGC TGGCTCATGA TTTCTACAAG
CACTACGTCC CCCAGTATTC GGTGAACGAA TCCTACAGGT GGGCACACAT CCCCCTCGGG
AAGAAAGCCG GGAGGAAATG A
 
Protein sequence
MAKARERQWQ EGPYARFSFC AAVVVLLCLF VPTFFLGLPG DTEAGGASPK AAPYEPRDGE 
AVTRGESEQR ASSRHVIGQA PDPQQAPDAR VIASEAPASP ADTDVRGERK VPGTTGEDAS
SNRPEVSPVL EVAADGPLRH VRPEDMRGGA QQSPVQPSGQ RGHRAGATSD GTPDGGEHSG
NAATAEDGEG QGVEAARPSR DAPARGQSSA SSTAATGVRL FGTIEFRGQL KALPKWSRVV
ETERKKPGLY LDRALGGKGG QVWRELRGEW QGLPLMERLK KVNTFFNQWP YRLDSENYGL
PDYWATPDEF LRKSGDCEDY SIIKYFALKQ LGVSADSMRI VVLLDKIRGI AHAVLAVYDG
NTAYILDNLS GLVLAHDFYK HYVPQYSVNE SYRWAHIPLG KKAGRK
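
The relationship between the two sequence blocks can be illustrated with a minimal translation sketch. It builds the codon table from the conventional TCAG ordering, removes the spaces used to group the bases in tens, and stops at the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so that distinction is not modeled here (the gene starts with ATG in any case). The names `CODON_TABLE` and `translate` are illustrative, and ambiguous bases such as N are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
print(translate("ATGGCAAAGG CGCGGGAGAG ACAGTGGCAG"))  # MAKARERQWQ
print(translate("AAGAAAGCCG GGAGGAAATG A"))           # KKAGRK
```

Passing the full gene sequence to the same function is expected to give the 406-residue protein sequence listed above.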