Gene Dvul_0658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0658 
Symbol 
ID4664456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp808582 
End bp810147 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content65% 
IMG OID639818869 
Producthemerythrin HHE cation binding domain-containing protein 
Protein accessionYP_966108 
Protein GI120601708 
COG category[S] Function unknown 
COG ID[COG2461] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTTT CGGCTACCAC ATCCATACAT GACCTCGTCA CATCGCATCC CTATCTCATC 
GAAGTGCTGG CAGACTACGC ACCAGCCTTC GCCAAGCTGC GCAACCCGCT GCTTCGCAAC
ACACTGGGGC GTGTGGCGAC CCTCCAGCAA GCCGCAGACC TAGCGGGGCT TGAACTCACC
GGGCTCATGG CCCACCTCGC ACGCGCCATC ATGGAGCATA CGAAAGAGGC CGTAACGCTC
GTCCCGCGCG GGGCCGAAAC GCCCGCACAT GCCGTGGCGG ACGGTGACAG CGGTTGCGGA
GGCTGCACAC CCCGCCCCGC AGAAGGCAAC ACGGCGGACC GCGCTACGCG CCTTGCGACA
CTGCGTGCCC TGCTGGAGAG ACTGCACCAG GGAACACCAC TGGCCGACCT CAAGGAGGCC
TTCGCCTCTG CCGTGGGTGA CATCTCCGCA GCAGAAGTCG CATCGCTGGA GAAAGAACTT
GTGGCCGGAG GCGTTGCGGA GACGGAAATC AAGAGGCTGT GCTCGCTGCA CGTGGACATC
TTCCGCGAGG CCCTCGCACC ACAACACGTG CCGGACATGC CGCCCGGACA CCCCGTGCAC
ACCTACCGCG CCGAAAACGC GGAGGCGACA CGCATCGCCG ACGAAATCAT CAGTCAGATA
GACAGGATGC GCAGCGTACC CGGCGACACG GAGTCGCCCC TCGACGAACT GGCGTGGTCG
TTCAGCCGCC GCCACGTGGC CGACCTGCTT GCCGACCTTG CCCATGTCGA ACGCCATTAC
ACGCGCAAGG AGATGCAACT CTTCCCCATG CTCGAGGAGA ACGGCATCGA AGCCCCTCCG
AAGGTGATGT GGGAGGTGCA CGACGATATC CGCAGTCTGC TTCGCAAGGC CCGCGAGACC
GTGGAAGGCC CCTCGCCCGT GGCCGCCGCG ACCGTGGCGC GTGATGCGGC ACTTGCCGTG
AAAGACATGG TGGACAAGGA GGAGACGGTG CTCTTCCCCA TGGCACTGGA AAGCCTGACC
GAAGCGCAGT GGGGCCGTGT CAGGCACGGC GAGGATGAAA TCGGCTACGC GTGGGTGACC
CCCGAAGGGG AATGGACGCC GCAGGCGGCG GACACTCCGG AATCCCTGAC GATGGAAGGG
CCTGCTACCG GGACACCTGA CCGGGTGACG CTGGGGACGG GTACCCTCGC CGTGGAGACC
CTCGACCGCA TGCTGCGCAC CCTTCCCCTC GACCTGTCCC TTGTCGATGC CGAAGACCGG
GTGGCCTACT ACACGGACTC GACGCACCGC ATCTTTCCCC GCAGCGCCGG AGTCATCGGG
CGTAACGTGC GCAACTGCCA TCCGCCCAAG TCGGTACATA TGGTCGAAGA GATCCTGGCC
CGCTTCAAGA CCGGAGAACG CGACGAAGCC GCCTTCTGGA TAGAACTCGG AGGGCGCTTC
CTGCACATTC GCTACTTCGC CGTACGTAGC GACGAAGGCC GTTATCTCGG TTGTCTTGAA
GTCGCACAGG ACGTGACGGA CATCCGGGCC CTCAGCGGTC AGCGGAGGCT GCTCGACTGG
AATTGA
 
Protein sequence
MLLSATTSIH DLVTSHPYLI EVLADYAPAF AKLRNPLLRN TLGRVATLQQ AADLAGLELT 
GLMAHLARAI MEHTKEAVTL VPRGAETPAH AVADGDSGCG GCTPRPAEGN TADRATRLAT
LRALLERLHQ GTPLADLKEA FASAVGDISA AEVASLEKEL VAGGVAETEI KRLCSLHVDI
FREALAPQHV PDMPPGHPVH TYRAENAEAT RIADEIISQI DRMRSVPGDT ESPLDELAWS
FSRRHVADLL ADLAHVERHY TRKEMQLFPM LEENGIEAPP KVMWEVHDDI RSLLRKARET
VEGPSPVAAA TVARDAALAV KDMVDKEETV LFPMALESLT EAQWGRVRHG EDEIGYAWVT
PEGEWTPQAA DTPESLTMEG PATGTPDRVT LGTGTLAVET LDRMLRTLPL DLSLVDAEDR
VAYYTDSTHR IFPRSAGVIG RNVRNCHPPK SVHMVEEILA RFKTGERDEA AFWIELGGRF
LHIRYFAVRS DEGRYLGCLE VAQDVTDIRA LSGQRRLLDW N