Gene Dvul_2656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2656 
Symbol 
ID4663165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp3098525 
End bp3099550 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content67% 
IMG OID639820903 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_968095 
Protein GI120603695 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCG AAACCCTTCT TCTCGATTAC GGCAGCGGCG GGCGCGCATC GCACCGCCTC 
ATTTCCGACC TCTTCCTCCG CCATTTCGAC AACCCCATCC TCGGCACGCT CAACGACGCC
GCCCGTCTCG ACCTGACAGG CCCCCTCGCC ATGAGCACCG ACAGCTACAC CGTAGACCCC
ATCTTCTTCC CCGGCGGCGA CATCGGCACG CTGGCGGTGC ACGGCACCGT CAACGACGTC
TCCATGCTGG GCGCACGGCC GCGCTACCTC AGCTGCGGTT TCATCCTCGA AGAGGGACTG
GACATGGACA TCCTCGAACG GGTGGTCGCC TCCATGGGGA AGGCCGCGCG TGAGGCGGGG
GTGTTCATCG TGACGGGTGA TACCAAGGTC GTGCCCCGTG GGGCCTGCGA CAAGATGTTC
ATCAACACCA CCGGCATCGG CGAGATTCTG GTCGACCCCG CGCCCTCGGG CGACAGGGCG
CGCCCCGGTG ACGCCATCCT CATCAGCGGC AGTATGGGCG ACCACGGGCT GACCATCCTC
TCGCAGCGTC AGGGGCTGAA CTTCGCTGCG GATGTGTGCA GCGACTCGGC CTCCCTCAAC
AGGGTGGTGG AGAAGCTGGT GCTGGAGGTC GGCGACATCC ACGTGCTGCG CGACCCCACC
CGTGGGGGTC TCGCCACGAC ACTGAACGAG ATAGCGGGCC AGTCGCAGGC CGTGTGCCAT
GTGCTGGAGA CGGCCGTGCC CGTGCGCGAG TCGGTGCGCA ACGGCTGCTC GTTCCTCGGA
CTCGACCCGC TGTATCTTGC CAATGAGGGC AAGCTCATCT GCATCCTGCC CGAGGAGAGG
GCCGAGGCCG CGCTTGCCGT GTTGCGCGAA GGGCCGCACG GTGAACACGC TGCCCGCATC
GGGAGTGTGA AGTCCGTCGG TGAACTCGGG GCAGCCCGGG CCGGTCAGGT GGTGATGGAG
ACGGCCCTTG GCGGGCACCG CCTGCTTTCC ATGCTCGAAG GCGAGCAGTT GCCGCGCATC
TGCTAG
 
Protein sequence
MSGETLLLDY GSGGRASHRL ISDLFLRHFD NPILGTLNDA ARLDLTGPLA MSTDSYTVDP 
IFFPGGDIGT LAVHGTVNDV SMLGARPRYL SCGFILEEGL DMDILERVVA SMGKAAREAG
VFIVTGDTKV VPRGACDKMF INTTGIGEIL VDPAPSGDRA RPGDAILISG SMGDHGLTIL
SQRQGLNFAA DVCSDSASLN RVVEKLVLEV GDIHVLRDPT RGGLATTLNE IAGQSQAVCH
VLETAVPVRE SVRNGCSFLG LDPLYLANEG KLICILPEER AEAALAVLRE GPHGEHAARI
GSVKSVGELG AARAGQVVME TALGGHRLLS MLEGEQLPRI C