Gene Dret_1076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1076 
Symbol 
ID8418901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1267396 
End bp1268484 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content60% 
IMG OID645037648 
Producthydrogenase expression/formation protein HypD 
Protein accessionYP_003197942 
Protein GI258405200 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0409] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00075] hydrogenase expression/formation protein HypD 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.490079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.158086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGTACT TGTTGAAACA ATTTCACGAT CCCCAATTGT GCCAAGCGCT TGTACGGGAT 
CTTCAGCAAC GGATCGGAGA TGGTTTCCGG TTCATGGAAG TGTGCGGCAC GCATACGGTG
AGTATTTTTC AAAGCGGCCT TCGCTCTTTG CTCCCCGACA ACTTGACCCA TTTGTCCGGG
CCGGGGTGTC CCGTGTGTGT AACCCACGAA CGCGAAGTGG CGGCTTTTTT GGATCTCGCC
AGACAGCCCG GTGTCGTTGT GGCCACATTT GGGGATCTGC TTCGCGTTCC CGGTCCTGAG
GGTCGCAGTC TGAAGGCCGC TCAAGCCGAA GGGGCGCAGG TCGAGGTCGT GTATTCGCCG
TTTGACGCTT TGCAACTGGC CCAGGACCGA CCCGGGAAAC ATATCGTTTT TTTGGGAATC
GGGTTTGAAA CGACCGCTCC GACTGTGGCC GCCACATTGA AGACCGCCAA ACGTGACGCT
CTGTCCAATT TCAGTGTCCT GAGCCTGCAC AAACTGGTCC CTCCGGCCCT GCAGGCCTTG
ATGGACGACA CGGCCTGCGC CATTGACGGC TTTTTGCTCC CAGGGCATGT CTCCGCGGTG
ATCGGGACGC AACCGTATGC CTTTGTGGCC GAAAAATACA GGACGCCCGG AGCGGTGGCT
GGATTTGAAC CGGTGGATAT CCTCGCCGGA CTTCAGGAAC TTTTGCGTCA ACGTGAAGCC
GGTGAGCCCA GTATCGCCAA TGTCTATCCT CGAGTGGTGG GCGACCAGGG CAATGCGAAA
GCGATGGCCC TGATGGAGGA GGTCTTCGCG CCCTGCGACA CCCTGTGGCG GGGTCTTGGC
GAACTGCCGG GCAGCGGTTT GGAGATCCGC TCTGCGTTCG CGGACCACGA CGCCCGTCAG
GTGTTGGGCA TTGAATGGCC CGAGGTTCCG CCTTTGCCGG GGTGCCGTTG CGGGGACGTG
CTCAAAGGCA AGCTGAGTCC GGAGCAATGC CCCTTGTTCG GGACCCGGTG TACTCCTGCC
ACTGCTGTGG GACCGTGCAT GGTCTCCACC GAGGGCAGTT GCGCCGCCTA CTACAAATAC
CGTCTGTAA
 
Protein sequence
MEYLLKQFHD PQLCQALVRD LQQRIGDGFR FMEVCGTHTV SIFQSGLRSL LPDNLTHLSG 
PGCPVCVTHE REVAAFLDLA RQPGVVVATF GDLLRVPGPE GRSLKAAQAE GAQVEVVYSP
FDALQLAQDR PGKHIVFLGI GFETTAPTVA ATLKTAKRDA LSNFSVLSLH KLVPPALQAL
MDDTACAIDG FLLPGHVSAV IGTQPYAFVA EKYRTPGAVA GFEPVDILAG LQELLRQREA
GEPSIANVYP RVVGDQGNAK AMALMEEVFA PCDTLWRGLG ELPGSGLEIR SAFADHDARQ
VLGIEWPEVP PLPGCRCGDV LKGKLSPEQC PLFGTRCTPA TAVGPCMVST EGSCAAYYKY
RL