Gene Dret_1225 details


Gene Information

Locus tag: Dret_1225
Symbol:
ID: 8419053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1437349
End bp: 1438980
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 62%
IMG OID: 645037800
Product: formylmethanofuran dehydrogenase subunit E region
Protein accession: YP_003198091
Protein GI: 258405349
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
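
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the gene information fields of this record.
start_bp = 1437349
end_bp = 1438980
gene_length_bp = 1632
protein_length_aa = 543

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the stop.
codons, remainder = divmod(span_bp, 3)
translated_aa = codons - 1

print(span_bp, remainder, translated_aa)  # prints: 1632 0 543
```

Under these assumptions the span matches the stated gene length, and the codon count minus the stop codon matches the stated protein length.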


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.524983
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATCG GGTCGCATAC GTTTGAGGAA TTCAAGGAAA TCGCCCGGCG TTTCCACGGT 
TTTCCCGCCC CGGGGCTGCT GCTCGGCGGG TATATGGTTG AAAACGCCAA AGCCAGGATG
CCTGAGGGGG TCCTGTACGA GGCGATGGTC GAGACCGGCA AATGTCTGCC GGACGCGGTG
CAGCTGCTCA CGGTTCTGAG TGTGGGCAAC GGCTGGATGC GGGTGGTCAA TCTCGGCCGG
TACGCGCTGT CGCTCTATGA CAAATACACC GGTGCGGGAT GGCGGGTGTA CCTTGATTCC
GACAAGCTCG ACCATTGGCC GCATATGCAA TCCTGGCTGA TCAAGTCGAC GCCCAAAAAA
GACCAGGACA CCGAAGCGTT GTACAGGGAA ATCGAAGCCG CGGGCGACAC GGTTTGTTCG
GCGTACCCTA TCCAGATCCA CGAGCGGTTC CTCGGCAAAT CGAGCATGGG TGAGATCGGC
TGGTGCCCGG TCTGCCGGGA ATACTACCCC ACCCGCGACG GGGCTGTCTG CCGGGGCTGT
CAGGGTGACG CGCCCTACGC CAGCATACCG GGCTTTCCCG GACTCGGGCA ACATCGAGCC
CAGCCTCACG CGGTTTCTCT GGAAGAAGCC GTGGGCAAGA CCGCGGTCCA TGATATGACC
CGGATCGTCC CCGGCGAGAC CAAGGGACCG GCTATCCAGG CCGGTCAGGT GATCACCGGG
GGAGACCTGT GCCGTTTGCA GCAAATGGGA CGGGAACGGG TCTATGTTCG GGAGGAGCTC
GGAGGCGTTG ACGGCTTCGT CCATGAAAAC GATGCGGTCC TGGCCTTTGC CGAGGCCATG
GCCGGCGAGG GGGTCACCTA TACCGCGCCG CCGAAGGAAG GGAAAATCGA TTTTCGCGCC
AGCCAGAGTG GCCTGCTCGT GGTCGATATC GAGCGGCTCC GGGGGTTTAA TTGTTTGCCC
GATGTCATGT GCGCCACCCG GCAATCCGAT ATCTATGTCG AGAAGGACAA GGCGTTCGCC
GGCACCAGAG CGATCCCGCT GTATCTGAGT GAGGACCGCT TCGAGGCCGC TTTGGCTGTT
TTGCGCGACG GCCCCCTTCT GCGTGTCGAG CCGTTGCGCC CGATGGACAT CGGCATTCTG
GTCACTGGCT CTGAAGTCTT TAAGGGGTTG ATCGAAGACA AATTCGCCCC CCTCATCCGG
GGCAAGGTCG AGCACTTTGG TTGCCGGGTG TGTGCCGAGG AGGTTGTTCC GGACGACCGG
GAAACCATCG CTGCTGCCGT GGCCCGTCTG CGGGAGGCCG GGGCTCAGCT GATCATCACT
ACCGCAGGCT TGTCCGTGGA TCCTGATGAC GTGACCCGGG CGGGGCTGCT GGATGCGGGG
ATGACAGACA TGGTCTACGG TGCGCCCCTG CTGCCTGGAG CCATGACCCT GATCGGCCGT
GTGGGCTCCA GCCGGATTTT GGGCGTGCCC GCCTGCGCTC TTTTTTACAA AACGACCAGC
CTGGATCTGC TCCTGCCGCG GTTGCTGGCA AACCGCCCCA TCACCCGTGC TGCTTTAGCC
GAGATGGCGG ACGGCGGATT CTGCCTGAGT TGTACGGTGT GTACCTTTCC CAAGTGCCCG
TTCGGCAAGT AG
 
Protein sequence
MNIGSHTFEE FKEIARRFHG FPAPGLLLGG YMVENAKARM PEGVLYEAMV ETGKCLPDAV 
QLLTVLSVGN GWMRVVNLGR YALSLYDKYT GAGWRVYLDS DKLDHWPHMQ SWLIKSTPKK
DQDTEALYRE IEAAGDTVCS AYPIQIHERF LGKSSMGEIG WCPVCREYYP TRDGAVCRGC
QGDAPYASIP GFPGLGQHRA QPHAVSLEEA VGKTAVHDMT RIVPGETKGP AIQAGQVITG
GDLCRLQQMG RERVYVREEL GGVDGFVHEN DAVLAFAEAM AGEGVTYTAP PKEGKIDFRA
SQSGLLVVDI ERLRGFNCLP DVMCATRQSD IYVEKDKAFA GTRAIPLYLS EDRFEAALAV
LRDGPLLRVE PLRPMDIGIL VTGSEVFKGL IEDKFAPLIR GKVEHFGCRV CAEEVVPDDR
ETIAAAVARL REAGAQLIIT TAGLSVDPDD VTRAGLLDAG MTDMVYGAPL LPGAMTLIGR
VGSSRILGVP ACALFYKTTS LDLLLPRLLA NRPITRAALA EMADGGFCLS CTVCTFPKCP
FGK
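
The sequences in this record are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any computation. The sketch below shows one possible way to do that and to compute a GC fraction in Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGAATATCG GGTCGCATAC GTTTGAGGAA TTCAAGGAAA TCGCCCGGCG TTTCCACGGT"
last_line = "TTCGGCAAGT AG"

seq_start = clean_sequence(first_line)
seq_end = clean_sequence(last_line)

# The CDS begins with ATG and ends with TAG, a stop codon in translation table 11.
print(len(seq_start), seq_start[:3], seq_end[-3:])  # prints: 60 ATG TAG
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field of the record.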