Gene Dret_0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0226 
Symbol 
ID8418030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp282666 
End bp285683 
Gene Length3018 bp 
Protein Length1005 aa 
Translation table11 
GC content57% 
IMG OID645036791 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_003197106 
Protein GI258404364 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.174026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0776388 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTCA CCCGAAGACA TTTTCTGAAG CTCTCCGCTT CGGCTGCGGC AGTCACCGCA 
TTCGGCGGGC TGGGATTCAG CCTGAAGCCG ACCGCGGCAC AGGCTCAGCT GCTGAAATTG
CGCTGGGCCA AGGAAACCAC ATCCATCTGT TGTTATTGCG CGGTAGGGTG TGGACTGATC
GTCCATACCT CCCAGGAAGG ACAGGGCCGG GCCATCAATG TCGAAGGCGA TCCGGACCAC
CCCGTCAGTG AAGGCTCCTT GTGCGCCAAA GGGGCGGCCA TTTTCAACCT GGGCGAGAAC
GAAGACCGCA TCACTTCGGT TCTTTATCGC GCTCCGGGCA GCGAGAAGTG GCAGGAAACA
TCCTGGGATT GGGCCCTGGA CACCATCGCC AAACGGGTCA AGGAAACCCG TGACGCCACG
TTTACCCGGA CTAATGCCCA AGGGCAGGAA GTCAATCGGT GCAACGGCTT GGCCTCTGTT
GGCTCCGCCG CCATCGACAA CGAAGAGTGC TGGGTCTATC AGGCCATGCT GCGCTCTCTG
GGCCTGGTGT ATATCGAGCA CCAGGCGCGT ATCTGACACT CCGCAACGGT AGCGGCTCTG
GCAGAGTCGT TCGGACGCGG TGCGATGACC AATCACTGGA TCGACATCAA AAACAGTGAT
TGCATTTTGA TCATGGGCAG TAACGCTGCC GAAAACCACC CTGTCTCCTT CAAGTGGGTG
ACCAAGGCCC AGGAAAAAGG GGCCCAACTG ATCCATGTCG ACCCCAGGTA CACGCGGACT
TCGGCCAAGG CCGATATCTA CGCCCCCTTG CGTTCCGGTT CCGACATCGC GTTTCTTGGC
GGTTTGATCA AATATCTGAC CGACAAGGAA ATGGTGAACT GGGAATACGT CATCAATTAT
ACCAACGCGA CATTTATCCT GAGCGACGAG TACGGCTTCG AAGACGGCCT TTTTGCCGGC
TTTGATCCCA AGACCAAGAG TTACGATAAA TCCAAGTGGA GCTTTGTCCT CGACGAAAAC
GGCGTCCCCA AACGGGACAC CAACCTGGCG GATCCCCGGT GTGTCTACAA CCTCATGCGC
AAACACTACG AACGCTACAC CCTGGATAAG GTCTCCAAGG CGACCGGGAC GCCCAAGGAA
GACCTGCTCA AAGTGTACAA AGCGTACGCG GCGTCCTATA AAGCCGACAA ATCCGCGACG
ATCATGTACG CCATGGGCTG GACCCAGCAT ACCGTCGGCG TCCAGAACAT CCGCGCCATG
GCCATGATCC AGCTTCTGCT GGGCAATATT GGCGTGGCTG GCGGCGGCGT GAACGCCCTG
CGCGGCGAGT CCAATGTGCA GGGGTCCACG GACCATTGCC TCCTGTACCA CATTCTGCCG
GGGTATCTGA AGACGCCCAA GGCGTCGCAA CCGACGCTCC AGGCCTATAA TGAAGCCTAC
ACTCCGGTCA GCAATGACCC CAAATCCGCC AATTGGTGGC AGCATTATCC GAAGTACTCG
GCCAGCTTGA TCAAGGCCAT GTACAAGGAC GCCCCGATTG AAAAAGGGTA CAAATGGCTG
CCCAAACTTG ACGACGGCAA AGGGTATTCC TTCCTGGAAC TCTTTGACGC CATGTACAGA
GAAGAGATCA AAGGCTTTTT CGCCTGGGGA CAAAACCCCG CCAGCGGTCT GGCCAACTCG
AACAAATCCC GTGAAGCGCT GTCCAAATTG GACTGGATGG TCGTGACCAA CATCTTCGAC
AATGAAACAG CCTCGTTTTG GAAGGGCCCG AACATGGATC CCAAGTCCGT GGACACCGAG
GTCTTCTTCC TGCCGTGCGC TGTGTCTATC GAGAAGGAAG GTTCGATCAC CAACTCTGGA
CGCTGGATGC AGTGGCGGTA CGAAGGGCCG AAGCCCCTGC CGAACACCAA GACAGACGGG
GACATGATCG TCGAGCTGAC CAAACGGCTC CAAAAGCTCT ACGCCAATGA AGGCGGCACC
TACAGTGAGC CGATCGTCAA TCTGAGCACC GAACTGTGGG AAAAGAACGG CAAATACGAT
CCACACAAGG TGGCCAAGCT GATCAACGGT TTCTTCCTCA AGGACGTCAC CGTCCGCGGC
AAATCCTTCA AGGCCGGGGA TCAGGTCCCG AGTTTCGCCT ATCTCCTGGA AGACGGGACC
ACGACCTCGG GCAACTGGCT GTACTGCAAT TCGTACACCA ATGAGGGCAA TATGGCCGCC
CGGCGCGACA AATCCCAGAC CAAGATGCAG GCCAATATCG GTCTGTATCC GAATTGGTCC
TGGTGTTGGC CGGTCAATCG GCGGATCATC TACAACCGGG CTTCCGTGGA TCTCAAAGGC
AAACCGTACG CGCCCGACAA ACCGGTCATC AAATGGACCG GGGACAGCTG GGCCGGCGAT
GTTCCCGACG GTGGCTGGCC TCCGGGCGAA AAGCACGCCT TTATCATGCG CAAGCATGGC
TTTGGTCAGA TTTTCGGCCC CGGCCGGGCT GATGGACCGT TCCCGGAATA CTACGAACCC
TTGGAGTGCC CGCTGGAAGA ACATCCGTTC TCCTCGCAAC TGCACAATCC AACGGCGCTG
ACCTTTGAGG GGGCCATGGA CAAACGGCGT TCCTGCGATC CGCGCTATCC GTTTGTCGGC
ACGACCTATC GGGTCACCGA ACACTGGCAA AGCGGAGTCA TGACCCGTTG GCAGCCGTGG
CTTATCGAAG CCGAACCGGA ACTGTTCGTG GAAATGAGCC CGGAACTGGC CAAGATGCGC
GGCATCGAGA ACGGGGAACG AGTCATCGTG GAATCCGCCC GGGGTCAGGT CAAAGCTGTG
GCCATGGTTA CCCCGCGGAT GCAGCCCTTT ACGATTATGG GGCAGGTCAT CCACCAGATC
GGGCTCCCCT GGCATTACGG TTGGGTCTAC CCCAAAGACA GCGGTGACGC GGCCAATCTG
CTCACACCGT CTGTCGGGGA TGCGAATACC GGTATTCCCG AAACCAAGGC CTTCATGGTC
AATGTTCGCA AGATTTAA
 
Protein sequence
MTFTRRHFLK LSASAAAVTA FGGLGFSLKP TAAQAQLLKL RWAKETTSIC CYCAVGCGLI 
VHTSQEGQGR AINVEGDPDH PVSEGSLCAK GAAIFNLGEN EDRITSVLYR APGSEKWQET
SWDWALDTIA KRVKETRDAT FTRTNAQGQE VNRCNGLASV GSAAIDNEEC WVYQAMLRSL
GLVYIEHQAR IUHSATVAAL AESFGRGAMT NHWIDIKNSD CILIMGSNAA ENHPVSFKWV
TKAQEKGAQL IHVDPRYTRT SAKADIYAPL RSGSDIAFLG GLIKYLTDKE MVNWEYVINY
TNATFILSDE YGFEDGLFAG FDPKTKSYDK SKWSFVLDEN GVPKRDTNLA DPRCVYNLMR
KHYERYTLDK VSKATGTPKE DLLKVYKAYA ASYKADKSAT IMYAMGWTQH TVGVQNIRAM
AMIQLLLGNI GVAGGGVNAL RGESNVQGST DHCLLYHILP GYLKTPKASQ PTLQAYNEAY
TPVSNDPKSA NWWQHYPKYS ASLIKAMYKD APIEKGYKWL PKLDDGKGYS FLELFDAMYR
EEIKGFFAWG QNPASGLANS NKSREALSKL DWMVVTNIFD NETASFWKGP NMDPKSVDTE
VFFLPCAVSI EKEGSITNSG RWMQWRYEGP KPLPNTKTDG DMIVELTKRL QKLYANEGGT
YSEPIVNLST ELWEKNGKYD PHKVAKLING FFLKDVTVRG KSFKAGDQVP SFAYLLEDGT
TTSGNWLYCN SYTNEGNMAA RRDKSQTKMQ ANIGLYPNWS WCWPVNRRII YNRASVDLKG
KPYAPDKPVI KWTGDSWAGD VPDGGWPPGE KHAFIMRKHG FGQIFGPGRA DGPFPEYYEP
LECPLEEHPF SSQLHNPTAL TFEGAMDKRR SCDPRYPFVG TTYRVTEHWQ SGVMTRWQPW
LIEAEPELFV EMSPELAKMR GIENGERVIV ESARGQVKAV AMVTPRMQPF TIMGQVIHQI
GLPWHYGWVY PKDSGDAANL LTPSVGDANT GIPETKAFMV NVRKI