Gene Msed_1318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1318 
Symbol 
ID5104569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1294134 
End bp1297055 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content49% 
IMG OID640507207 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_001191400 
Protein GI146304084 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCTCA CGGTGAGGAT AAATGATAAA GTATATTCCG CTAACCCTGG GGAGACTATC 
ATTGACGTTC TGAAAAGGAA CAACATTTAC GTACCCCACG TCTGCCTCAA CGAGGGTCTA
GTGCCCATAG AGAGCTGTGA CACTTGTCTG GTCAGGGTAA ATGGTAAGTT AATGAGGGCC
TGTTCCACCA GGGTTGAGGA CGGAATGACA ATAACCACTA ACGACGATAA GTCCAAAGGG
GCTAGAAAGG AGGCCATATC TAGGATACTT AGATATCACA AGCTATACTG CACAGTTTGC
GAGAACAATA ACGGAGATTG TGTCCTTCAT GAGGCCGTGA TTAAGGAGAA GGTCTTCCAT
CAAAGGTACG TTGAGAAACC CTACTCCCTA GATAATAGTG GGCCCTTTTA CGTTTATGAC
CCTGCACAAT GCATCCTTTG CGGGAGATGC GTCGAGGCAT GCCAAGACTT CGCCGTAAAT
GAGGTAATAT GGATCGACTG GAACCTGAAT CCGCCAAGGG TGGTGTGGGA TCAGGGTAAT
CCCATTGGAA ATTCTTCCTG CGTGAACTGT GGGACCTGCG TCACAGTCTG TCCCGTTAAT
GCCCTTATGG AGAAGGGAAT GTTGGGGGAG GCTGGACTCT TTACGTGGAT AAACCCAGAG
CTCAAGAAGA AGACAATAGA GGCAGTTGGG AAAGTTGAGG ACAACTTTAG CTTGCTCATG
ACAGTGAGCG AACTAGAATC CAAGGCGAGA CAATCACAGA TTAAGAAGAC CAAGACTGTG
TGCACTTACT GCGGAGTTGG TTGCTCCTTT GAGGTGTGGA CTAAGGGAAG GAAGATCCTG
AAGGTAGAGC CCAAACCTGA GTCACCAGCG AACGGGATCC TGACTTGCGT CAAGGGTAAG
TTTGGGTGGG ATTTCGTCAA TAGCCCAGAT AGGATAACTA AACCACTAAT TAGAGAGGGA
GATCACTTCA GGGAAGCCAG CTGGGATGAG GCAATCCAGC TTGTTGCGAG GAAGTTTAAG
GAAATCAAGG AGAGGTATGG CCCAGATTCC CTGGGCTTCA TTGCCTCAGA TAAAATGACC
AATGAGGAGG CTTACCTCCT TCAGAAGCTA GCTAGGGCAG TTGTGGGCAC GAACAACGTA
GATAACTCCT CTAGATATTG CCAATCTCCG GCTACAGTGG GTCTTTGGAG GACCGTTGGA
ATAGGAGGTG ATTCCGGGAC CATAAGGGAC ATTGAGAACG CTGATCTAAT CCTCATAGTG
GGGCATAACA CCACGGAAAG TCACCCCGTT GTGGGGAGTA AGGTTAAGAG GGCCCAAAAG
ATCAGGGGGG CCAAGATTGC CGTTATTGAC GTCAGGAAAC ACGAGATGGC AGAGAGAGCT
AACCTCTTCA TCAGGCCCAA GCCAGGTACA GATGCAGCGG TCTTGGCAGG GGTAGCGAAG
TATATCGTGG ACCAGGATTG GGTGGATCAC GAGTTCCTCA AGAGAGTTAA TGGTTTTGAG
GAGTTTAAGG AGACCATCAA GGGCTTCACG CTGGACTACG TTGAGAGCGT GTCTGGAGTG
CCCAGGGAAC AGATAATCAA ACTAGCTGAA ATGATACATC AAGCAAAGGG CGTAGCAGTG
TTGTGGGGGA TGGGTGTAAC TCAGCACTTA GGTGGGGCTG ACACTTCCAC CATCATATCC
GACCTTCTCC TATTAACCGG AAATTACGGA CGCCCAGGAA CCGGGGCTTT TCCCATGAGG
GGGCACAACA ACGTTCAAGG GGTCAGCGAC TTCGGTTGTC TACCAAACTA CATGGTAGGA
TACCAAAAGA TGGAGGAGAG CGTTATGTCC AAGTTCGAGG ACTCGTGGAG AACGACCCTT
AACAGAAAAC CTGGCCTGCA GATACCACAA ATGATAGAAG GGGTCCTCGA GGGAAAGATC
CATGCCCTTT ACGTCGTGGG AGAGGATACC GTGATGGTTG ACTGTGGAAC TCCTCTTACT
AGAAAGGCGT TGGAGAACGT GGACTTTCTT GTGGTTCAGG ACATGTTCAT GACAGAGACA
GCGAAGTTAG CTGACGTCAT ACTTCCAGCT GCGGCGAGTC TTGAAAAGGA TGGGACTTTC
GTCAACACAG AGAGGAGGAT ACAGCGGATC TACAAGGCCA TGGATCCCTT AGGTGAATCA
AAGCCTGACT GGGAAATAAT CCAAATGATA GCTAACGCCA TGGGAGCTAA CTGGAACTAT
CATCATCCCT CGGAGATCAT GGACGAGATC GCTAGGCTGA CCCCAATATT TGCTGGCGTT
TCCTATTCGA GGCTCGAGGG TTTCAATAGC CTGGTCTGGC CAGTGAACGA AGATGGAACT
GACACCCCTC TGCTTTACGT GAATTCCTTC GCCACTCCTG ATGGAAAGGC AATCCTCTAC
CCGCTGGAGT GGAAACCAAG ACCGCTGAAG GATGAGGTAC ACTCCATAGT GGTAAACACT
GGAAGGGTAC TTGAGCATTT TCACGGTGGT ACAATGACTG GAAGGGTTGA AGGTCTTAGG
AGAAAGTTCC CAGAAACGTT CGTGGAAATA TCCAAGGAAC TAGCTGAGAG GTACTCCATC
AAAAACGGAG ATCTAGTACT CGTTAAGTCA AAGTTTGGAG GGGAGATCAA GGCAAGGGCG
CTGGTCAGCG AAAGAGTGTC AGGAGAGGAA GTGTTTATTC CCCTCTTTGC CTCAGAACCC
TCAAAGGGTG TGAACAACTT GACAGGCCAA GAGTTTGATA AGGCTTCCGG AACTCCAGGC
TATAAGGATA CGCCCGTCTT GATCGAGAAG ATATCCAGCG GTGAGGGAAC TCCGTTACCA
AGGGATAACT GGAGGTTTCA CGTACAGGAG AGGAAGAGAC AGATCGGGAT TGAGGTGACC
AAGAAATGGA TGAGAGAGGA GTTCAAACCC TTGACAGAGT AA
 
Protein sequence
MSLTVRINDK VYSANPGETI IDVLKRNNIY VPHVCLNEGL VPIESCDTCL VRVNGKLMRA 
CSTRVEDGMT ITTNDDKSKG ARKEAISRIL RYHKLYCTVC ENNNGDCVLH EAVIKEKVFH
QRYVEKPYSL DNSGPFYVYD PAQCILCGRC VEACQDFAVN EVIWIDWNLN PPRVVWDQGN
PIGNSSCVNC GTCVTVCPVN ALMEKGMLGE AGLFTWINPE LKKKTIEAVG KVEDNFSLLM
TVSELESKAR QSQIKKTKTV CTYCGVGCSF EVWTKGRKIL KVEPKPESPA NGILTCVKGK
FGWDFVNSPD RITKPLIREG DHFREASWDE AIQLVARKFK EIKERYGPDS LGFIASDKMT
NEEAYLLQKL ARAVVGTNNV DNSSRYCQSP ATVGLWRTVG IGGDSGTIRD IENADLILIV
GHNTTESHPV VGSKVKRAQK IRGAKIAVID VRKHEMAERA NLFIRPKPGT DAAVLAGVAK
YIVDQDWVDH EFLKRVNGFE EFKETIKGFT LDYVESVSGV PREQIIKLAE MIHQAKGVAV
LWGMGVTQHL GGADTSTIIS DLLLLTGNYG RPGTGAFPMR GHNNVQGVSD FGCLPNYMVG
YQKMEESVMS KFEDSWRTTL NRKPGLQIPQ MIEGVLEGKI HALYVVGEDT VMVDCGTPLT
RKALENVDFL VVQDMFMTET AKLADVILPA AASLEKDGTF VNTERRIQRI YKAMDPLGES
KPDWEIIQMI ANAMGANWNY HHPSEIMDEI ARLTPIFAGV SYSRLEGFNS LVWPVNEDGT
DTPLLYVNSF ATPDGKAILY PLEWKPRPLK DEVHSIVVNT GRVLEHFHGG TMTGRVEGLR
RKFPETFVEI SKELAERYSI KNGDLVLVKS KFGGEIKARA LVSERVSGEE VFIPLFASEP
SKGVNNLTGQ EFDKASGTPG YKDTPVLIEK ISSGEGTPLP RDNWRFHVQE RKRQIGIEVT
KKWMREEFKP LTE