Gene Msed_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1457 
Symbol 
ID5104827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1426534 
End bp1428507 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content48% 
IMG OID640507345 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_001191538 
Protein GI146304222 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGAAA CGAAAAGTAT CTGCCCATAC TGTGGAGTAG GGTGCGGTCT TATACTTGAG 
GGTGAGAATA ACGTAGTTGC GAGAGTGTAT CCAGACCGCG ATCACGTGGT TAGCAAGGGT
CACATATGCG GTAAAGGGAG TACAGCCCAC GAACCTGGAA ATAGTTGGGA CAGGCTTCTG
TATCCCCTGA AAAGAGAAAA GGACATCCTA GTCAGGATTT CGTGGGACGA AGCTATCCGA
GAGATCGCGT CAAAACTCTC GGAGATAAGG AGCAAGTACG GTCCTAGTGC CATAGGATTC
TACGGAGGTT GCCAGAACAC CCTTGAGGAG GGATATACCA TGATGAAGCT AGCAAGGGCC
TTAGGAACCA ATAATGTGGA TTCATGTGCG AGAGTTTGTC ACGATCCCTC AGCTACAGCC
CTAAAGGAGA TGGTAGGCCT CGGTGCAACC TCAACCTCAG TTACGGAGAT ACCCAAAAGC
AAGGTCTTGG TTATAGTTGG AGAATCACTA ACCGAGAGCC ATCCCGTCCT AGTTCAGTAT
CTCTCGATGT TGAAGAAAAA TAACGGCAAG GTAGTAGTGA TAGACCCTAG GGTAACAGGA
ACTGCGAGGT TGGCTGACCT TCACCTCAGG GTTAGACCTG GTACAGACAT TTACCTGTTT
AATGCCGTTG CCAACTACTT GATCTCCAAC AACATCTACG ACAAGAAGTT CGTTGAGGAA
AGGGTGGAAG GATTCGTTGA GTTCTCTAGG CTTGTTAAGT CCTACACAAT CCAAGGAGCA
GAGGAAATAA CGGGGATAGA TCAGTCCGCT ATCCTCGAGT TTGCCAAACT AATATCGCAG
AAACCTGTCA TCTTCTCCTG GGGTCTGGGG CTTACCCAGA CTGGAGGGCC TAAGGCAGTC
CGTAGCCTAA TTAACCTCGC CCTGCTTACA GGCAATGTGG GTTTCGAGGG AGCGGGCCTC
CTAGTATACA GGGGACAGAC CAATGTACAA GGATCAGGAG ATATGATTAA GCCCAACGTG
TTTCCCAATG GTCCCATGAC GCTGGAAACG GCGAGGGAGC TGGAGAAGCT ATGGGGTTTC
TTGCCTCCCA CATGGGAAGG TAAAACTGTA ACTGAAGCCC TCCTTGAGTC GGACATGAAG
GCCGTGGTAC TCATGAACTT CAACCCTGCA GTGAGTTTCC CAAACAGACA GAAGGTTGAG
AATTTCTTGA AGTCCCTAGA GCTTCTGGTA GTTATGGATC CCTTCATGAC AGAGACCGCA
AGGTTTGCAC ACTACGTCCT GCCGTCGGCT ATGTGGACCG AGAAGGAGGG TTCCGTCACC
AGCCTTGATA GAGTTGTGAA ATGGAGGTTT AGGGCAGTAT CTCCTCCAGG AGAGGCGAAG
GAAGAGCTCG AGATCCTGTC CCTCCTCGCA GATAGACTGG GATTCAAGGG ATTTTCCAGG
GATCCAAAGG AGGTATTCAA GGAATTGAGG AGCGTGGTCA AGATCTACTC TAACTTAACT
TTGGATCAGG TCATGGACTA CTCATCCCCC TCAAGATACC CAGAGAACGA CCCAGTTCTC
TACAGAACAA GGTTCTATAC TGCAAGTGGG AAGGCTAAGT TGAAGTTTGA GGAACAACCA
GAACCCAAGA AAGGTCTCAT CTTGATAACG GGCAGAGCGG TAACTAGGTA CAACACAGAC
GAGATGATAA GCAGAACACC TGGATTCGGC CAACTTACAC CCGTGATTTA CCTTAATCCA
AGGGACGCAC AAAACCTGGG TATCAAGGAT AATGACCTGG TAAAGGTATC CTCAAGATGT
GGTATGGCAA TCCTAAGCGC CAAAATCTCC CCCGACGTGT TAGAGGGAAC AACTTTCGCG
TATATGCACG TCCACAGTAT CAATAATGTA GTCTGTGATG AGCTGGATCC AGAAACTAAA
ACTCCGAGAT ATAAGTACAC TGAGATAACT ATAACAAAAA TTGAATGGGT CTAG
 
Protein sequence
MLETKSICPY CGVGCGLILE GENNVVARVY PDRDHVVSKG HICGKGSTAH EPGNSWDRLL 
YPLKREKDIL VRISWDEAIR EIASKLSEIR SKYGPSAIGF YGGCQNTLEE GYTMMKLARA
LGTNNVDSCA RVCHDPSATA LKEMVGLGAT STSVTEIPKS KVLVIVGESL TESHPVLVQY
LSMLKKNNGK VVVIDPRVTG TARLADLHLR VRPGTDIYLF NAVANYLISN NIYDKKFVEE
RVEGFVEFSR LVKSYTIQGA EEITGIDQSA ILEFAKLISQ KPVIFSWGLG LTQTGGPKAV
RSLINLALLT GNVGFEGAGL LVYRGQTNVQ GSGDMIKPNV FPNGPMTLET ARELEKLWGF
LPPTWEGKTV TEALLESDMK AVVLMNFNPA VSFPNRQKVE NFLKSLELLV VMDPFMTETA
RFAHYVLPSA MWTEKEGSVT SLDRVVKWRF RAVSPPGEAK EELEILSLLA DRLGFKGFSR
DPKEVFKELR SVVKIYSNLT LDQVMDYSSP SRYPENDPVL YRTRFYTASG KAKLKFEEQP
EPKKGLILIT GRAVTRYNTD EMISRTPGFG QLTPVIYLNP RDAQNLGIKD NDLVKVSSRC
GMAILSAKIS PDVLEGTTFA YMHVHSINNV VCDELDPETK TPRYKYTEIT ITKIEWV