Gene Msed_2105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2105 
Symbol 
ID5104399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2027393 
End bp2028976 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content38% 
IMG OID640507995 
Producthypothetical protein 
Protein accessionYP_001192169 
Protein GI146304853 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.282872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAAG TACGATTTGA ATTTAAGAAT ATTACCCTAA GGATGGGCAG GTTCCTTGAG 
TTCTTAGACA GGACAAGATA TTTCCCGCTA TTATCAACGA TTCATCCCTT TAACTTGTTC
CTCAGGCTCA TAAGGCAGAA CATTGAAGTG TATGGAGGAA GTATCTCAAG AGTGGAAAAT
CTTTCAAATA GATCATTCTT AATCTTTTTA ATTTCATTGT CTCTCATTGG AATACTAGTA
TGGCACATGG GGATATACTT TGTTCCCCTA TTAATTATAC CAGTTCTAGT TTATTTACTT
CCTGTTATCT ATATACTAGC CTCCAAGATG GAATATTTAT CCAGGTTAAA TCTAGAGCTA
CTTCCATTTT CAATTCTTCT ATATCTCAAT GCGTCACTGG GAAAGGGGCT TTACGAAACA
TTCAACGATG TAAACCAGAG TTCCTTGTTC ATTGCCTTCA GGAAAGAGTT TGAAATTATA
CAGAGGTATG GCATATTTCA TGGTAAATCA TTTCTCGATG GAATTCAGAG GAGAATAAAG
AATCTCAGAA CTGGCTTAAT AGTAAAGTTA TACTCGTCTT CGCTTTCAGG CCAATTTTTA
GGCGTAACCA TGGGTCAAAG GTCGCTGGAG TTCATTAATG ACTTGCTAGG AAACATAAGG
GAGGCCTTCA ACAACTATGT TTCAAAGGCC TCCGAGATAG TTGAAGTCAT TTTCTCCATC
TTCCTCTTAG TCCCCCTAGT AGCCATAGGG TTTCAGGGAT TATCTAGTAA TAATAATGGT
GAGATTCTTT TAATACCGTT ACTATTTGCG CCTCTCATTT ACCTATGGAT ATCCGTGTCC
CAACCCAACA TGGGTATCCA TGTTAAAATT GGGAAATTAC AATTGGTTGG TTTACTTCTG
TCCACTGCAC TATTGGCTCT ACCATTCAAT CTATTGCTCA GAGTAGGCAT AACGTTCCTT
GCAACTCAGC TGATTCTCTT TCCTTCCTAC CTAGTAATTA AGAGAGATGA GAGTATTCTC
GCCGATTTTC CAACCATATT AAGAGAGATA GGCGATTTCA CTAAGCTCGG ATATGGAATT
AGGGCATCTA TTCAAAGAAT AAATTTCGAT GAACTAGGCC TACACAAACC AACTGTAAAG
TTCTTTGATA ACGTAAAGAA ACAGATTGGG ATGGGAAACA ATATCTATTT TGGATCTATT
CAGAACGAAC AGGTAAAGTT TATCGTGGAG CTATTGAACA TCCTAGACAG GAAGGGTGGA
GAAGGAGTCA GGGTGCTCCA GGAACTGAGC GACATGATAT ATTCGATCAC ATTATCCAGG
ACTAAGCTAC AAAGAGAACT GAGTACGTTC AATATTCTTG CGCTTATTAC TCCCGTACTT
TTCTGGTTTT CCACTACTTC AATTGAAACA ATTTCCAGTT TCTCTTCTTC CACTCTAGGC
CTATTGAATT TAGGTTATAG CTTGACATTA AGCCTGTTAT ACACTAAGCT AAGTAAATTT
ACCTTCCTAA ATCCTGTGGT ATACATCTCT GTGACGTTAA TATCGATATT ACTTTCAATC
CTCCCTCCAG GATTATTATC TTGA
 
Protein sequence
MSKVRFEFKN ITLRMGRFLE FLDRTRYFPL LSTIHPFNLF LRLIRQNIEV YGGSISRVEN 
LSNRSFLIFL ISLSLIGILV WHMGIYFVPL LIIPVLVYLL PVIYILASKM EYLSRLNLEL
LPFSILLYLN ASLGKGLYET FNDVNQSSLF IAFRKEFEII QRYGIFHGKS FLDGIQRRIK
NLRTGLIVKL YSSSLSGQFL GVTMGQRSLE FINDLLGNIR EAFNNYVSKA SEIVEVIFSI
FLLVPLVAIG FQGLSSNNNG EILLIPLLFA PLIYLWISVS QPNMGIHVKI GKLQLVGLLL
STALLALPFN LLLRVGITFL ATQLILFPSY LVIKRDESIL ADFPTILREI GDFTKLGYGI
RASIQRINFD ELGLHKPTVK FFDNVKKQIG MGNNIYFGSI QNEQVKFIVE LLNILDRKGG
EGVRVLQELS DMIYSITLSR TKLQRELSTF NILALITPVL FWFSTTSIET ISSFSSSTLG
LLNLGYSLTL SLLYTKLSKF TFLNPVVYIS VTLISILLSI LPPGLLS