Gene Msed_2164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2164 
Symbol 
ID5104903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2080228 
End bp2081976 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content48% 
IMG OID640508057 
Producthypothetical protein 
Protein accessionYP_001192227 
Protein GI146304911 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000710411 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAATTTG AACCAATAGT GGGAAGGCTC AGGGAGGTCA GAGTAACCAC AAGGCCCACG 
GCAGATGGAA GAAGTTCCGT AACGTTCAGG AATTTCGTAG TGGAGATGCC ATACTCCAAG
GACCTTAACC TCAACGTGGG AGCCCTTCTC GCGGTGGAGA CCATAAGGAG TAATACCTAC
CTTATCCTTG AGGTCGCTGA CTACGTTCCC GTCCATTATG GAATGATAAA CATGGATGGA
TCGATACCCA AGGAGATAAG GGATCAGGTA ATGGAGGAGG TCTCCAAGAG TTGGAAAGAC
GGGAATAGCA CGGAGATGTG GATAGATGTG TTTGCGTATC CCATAGGATA CATCGCAACC
CCTGAAGGGT TCAAAAAGGG ATATCTTCCA CCTCTTCCAG GCTCTCCAGT TAGGATTCTA
AGCAGGGAGT CATTCAGGGA GTTCGTTTGC GCAAGGAATG GAGTGGAGAT CGGGAAAGTA
ATAGGGGAAG ATATCCCCCT GACCGTGGAT CTTTCGAAGG CTATGGTTTA CCACATGGGA
GTCTTCGCCT TTACTGGGTC GGGCAAGTCT AACCTCACCG CCTCCATAAT AAGGAGAATC
CTCAATAACA CAAACGCCAA GGTTGTGATC TTCGACGTTT CCATGGAGTA TTCAATCCTT
TTGTTGGACC AGCTTCTGGC CCAGAGGGCC GAGATCCTTA CCACGGATAG GTACTCACCC
AATCCCTTAG ATGCTAGCAG GAAGTTCATG AGAACCCACG TAATACCAGA GGAGTTAGAG
AAGTTCAGGG AGAACATTAG GAGGAGAGTT GAGGAACTGT TCACGTCGGG AAAGATAAGA
ACCCTCTATA TCCCTCCAGA GGGCTCAATG GGCTTAACCT TCGAAACCCT CCTGGAGCTG
GTGAAGGATC AGATAGATGA CAAGTACACG GCCTTCGCTC AGAAGCCCCT GTTTAGCCTC
ATGCTCAGGA AGCTTGACAC CTTCATGAGG CAGAACAGAA TTTCCAAGGA CGCTCCTCTA
GACGACTCAA TCCTTTCAAT ACTAGATGAA ATGGAAAATG AGGGTAGAAA CGCTGGTCTG
AAGGAGAACT CATCGCTCTT CTCATTCATA TCATCCTTAA GATCATACAT AAACACAGAG
GTCGAGGAAA GTGAGGAGTA CGACGTCGAG AAGCTAGCAA TAGACATTCT AGATAAAGAT
GAGTCCTCTC CCAGACTTTT CATCCTAGAG CTACAGAATC TGGAGGAATC CAGGGAAGTT
GTGGCGTCTC TGCTCGAGGA GGTTATGTCA AGGAGGAAAA GGTCCTTCAG TACTTCCCCA
ATTCTCTTCG TTTTGGACGA AGCTCAGGAG TTCATTCCAT TTGATACTAG GCAGAGAGAC
AAGAGCGAGC TGTCCAGTAA CGCCGTGGAG AAATTGCTGA GACATGGAAG GAAGTATCAC
CTTCACGCCC TGATCAGCAC CCAGAGGTTG GCTTACCTAA ACACGAACGT TCTTCAGCAA
CTCCACACTT ACTTCATCAG TGTCTTACCC AGGCCCTACG ATAGGCAGTT GGTCTCTGAA
ACCTTTGGGA TAAACGATAC CCTCCTCGAT AGAACCCTTG ATCTTGAGGT TGGACAATGG
CTCCTTGTGA GCTTCAAGGC ATCCTTACCG CACGATGTTC CAGTTTTCTT TACAGCACCC
AACAACCTAG AGGAGGTGAG AAGGGCGCTT GAGGAGAATA GACCAGCTAA TCCTGTCAAT
GGTAAGTGA
 
Protein sequence
MEFEPIVGRL REVRVTTRPT ADGRSSVTFR NFVVEMPYSK DLNLNVGALL AVETIRSNTY 
LILEVADYVP VHYGMINMDG SIPKEIRDQV MEEVSKSWKD GNSTEMWIDV FAYPIGYIAT
PEGFKKGYLP PLPGSPVRIL SRESFREFVC ARNGVEIGKV IGEDIPLTVD LSKAMVYHMG
VFAFTGSGKS NLTASIIRRI LNNTNAKVVI FDVSMEYSIL LLDQLLAQRA EILTTDRYSP
NPLDASRKFM RTHVIPEELE KFRENIRRRV EELFTSGKIR TLYIPPEGSM GLTFETLLEL
VKDQIDDKYT AFAQKPLFSL MLRKLDTFMR QNRISKDAPL DDSILSILDE MENEGRNAGL
KENSSLFSFI SSLRSYINTE VEESEEYDVE KLAIDILDKD ESSPRLFILE LQNLEESREV
VASLLEEVMS RRKRSFSTSP ILFVLDEAQE FIPFDTRQRD KSELSSNAVE KLLRHGRKYH
LHALISTQRL AYLNTNVLQQ LHTYFISVLP RPYDRQLVSE TFGINDTLLD RTLDLEVGQW
LLVSFKASLP HDVPVFFTAP NNLEEVRRAL EENRPANPVN GK