Gene Msed_1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1722 
Symbol 
ID5105085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1661163 
End bp1662392 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content46% 
IMG OID640507617 
ProductC/D box methylation guide ribonucleoprotein complex aNOP56 subunit 
Protein accessionYP_001191801 
Protein GI146304485 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTT ATTTGGCTGA GCACACAATA GGGTCCTTTG CCTTTGATGA GTCAGGGAAC 
CTTTTGGATT ACGTCCTTAA TCCAAAGGAA CTGGGCAAGG TAGTTGATAT TCTAATAAAC
GCCGAGAAAG GTGAACCAAT GCCCTCTACC ATGGAATTAA TTCAGAAACT GAAACCGTCG
GAAGTAGTTG TGGAAAGCGA AACGGAGAGC TCAAGGATGC AGACCTTAGG GATCAAGGTT
GTATCCAAGC CCCATCATGT GGGAGCCAGG GCTTTGAGGG GTTCTCTCGC TGAACTAGCA
GTAAAAACAA AATTCGCTGA AAATCCGAGC GAAGTGTACA ATTTCCTTTA TCAAGTATCT
CTAGAATATA CGAGAAGAAA GCTGAGAAAG GCCGCCCAGA AAAGGGACCT TCTCGCCATA
CAGGCCATAA GGGCTATCGA CGATATTGAT AAGACCATTA ACCTTTTCTC CGAAAGATTA
AGGGAGTGGT ATAGTATACA CTTCCCCGAA GCCGATAAAC TGGTTGAGGA CCATGAACAA
TACGCCAAAA TAGTTTCCCT GGCTGGTTAT AGGGATAATG TAACGGTGGA GACGTTAACC
GAGATAGGAC TTAATGAGCA AAGGGCTAAG AAGCTAGCCG ATGCTGCCAA GAAGAGTATA
GGAGCAGACA TCTCAGATGC GGATATCAAC TCCATCAGGG ATCTGGCTAA CACGATTTTG
TCTCTTTTCA AGCTAAGGAA CTCGCTTTAC GACTACTTGG ACTCAATTAT GAGGGAAGTA
GCTCCCAACG TGACTGAACT AGTGGGTCCC ACCCTTGGTG CTAGGCTGTT AAGTCTGGCA
GGGAGCCTTG AGGAACTTTC TAAGATGCCA GCTAGTACGA TTCAAGTGTT AGGGGCTGAG
AAAGCCCTCT TTAGGGCACT TAAGAGCGGA AGCAGACCAC CCAAACATGG AATCATTTTC
CAGTATCCAG CAATTCACGT CTCTCCCAGA TGGCAGAGAG GGAAGATTGC CAGGGCCCTA
GCTGCCAAGC TAGCAATAGC ATCAAGGATA GACGCCTATA GCGGAAGATT TGTGGGAACA
CAGCTTGTGG AACAGGTGAA TAAGAGAATC GAGGAGATAA AAACGAAATA TGCCCAGCCA
CCACCCAAAA AACAACAACC AGCTAAGGAA GAGGGGAAGA GATTTGATAA AAGAGAGCAT
AAAAAGGGGA AAAAGGGAAA GAGAAGGTAG
 
Protein sequence
MKIYLAEHTI GSFAFDESGN LLDYVLNPKE LGKVVDILIN AEKGEPMPST MELIQKLKPS 
EVVVESETES SRMQTLGIKV VSKPHHVGAR ALRGSLAELA VKTKFAENPS EVYNFLYQVS
LEYTRRKLRK AAQKRDLLAI QAIRAIDDID KTINLFSERL REWYSIHFPE ADKLVEDHEQ
YAKIVSLAGY RDNVTVETLT EIGLNEQRAK KLADAAKKSI GADISDADIN SIRDLANTIL
SLFKLRNSLY DYLDSIMREV APNVTELVGP TLGARLLSLA GSLEELSKMP ASTIQVLGAE
KALFRALKSG SRPPKHGIIF QYPAIHVSPR WQRGKIARAL AAKLAIASRI DAYSGRFVGT
QLVEQVNKRI EEIKTKYAQP PPKKQQPAKE EGKRFDKREH KKGKKGKRR