Gene Msed_2272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2272 
Symbol 
ID5104224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2172935 
End bp2174431 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content41% 
IMG OID640508169 
Product7-cyano-7-deazaguanine tRNA-ribosyltransferase 
Protein accessionYP_001192334 
Protein GI146305018 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0343] Queuine/archaeosine tRNA-ribosyltransferase 
TIGRFAM ID[TIGR00432] tRNA-guanine transglycosylase, archaeosine-15-forming
[TIGR00449] tRNA-guanine transglycosylases, various specificities 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.512273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGGTG ACTTTGAAGT AAAGGATGAG GATCTGGCTG GTAGAATAGG GATACTGGAA 
ACCAAACATG GTAAGCTCGA AACTCCAGTT TTTTTCCCAG TCATAAATCC CCTTAAGCAG
GAGGTATTCC TAGAGGAACT CAAGGCCGTA GGCTTTAATA ATTTTATTAC AAATTCATTC
ATACTAAAGA AAAATAATAT TCTCCAAGGC ACTATTCACG AGAAGTTTGG GGATAATTTC
GTAATTATGA CGGATTCTGG AGCATACCAA ATACTTGAAT ATGGTGAGAT AGAGCAGACA
AATAGGGATA TTGTATCATT CGAGGCGAAA ATTAGGCCAG ATATAGCGGT GTTCCTAGAT
ATACCAACTG GCAATACTGA CGACCGAGAA GAGGCAAAGT TCTCAGTGGA GATGACCCTA
GAGAGAGGGA AGGAAATCGC TGACATTGTG GACCAGAACG AGGATATTAT CTGGGTTCAT
CCTATTCAGG GAGGTCAGTT TCTTGATCTA GTGGAGTATT CAGCTAGGGA AGCCAATAAA
AGAACCAACT ACAAGATGTT GGCTCTTGGA AGCCCTACGG TTTTCATGGA GAAGTATAAA
TATGATACCC TAGTAGACAT GATCTATACC GCTAAGTCGA GCGTAAGTAG GGGTGTTCCC
TTCCACCTAT TTGGAGGCGG TGTTCCCCAT ATAATTCCCT TTGCCGTAGC TTTGGGCGTG
GATTCGTTTG ACTCTGCATC CTATGCTATT TTCGCTAGAG ATAATAGGTA CTTGACGAGC
GAGAGAACTT ATAGACTTGA GGACCTTGAG TACTTCCCGT GCAGTTGTCC TGTTTGTTCT
AGATACGATC CGTCAGAACT ACTGGAGATG AAATCCGAAG AGAGATACAA ACTTCTGGCT
ATCCATAACC TTTGGAAGAT AAGGGAGGAA GTGAACAGGG TAAAGCAGGC CATTAAGGAA
GGAAGATTGT TTGAATATAT TCAACAGAAA GCTTACTCGC ACCCAGCTCT CTATTCAGCC
TTTAAATCAA TACTCAAATA CTCCAGTTAT CTAGAGAAAT ACGATCCGAG GGTCAAAGGC
AACGTTAAGG GACTCCTGCT ATTTGATCAT AATTCCATGA ACAGGCCTGA ACTTCTGAGA
CATTCAGAAT TTATGGCAAA TCTGAAACCC AAGAGGAACA AGGTAATAAT AATTTGTGGC
GATAAATTAG GTAGTCCCTT TATTTCAGAC CCTAAGGTCA AATCGATACA AGGAAGGAAC
AGAGACTACG ATACATTCGT AGCGCTTCCC TTCTATGGCC TAGTACCTGT TATGGCATCT
GAAGCGTTCC CATTATCGCA GTTCGAGATT CCTGACATAA TAGACGATAC CACATTAAAT
GAAACAATAC TGAAAATAAA AGAGACCTTG CGCAATAAAA ATTACGCAGA GATAAAGTTC
ATGGAATGTG AAAAATCTGT ACTATCACAT ATAATGTCTA TCAACCCCAC TCTTTGA
 
Protein sequence
MIGDFEVKDE DLAGRIGILE TKHGKLETPV FFPVINPLKQ EVFLEELKAV GFNNFITNSF 
ILKKNNILQG TIHEKFGDNF VIMTDSGAYQ ILEYGEIEQT NRDIVSFEAK IRPDIAVFLD
IPTGNTDDRE EAKFSVEMTL ERGKEIADIV DQNEDIIWVH PIQGGQFLDL VEYSAREANK
RTNYKMLALG SPTVFMEKYK YDTLVDMIYT AKSSVSRGVP FHLFGGGVPH IIPFAVALGV
DSFDSASYAI FARDNRYLTS ERTYRLEDLE YFPCSCPVCS RYDPSELLEM KSEERYKLLA
IHNLWKIREE VNRVKQAIKE GRLFEYIQQK AYSHPALYSA FKSILKYSSY LEKYDPRVKG
NVKGLLLFDH NSMNRPELLR HSEFMANLKP KRNKVIIICG DKLGSPFISD PKVKSIQGRN
RDYDTFVALP FYGLVPVMAS EAFPLSQFEI PDIIDDTTLN ETILKIKETL RNKNYAEIKF
MECEKSVLSH IMSINPTL