Gene Msed_2254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2254 
Symbol 
ID5104207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2154562 
End bp2156490 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content44% 
IMG OID640508150 
Productbeta-lactamase domain-containing protein 
Protein accessionYP_001192316 
Protein GI146305000 
COG category[R] General function prediction only 
COG ID[COG1782] Predicted metal-dependent RNase, consists of a metallo-beta-lactamase domain and an RNA-binding KH domain 
TIGRFAM ID[TIGR03675] arCOG00543 universal archaeal KH-domain/beta-lactamase-domain protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.311245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCAATT TTAGGTTAGG TATAATATCG TCCATATACA ATGGAATTCC AAAGGAGGCT 
CAAATCACAA GAATTGAGTT CGAGGGTCCA GAAATCGCAG TGTATGTTAA GAATCCGTCA
GTTCTTAGCG ATAAAATGGA CCTAATAAAG AAGATAGTAA AGGAAATAAA GAAAAGGATT
GTAATCAAAG CTGATAACAG TGTCAGAAAG GATAAAAAGG AGACGATAGA AATAATAAAG
AATTTTGTTC CAGCAGAAGC TCAAATTTCT GATATAAAGT TTGACGATGA ACTAGGAGAG
GTTCTCATAA AGGCAAAGAA GCCTGGACTG GTAATAGGTA AAAGAGGTCT AATCCAACAG
AAGATTTTCA TGGATACTTA CTGGAGGCCT GTAATCATAA GAGAACCACC CATCAAGTCC
AGAACCGTAG AGGGAGTTCT CACCCACATA TACAACGAAA CTGAGTATAG GGCGAAGATG
CTCAAGACCT TTGGAGAGAG GATACACAGG GAGATACTAT TTAAGGACAG ATACGTGAGG
GTAACTGCCC TGGGAGCCTT CCAAGAGGTT GGGAGATCAG CAGTTCTGGT GGAGACCCCT
GAAAGCAGGG TGTTGATGGA CGTGGGAGTG AATCCGAGCG TGAACTTTGG AGAAAGGATG
TTTCCGAAGC TGGATATTGA CCAGTTGAGA TTGGAGGACC TGGACGCTGT GGTATTGACC
CATGCCCACC TGGATCACAG CGGAATGATC CCATTTCTTT TCAAGTACGG ATATGACGGT
CCGGTGTACA CAACTCAGCC CACTAGAGAC ATAATGGCCT TAATGCAGTT GGATCTACTT
GACGTGGCAG ATAAGGAAGG AAGACCTCTG CCATATTCCG CTAAGGAAGT TAGAAAGGAG
TTACTCCATA CCATTACCCT AGATTACGAG GAAGTTACGG ATATAGCTCC GGATATAAGG
CTCACCTTCT ATAATGCAGG GCATATCATC GGTTCAGCAA TGGCCCATCT TCACATAGGA
GATGGTGTGC ACAACCTTGT CTACACTGGA GATTTCAAAT ACGCTAGAAC GAGATTGTTG
GACAGAGCCG TTTCCGAGTT CCCTAGGGTG GATACCCTAA TAATGGAGAC CACCTATGGT
GCACAGTTGC AGACCAATAG GGATGAATCG GAGAAGCAGT TGATTGATGT AATAAACAAG
ACCCTAAATA GGGGTGGCAA GGTTCTCATA CCTGTGTTAG CTGTGGGGAG AGGACAGGAA
ATCATGTTGG TGATAAATGA CGCAATGAAG AGAAAACTCA TACCAGAAGT TCCAGTTTAT
GTTACAGGGC TCTTCGACGA GGTTACAGCA ATACACACCG CATATCCTGA GTGGCTAGGC
AAAGAAGTGA GAGATTCAAT CTTGTTCAGG GATGAGAACC CGTTCACTTC TGAGTTCTTC
AAGAGAATAG AAGGATACAG GGAGGATATC GCAGAGGGTG AGCCCAGTAT AATTCTTGCA
ACTTCCGGTA TGTTGAACGG TGGACCGGCA GTTGAGTTCT TTAAACAGTT AGCTCCCGAT
CCAAAGAACA GTTTAATTTT CGTTAGTTAT CAGGCTGAGG GAACCCTGGG TAGAAAGGTT
AGAGACGGGG CTAGGGAGAT ACAGATTATA GGAAGGGACG GAAGAGTAGA TAACATAAAG
GTTAATCTAG AGACTACACC CATAGACGGA TTTTCAGGAC ATTCAGACAG GAGACAACTA
CTGAAATTCC TAGAGGATTT AACGCCCAAA CCCAGGAATA TTATATTAAA TCATGGAGAG
GCTTCAGCTA TTAGAGAATT CAAGAAAAAC ATAGAAAACT TTAGAGAAAG GGAGAGACTT
GGCTTGAGAT CAGCGAATAT CTATTCCCCA GCAATTCTTG ATAGCATCAG GTTGGACAAG
ACTTCCTAG
 
Protein sequence
MSNFRLGIIS SIYNGIPKEA QITRIEFEGP EIAVYVKNPS VLSDKMDLIK KIVKEIKKRI 
VIKADNSVRK DKKETIEIIK NFVPAEAQIS DIKFDDELGE VLIKAKKPGL VIGKRGLIQQ
KIFMDTYWRP VIIREPPIKS RTVEGVLTHI YNETEYRAKM LKTFGERIHR EILFKDRYVR
VTALGAFQEV GRSAVLVETP ESRVLMDVGV NPSVNFGERM FPKLDIDQLR LEDLDAVVLT
HAHLDHSGMI PFLFKYGYDG PVYTTQPTRD IMALMQLDLL DVADKEGRPL PYSAKEVRKE
LLHTITLDYE EVTDIAPDIR LTFYNAGHII GSAMAHLHIG DGVHNLVYTG DFKYARTRLL
DRAVSEFPRV DTLIMETTYG AQLQTNRDES EKQLIDVINK TLNRGGKVLI PVLAVGRGQE
IMLVINDAMK RKLIPEVPVY VTGLFDEVTA IHTAYPEWLG KEVRDSILFR DENPFTSEFF
KRIEGYREDI AEGEPSIILA TSGMLNGGPA VEFFKQLAPD PKNSLIFVSY QAEGTLGRKV
RDGAREIQII GRDGRVDNIK VNLETTPIDG FSGHSDRRQL LKFLEDLTPK PRNIILNHGE
ASAIREFKKN IENFRERERL GLRSANIYSP AILDSIRLDK TS