Gene Msed_0206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0206 
Symbol 
ID5104072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp169092 
End bp170189 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content46% 
IMG OID640506111 
ProducttRNA pseudouridine synthase D, TruD 
Protein accessionYP_001190307 
Protein GI146302991 
COG category[S] Function unknown 
COG ID[COG0585] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00094] tRNA pseudouridine synthase, TruD family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0784918 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0319695 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAATGA GTTTACTGGA CTTAGCAATA GGAATAGAGT TTAAGGTGCA TAGATGGGCT 
TCAATTCGTG CAGAAATTCC AAGGCCTGAC GGCTTTCGGG TAACGGAGGA GATTGACGGA
AAACCGTGTA CAGCCTGGAG AGGATCAGAG AGTGGCAAAT ATGCCGTTTA TCTCCTGAGA
AAAAGAGGAA TGGAACATAA TGCGGTTATG TCCAGGCTGG CCTCTATTCT CGGTGAAAAA
CCAAGGTACC TAGGAATAAA GGATACTAAT GCAGTTACGG AGCAACTAAT ATACGTAACG
AGGAAGTCAA AGGATTTCCA CAGGGAGGAG TCCTTTTCCA TAGAGTTCAT GGGATTCACT
TCGACAAAAC TGAACCACAC CGGTAACATT TTCTCCATAA AGCTTGAGAC AGGGGATAAG
GAGGAGCTCA AAAGAAGGGT CAACACCATA AAGGGTGAAG GCGTTTTACC AGCCTTCATA
GGGTATCAAA GGTTTGGAAC CAGAAGACCC ATAACGCATC TGGTAGGGAA GGCCTTAACT
CAAAGGGACT GGTGCAAGGC GGTGGACTTC ATTCTAGGTT ATCCCTTCGT GTGGGAGAAC
GAGAACATCA GGCTATTTAG AGAGGAATAC ATGAAAGGGG AGGTAAAAGA GGAACTTCTC
AGGAAGATAC CGAGCCAGGA GAGAAACATT TACCTTGAGT TGAGGAAGAC CGAAGATTGC
CTCTCTGCTC TCAGGAAATC GCGGGTTAAA CTTAGCTTTT ATGTGGAGGC TTATCAAAGT
TACCTTTTCA ACAGGGTGCT ATCCAGGAAA CTAAGATATT CCACAGTGCA CGAGAGGGAT
GAGATAACCA TTCCCACGGA TCCCAAACAA TGCGACGCAG AGTGCCTGGA AGTCTTCGAG
GTTGAAGGGA TACAGAGGGG CAGTTTCCAC ATTGAGGAAC TGGGAATATC CCTTAGACCT
GTGAAAAGAA ACGCTTTCAT GAATGTCAGA GGCCTGCATT TTGACGGCGA GTTCGTAACG
TTTTCCTTGG AAAGAGGGAT GTATGCAACT GTGGTTCTAT CTGAGATCCT AAACGCCGAT
CCAAAAGAGT TCACTTGA
 
Protein sequence
MRMSLLDLAI GIEFKVHRWA SIRAEIPRPD GFRVTEEIDG KPCTAWRGSE SGKYAVYLLR 
KRGMEHNAVM SRLASILGEK PRYLGIKDTN AVTEQLIYVT RKSKDFHREE SFSIEFMGFT
STKLNHTGNI FSIKLETGDK EELKRRVNTI KGEGVLPAFI GYQRFGTRRP ITHLVGKALT
QRDWCKAVDF ILGYPFVWEN ENIRLFREEY MKGEVKEELL RKIPSQERNI YLELRKTEDC
LSALRKSRVK LSFYVEAYQS YLFNRVLSRK LRYSTVHERD EITIPTDPKQ CDAECLEVFE
VEGIQRGSFH IEELGISLRP VKRNAFMNVR GLHFDGEFVT FSLERGMYAT VVLSEILNAD
PKEFT