Gene Msed_1494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1494 
Symbol 
ID5104741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1459736 
End bp1460782 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content46% 
IMG OID640507382 
ProductDEAD/DEAH box helicase domain-containing protein 
Protein accessionYP_001191575 
Protein GI146304259 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGATC AGTTAAGCGA TCAACTCAAA AGGGCACTGG CTGACATAAA TTACCAGCGT 
CCTACTAAGG TACAAGAGAT AGCCATCCCC GTCTTCATGA ATGGGGAGAG CGTAATAGTA
CAGGCAAAAA CAGGATCAGG AAAGACTGCG TCTTACCTGA TACCAGTCCT GGAGAGATAC
CACAACGCAC TCATTCTAGT TCCCACGAGG GAGCTGGCTG AGCAAGTTGC ATATGAGGCT
AAAAGACTTG GAAAATACAA GAGGACCTCT ATTGGGGTAA TCATAGGTGG AGTAGGTTAT
GACAGACAGG AGCGAGAGTC CGATAGTGAC ATCATAGTCG GAACTCCTGG AAGAATCCTT
GATTTGTGGG GCAGGGGTAC TCTTGACCTC TCCAGGTTTA AGCTGGCCAT CGTGGACGAG
GTTGATAGAA TGCTCGATAT GGGGTTCATT GACGACGTGA GAATGATACT CTCCAAGACC
AGTGCTGAGA ATTTCGGCTT CTTCTCTGCA ACAGTACCTC CAGAGGTCAA GGAGTTAGCT
GAGGAGTTCT CACCCAATGC TAGATTTCTC AAGGTTGATG AATATAAACC GGTTGAAATA
GATCACGAAT TTTATCCGGT CCGCGATAAC TGGCACGAGA AAGTAACCAA ACTGCTCAAG
GATGTAAATG GAAAGGCCAT AGTGTTCACC AACACAAAGG CTAGAGCAGA GGCATTGTAC
GACAATATCT CAGATAGGGT AAGCACATCA CTACTTCATG GCGATATGTC TCAGGGCTCA
AGGAGGAGAA ACCTGATGAG CTTTAAGAGA GGCGACAGTG ACATCCTAAT ATCCACAGAC
CTAGCAGCTA GGGGGATAGA TGTGATAGAT GTGGAGCAGG TAGTTAACTT TGACTTACCT
AGGGATGTTG AAACATATAT CCATAGGGTA GGAAGAACTG GGAGGATGGG AAGGAAAGGG
AGGGCAGTAT CATACTACAC GAGGAGAGAA GAGGAAATGG TCCAGAGGAT TAGAAGCATT
ATCAAATCAA CTATCATCTC TCAATGA
 
Protein sequence
MFDQLSDQLK RALADINYQR PTKVQEIAIP VFMNGESVIV QAKTGSGKTA SYLIPVLERY 
HNALILVPTR ELAEQVAYEA KRLGKYKRTS IGVIIGGVGY DRQERESDSD IIVGTPGRIL
DLWGRGTLDL SRFKLAIVDE VDRMLDMGFI DDVRMILSKT SAENFGFFSA TVPPEVKELA
EEFSPNARFL KVDEYKPVEI DHEFYPVRDN WHEKVTKLLK DVNGKAIVFT NTKARAEALY
DNISDRVSTS LLHGDMSQGS RRRNLMSFKR GDSDILISTD LAARGIDVID VEQVVNFDLP
RDVETYIHRV GRTGRMGRKG RAVSYYTRRE EEMVQRIRSI IKSTIISQ