Gene Msed_1354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1354 
Symbol 
ID5103413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1325755 
End bp1327737 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content48% 
IMG OID640507243 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_001191436 
Protein GI146304120 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0575606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTATTT CCGAGACCTT CGTCGAGAAA CTTAGAGATT CCCTTGGTTT CGCGCCCTAT 
CCCTACCAGG AAAAGGTCGT GCAGGACGTT CTGGACAACT TGAGGAAGAG CCGTTTCATT
GTGGTTTCCA TGCCCACGGG ATCGGGTAAA ACCCTAGTGG AGCTTGCCCT AGCTGACTAC
CTAAGAGGGA AGGGGATGAA AGTCCTGGTC CTGGAACCCA CGAGACTTCT CTGCGATCAA
ATGTATCATA ACTTTTGGGT CAAAGTCTTT GATGATGTGG GGGAAGAGTA TGAGGGAAAT
TGCACCAGTT TCGAGGAAGG AAAGGGCGTG ATCGTGTCAA CTCCATTCAC CTCCTCCAAG
TGTATGCCAA AGGTTGATGC GGTCATCGTG GACGAGGTAC ACCACGCCTT TGGAGATCCG
AGGTACATGT CCGGCCTAAT TTCCATGAAG CCGAGGATAG TGATAGGCTT CACTGCCCTC
CTACCCAGTT CCAAGAGGTA TAGCATGGAC TCGAGGTTCG TGGAGGCCTT TGGGGTGCCC
TCTCTTCTGG CCTATGACTT CAAGAAGCTC TCCGAAATAG ATCCTTCCTT TACCTTACCC
AAGGCGATCG CGGATGTTTT TGATGCGGAG ATGGACGGTG TTGAGAACGT GGCGTACGAT
GCCCTCCTCA AGGGGAAGAT TCCAGGGGAC GAGAGGACGT TAAGCTTTCT AAGGTTTACC
CTGTATAGTC ACGGAAAGAC GGCCTTCTGC GAAAGTCTGG AAAACGTGAG AGATAAGGTT
TCAGATAACG TTACGTTGAG GACTCTCTGC AGTTCAGAGG GTATGGGACA TAAGGCTCGA
AGTCTTAAGG AGATACTCGA GGCCTACGAC GTCGAGGAAC ATCGGCCGGT CTTGATTTTC
ACTGCGAGGC GTTCCACGGC CCACGAGTTT GAGGTAATAC TCCATAACAT GGGGGTGAAC
CGAGTTAAGA CTTTAACCGG GGAGTTAAAC AAGGAGGAAC GCCTTCAGAT AGTTAATGAG
GCTAAGAGCG GTAATGTGGA CGTGATCATC TCTACGCACG TGGGTGAGGA GGGGATTGAT
ATCCCCGAGG CTAGGCTCCT CATAATGACA GATGTACCTA AGAGTCCACT CAGGTTCTAT
CAGAGACTCG GCAGACTAAT AAGGAAGAGC GAATCCAAGG GCGTAAAGTA TCTGGTCGTG
ACCTTAACTC CCAAGACTCC CGAGTACGAT GATCTGGATG AGGCTTTGCG ATCGCTTCAC
AGGGAGGGTG TTGACGTGAG TTACATTGTG GAGAGGAAGT CGGGGAAGGG ATCCACATCT
AGAATACTGG ATCAGGTTAA GGAGAAGGGA GGGGAGGTTC CCCTCATGAA GCTACTGGAG
ATGGAATATG ACCTGAAGGA CTACATAATG GTAAGAGGAA AGTCCAACGT CACTCAGTTC
CTTAACGCCA AGGAACAGGA TATTCCATAT TCTGACTTCG TCGATAGGGC AATCATGGAT
GGAGATCTCA TGTACTACTA TGACGTAGAG GGAATGGGAG ACCTGTTTGC CAAGATATTG
CTCTCCAAGT ATTGCCAACT ATGTTATGGT TCCCAGTGCC AGGGTTTATG TGATCTGGAC
ATCATGGTAC TTGGGAGAAG CAAACAGTAT AAGCTCACTA GAAAGGACCT TTTGAGGTAC
TTCATGGTGT TGTTCCCACC TGACAAGCTG ACGGATGTGG AGAAAAGAAT GGAAATCACT
TTCGAGAACC CAGGTTTCGG TATTTCCCTT CAGAGCAACG TAAATGAGAA GAACAGTTCC
ATCAGCTTCA ACGTTCAGTT AAACGCGTCA ATAAACGGTA TCACTGTCTA TCCTAAGATC
ACCCTGGCGT ATTATGGAGT AAAGAAAGAG GTTAAGGACT TCCTAAAAAA GAATGTCTTA
GCTATATGCA ATACTGCAGG CAAAATCTAT TTCTCTTATT TCACTTCCCG ACCTGTTCCT
TGA
 
Protein sequence
MSISETFVEK LRDSLGFAPY PYQEKVVQDV LDNLRKSRFI VVSMPTGSGK TLVELALADY 
LRGKGMKVLV LEPTRLLCDQ MYHNFWVKVF DDVGEEYEGN CTSFEEGKGV IVSTPFTSSK
CMPKVDAVIV DEVHHAFGDP RYMSGLISMK PRIVIGFTAL LPSSKRYSMD SRFVEAFGVP
SLLAYDFKKL SEIDPSFTLP KAIADVFDAE MDGVENVAYD ALLKGKIPGD ERTLSFLRFT
LYSHGKTAFC ESLENVRDKV SDNVTLRTLC SSEGMGHKAR SLKEILEAYD VEEHRPVLIF
TARRSTAHEF EVILHNMGVN RVKTLTGELN KEERLQIVNE AKSGNVDVII STHVGEEGID
IPEARLLIMT DVPKSPLRFY QRLGRLIRKS ESKGVKYLVV TLTPKTPEYD DLDEALRSLH
REGVDVSYIV ERKSGKGSTS RILDQVKEKG GEVPLMKLLE MEYDLKDYIM VRGKSNVTQF
LNAKEQDIPY SDFVDRAIMD GDLMYYYDVE GMGDLFAKIL LSKYCQLCYG SQCQGLCDLD
IMVLGRSKQY KLTRKDLLRY FMVLFPPDKL TDVEKRMEIT FENPGFGISL QSNVNEKNSS
ISFNVQLNAS INGITVYPKI TLAYYGVKKE VKDFLKKNVL AICNTAGKIY FSYFTSRPVP