Gene Information |
Locus tag | Msed_1354 |
Symbol | |
ID | 5103413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1325755 |
End bp | 1327737 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507243 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001191436 |
Protein GI | 146304120 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
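The coordinates and lengths listed above can be cross-checked with a little arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes an untranslated stop codon. The record states neither assumption explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1325755
end_bp = 1327737

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1983 bp) and protein length (660 aa).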
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0575606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTATTT CCGAGACCTT CGTCGAGAAA CTTAGAGATT CCCTTGGTTT CGCGCCCTAT CCCTACCAGG AAAAGGTCGT GCAGGACGTT CTGGACAACT TGAGGAAGAG CCGTTTCATT GTGGTTTCCA TGCCCACGGG ATCGGGTAAA ACCCTAGTGG AGCTTGCCCT AGCTGACTAC CTAAGAGGGA AGGGGATGAA AGTCCTGGTC CTGGAACCCA CGAGACTTCT CTGCGATCAA ATGTATCATA ACTTTTGGGT CAAAGTCTTT GATGATGTGG GGGAAGAGTA TGAGGGAAAT TGCACCAGTT TCGAGGAAGG AAAGGGCGTG ATCGTGTCAA CTCCATTCAC CTCCTCCAAG TGTATGCCAA AGGTTGATGC GGTCATCGTG GACGAGGTAC ACCACGCCTT TGGAGATCCG AGGTACATGT CCGGCCTAAT TTCCATGAAG CCGAGGATAG TGATAGGCTT CACTGCCCTC CTACCCAGTT CCAAGAGGTA TAGCATGGAC TCGAGGTTCG TGGAGGCCTT TGGGGTGCCC TCTCTTCTGG CCTATGACTT CAAGAAGCTC TCCGAAATAG ATCCTTCCTT TACCTTACCC AAGGCGATCG CGGATGTTTT TGATGCGGAG ATGGACGGTG TTGAGAACGT GGCGTACGAT GCCCTCCTCA AGGGGAAGAT TCCAGGGGAC GAGAGGACGT TAAGCTTTCT AAGGTTTACC CTGTATAGTC ACGGAAAGAC GGCCTTCTGC GAAAGTCTGG AAAACGTGAG AGATAAGGTT TCAGATAACG TTACGTTGAG GACTCTCTGC AGTTCAGAGG GTATGGGACA TAAGGCTCGA AGTCTTAAGG AGATACTCGA GGCCTACGAC GTCGAGGAAC ATCGGCCGGT CTTGATTTTC ACTGCGAGGC GTTCCACGGC CCACGAGTTT GAGGTAATAC TCCATAACAT GGGGGTGAAC CGAGTTAAGA CTTTAACCGG GGAGTTAAAC AAGGAGGAAC GCCTTCAGAT AGTTAATGAG GCTAAGAGCG GTAATGTGGA CGTGATCATC TCTACGCACG TGGGTGAGGA GGGGATTGAT ATCCCCGAGG CTAGGCTCCT CATAATGACA GATGTACCTA AGAGTCCACT CAGGTTCTAT CAGAGACTCG GCAGACTAAT AAGGAAGAGC GAATCCAAGG GCGTAAAGTA TCTGGTCGTG ACCTTAACTC CCAAGACTCC CGAGTACGAT GATCTGGATG AGGCTTTGCG ATCGCTTCAC AGGGAGGGTG TTGACGTGAG TTACATTGTG GAGAGGAAGT CGGGGAAGGG ATCCACATCT AGAATACTGG ATCAGGTTAA GGAGAAGGGA GGGGAGGTTC CCCTCATGAA GCTACTGGAG ATGGAATATG ACCTGAAGGA CTACATAATG GTAAGAGGAA AGTCCAACGT CACTCAGTTC CTTAACGCCA AGGAACAGGA TATTCCATAT TCTGACTTCG TCGATAGGGC AATCATGGAT GGAGATCTCA TGTACTACTA TGACGTAGAG GGAATGGGAG ACCTGTTTGC CAAGATATTG CTCTCCAAGT ATTGCCAACT ATGTTATGGT TCCCAGTGCC AGGGTTTATG TGATCTGGAC ATCATGGTAC TTGGGAGAAG CAAACAGTAT AAGCTCACTA GAAAGGACCT TTTGAGGTAC TTCATGGTGT TGTTCCCACC TGACAAGCTG ACGGATGTGG AGAAAAGAAT GGAAATCACT TTCGAGAACC CAGGTTTCGG TATTTCCCTT CAGAGCAACG TAAATGAGAA GAACAGTTCC 
ATCAGCTTCA ACGTTCAGTT AAACGCGTCA ATAAACGGTA TCACTGTCTA TCCTAAGATC ACCCTGGCGT ATTATGGAGT AAAGAAAGAG GTTAAGGACT TCCTAAAAAA GAATGTCTTA GCTATATGCA ATACTGCAGG CAAAATCTAT TTCTCTTATT TCACTTCCCG ACCTGTTCCT TGA
|
Protein sequence | MSISETFVEK LRDSLGFAPY PYQEKVVQDV LDNLRKSRFI VVSMPTGSGK TLVELALADY LRGKGMKVLV LEPTRLLCDQ MYHNFWVKVF DDVGEEYEGN CTSFEEGKGV IVSTPFTSSK CMPKVDAVIV DEVHHAFGDP RYMSGLISMK PRIVIGFTAL LPSSKRYSMD SRFVEAFGVP SLLAYDFKKL SEIDPSFTLP KAIADVFDAE MDGVENVAYD ALLKGKIPGD ERTLSFLRFT LYSHGKTAFC ESLENVRDKV SDNVTLRTLC SSEGMGHKAR SLKEILEAYD VEEHRPVLIF TARRSTAHEF EVILHNMGVN RVKTLTGELN KEERLQIVNE AKSGNVDVII STHVGEEGID IPEARLLIMT DVPKSPLRFY QRLGRLIRKS ESKGVKYLVV TLTPKTPEYD DLDEALRSLH REGVDVSYIV ERKSGKGSTS RILDQVKEKG GEVPLMKLLE MEYDLKDYIM VRGKSNVTQF LNAKEQDIPY SDFVDRAIMD GDLMYYYDVE GMGDLFAKIL LSKYCQLCYG SQCQGLCDLD IMVLGRSKQY KLTRKDLLRY FMVLFPPDKL TDVEKRMEIT FENPGFGISL QSNVNEKNSS ISFNVQLNAS INGITVYPKI TLAYYGVKKE VKDFLKKNVL AICNTAGKIY FSYFTSRPVP
|
| |
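The relationship between the gene sequence and the protein sequence can be illustrated by translating the first ten codons. This is a sketch, not the database's own pipeline: the helper `translate` is a hypothetical name, the codon table is built from the standard compact layout (translation table 11 assigns the same amino acids as the standard code), and the first codon is assumed to be an initiator read as methionine, which is how the GTG start of this gene yields the leading M.

```python
from itertools import product

# Standard genetic code in the usual compact order (bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is read as 'M',
    even when it is an alternative initiator such as GTG.
    """
    seq = cds.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_codons = "GTGAGTATTT CCGAGACCTT CGTCGAGAAA"
print(translate(first_codons))  # MSISETFVEK
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence (with the spaces left in) should reproduce the whole protein under the same assumptions.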