Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1378 |
Symbol | |
ID | 5104588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1353470 |
End bp | 1355443 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507267 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_001191460 |
Protein GI | 146304144 |
COG category | [R] General function prediction only |
COG ID | [COG1204] Superfamily II helicase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000404908 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACTTTC CACTGGCCGA GAGGTTCTTC AGGGAGTACG AGAGCAACAG GGATATCAAT TACCTGATTT CCGCGCCCAC CGGTAGTGGG AAGACTCACA TTGCCAAGAG GGTTCTAGTC GAAGACGAGG GAATATCTGT TTACGTCTCT CCCTTGAAGG CCTTGTCCAG GGAGGTATAT CTCTCAGTTA GGGACAGAAC CAACGCCGTG ATGGCTGACT CGGACGTTTA TGAGGACGAC CTAAGAAAGA TGAAGGGAGA CGTCCTCCTC GCTACCTATG AAAAGTTTGA CAGTGCAATA CGCCATAACT ACACCTGGCT CAACGACGTC TCGAGAATCG TCATAGATGA GGTTCATAAC GTCGAGACCG ACAGGGGGCT GGCACTTGAG AACCTTGTCC TCTGGGCCAA GTCCAGGAGA GTCCCCGTGA TTGCCCTGAG CGCGACCCTT AGCGACCCTG AGAGATACGT GACTTGGTTA AATGCAAAGC TAATTTCGCA CGAGAAGAGG GTAGTTCCCC TCCACGAGTG CGTGGCCTAT CCCTACGTTC TCAAGTGCGG TAATTGGCAG GAGAACCTGA AACCGTCCAG GCTCACTCGT CCCAGATTTG AGTTGTTGGT GTTAGTTCTC CAGAGGATAG TCTCCATGGG AAAGAATGCT CTGGTCTTCG TGAAGAGCAG GAGGTCTGCA GAAACCCTGG CACAGGAGCT TCAGAGGAGG GACTTCAGGG CTCATCACTA CCATAGCGGA ATGCCTCACG AGGACAGAAA GAAGGTTCTC GACATGTTAT TGGAGGGAAA CCTCAACGTC GTGGTATCCA CCACCGCTCT GGGTCAGGGG GTCAATCTCC CAGTTTATGC CGTCGTCTTT TACGAGCTAA AGTTGCCAAA CGTTGATGAG AGGGGAGAAT TCAAGGGCTG GAAAGACATT TCCCCATCCG AGTTCAGGCA GATGGCTGGA AGGGCAGGGA GGCCCAGGTA TGACAAGGAG GGAATGGTTA TCATCATAGC CAACTCCGAT AAGTTTGCGA CCCAACTAGA GGAGAGGTAC TACCGTGGGA GTACTAGGGG TGAGGGAGTG AAACCGGATC TGGACACTCT TTCCTTGGCC TTTGTGTCCT GGAATGACGG CGTTGAAATG GATGAACTTG GAAGATCAAT AAACTCTACC TTCAACTTCA GGGGGGTAGG GTACACTCTC GTCGAGTCCT CGATCTCGAG ACTAAGGGAT ATGAAGCTGG TCATGACGGA CCGAGGAGTT ACAGTAACGC CTTTGGGAAG GGCAGTAGCA GTGAGCTACA TTGATGTAAA GGCCCTGTCA GGCTTTCCCA TAGATAACAA GGACGCCGAC CTAGTGTCCG TGATAGCAGG TTCCCCTGCG GTTGCACAGG CCTTGAGGGG ATGCAAAGAG GGAAGGGAGC TTCTCAACAG ATGGATGAAT GGAAATAGTC TAGATGGAAT TTGCGAGAAA CTGACGTCGA AGGATCTCAT GGAGGTGATC TCCAATGCCA AATGGATTTC CTTTGCCCTG TTTCGTGTTT TGAGGGCTCT AGGGGATGAC AGGTATCGCA AGGCTCTGGA AATTCACGAC AGCCTCAAGT ACGGAGTGCC CTCGGAGGGC GTGAAACTGG CTAGATCAGG TTTACCTAGA GATGCCGTGA TGAGCTTGAT CTCCCTGGAC GTCAAGGATC TGAGGGAACT GTGCATGAAG GTAGGTTACA GGGAGCTCAG GGACGAACTG AGAAGGTCCA ATGTTCAGAT CGAGGTGCTA TGCAGGAGCG TCTACTCCCA AGACCCCCTA ACCTTTGATG TTAGGAGGGC AATACAGGAA TTTCGGCAAA GGGAATTTAG TCTCAGGGAG GTATCCTCCA AGTTTGGAGA GGATGTTCTG AGAGAGATGG TGAGGTTGCG AGTTCTAAGG AAGAGAGGGG ATAAGTACAT AATACGAGAT CTGGAACGTG ACACCACGGG ATAA
|
Protein sequence | MDFPLAERFF REYESNRDIN YLISAPTGSG KTHIAKRVLV EDEGISVYVS PLKALSREVY LSVRDRTNAV MADSDVYEDD LRKMKGDVLL ATYEKFDSAI RHNYTWLNDV SRIVIDEVHN VETDRGLALE NLVLWAKSRR VPVIALSATL SDPERYVTWL NAKLISHEKR VVPLHECVAY PYVLKCGNWQ ENLKPSRLTR PRFELLVLVL QRIVSMGKNA LVFVKSRRSA ETLAQELQRR DFRAHHYHSG MPHEDRKKVL DMLLEGNLNV VVSTTALGQG VNLPVYAVVF YELKLPNVDE RGEFKGWKDI SPSEFRQMAG RAGRPRYDKE GMVIIIANSD KFATQLEERY YRGSTRGEGV KPDLDTLSLA FVSWNDGVEM DELGRSINST FNFRGVGYTL VESSISRLRD MKLVMTDRGV TVTPLGRAVA VSYIDVKALS GFPIDNKDAD LVSVIAGSPA VAQALRGCKE GRELLNRWMN GNSLDGICEK LTSKDLMEVI SNAKWISFAL FRVLRALGDD RYRKALEIHD SLKYGVPSEG VKLARSGLPR DAVMSLISLD VKDLRELCMK VGYRELRDEL RRSNVQIEVL CRSVYSQDPL TFDVRRAIQE FRQREFSLRE VSSKFGEDVL REMVRLRVLR KRGDKYIIRD LERDTTG
|
| |