Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0257 |
Symbol | |
ID | 5103877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 217547 |
End bp | 219727 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506163 |
Product | protein of unknown function DUF699, ATPase putative |
Protein accession | YP_001190358 |
Protein GI | 146303042 |
COG category | [R] General function prediction only |
COG ID | [COG1444] Predicted P-loop ATPase fused to an acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.585668 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCAA ATTATGACAT TCAGGAGATA TTTAGGGACG CAGTGTCAGG CAACTATAGG AACCTTGTGG TGATAGAGGG GGACTATCGA GAACCCCTCC TTGAGTTCAT CCATGAGTAT CTGAAGGTGG AGAGGAATCC GTCCACTGGA TATTACTTCC ATCCATGGGC GCAGGGAAGT AAGGAGAGGC TAAACTGGAT AAGGTCAGTT CTAACAACCG TGACTGACAT TGATTATTCC TCCTCCGAGA GGTATCTGGG AAGCACGTTC GACCTTTCCA TAATAGATGC GGTAGATGAT TTTAGGCCGT CATATGCGGC TAGGGCCGTA GAGACCGTTA GGGGAGGAGG GCTCATAATC CTTTACACAA ACAGATTGGA GGAGGGGAAA CTGTACAGGG ACACGTTAAC GAGAGACGGT AAGGTTAACA ATTTATTCGA GGAAAGGTTC AGGCATAAAC TACTTTCCCA CAGGGGTATA CTATACTGGA ATGATGGGGA ACTAATCGTA AGGCCCTACT CGAGCTCGGA GGTAAGCAAG CCTAAGAGAT CTAGGAAGGG AAAATACCCT GAGCTTGCCA AACTCTGTAA AACGGATGAC CAGGTTAAGG TACTTGACGA GATAGATTTC CTCCTCGAGG AAGGGAAGAA GCTCTTGGTT GTAACGGCTC CACGCGGGAG GGGTAAGAGC GCCTCAGTGG GGCTCGCCCT GCCTCTGCTT ATTTCCCAGA GCAAGTATCC CCTCTCCATT GTAGTTACTT CCCCGACCTA CTGGTCAGGT GCTGAGATAA TGAGATTTTC AGAGATGTCC CTAAAGGCCC TTCACAAGAG GTTCAGGAAA GTTATCTCTA GGGATGGCAA AATCCTCTCT CTGGAGATTG GAGAGAGTAG AATAAGATGG CTTCCCCCTG AGCTAGCTAG GGATCAGCAT GGGGATATAC TCGTTGTGGA CGAGGCAGCT GCTCTCGGTA AGGAGTTCGC TGATTACGTC CTGAGGCGAT GGAATAAGGT TGCCCTGGTC ACCACAGTTC ACGGATATGA GGGATCAGGA AAAATATTCC TGAAAATATT CGATAGCTAT GACGGGGAGC ACGATGTTCA GAGGCTGAAA CTGGATTTCC CCGTGAGATA TGGGAAGGGA GATCCGGTGG AGAGATTTCT AAATGATGCT TTCCTGTTGA ACGCAGATGC CACCGAGGGA GAAGATCTGA ACAAGGTAGC AGAGGTCGAC GTGAGGTCCC TCTTCGAGGA TGAGGAGAGG TTGGCTTCAG TCTACGGAAT CTTGGTAACT GCCCATTACA GGAACACTCC TGACGATCTC ATGATGTTGG GGGACATGGC GTTTCAGAGA TTATTTGTGG CTGAAAACAA TGCTGGTGTT GCTCAGGTTG TAGAGGAGGG AGGACTTTCT CAGGAGGCAA TAAAATCCAT AGCCCAGGGA GAAGAAAACC TGGGGCATCT GATTCCCCAT AGGCTCGTAA AGTACTGGAG GCTGTTTGAA TTTGGTGAGT TACGTGGCTG GAGAGTGATG CGTATAGCAG TCGCTCCCGC ACTTCAGGGG AGAGGGATAG GATCTAAGCT CCTCAGGGAA ATTGAAATTA AAGGTGAGGG GGAGGGTATT GATTGGATTG GATCATCCTT TCTCGCTAGC TATAACGTTA TCAAGTTTTG GGCAAAGAAC GGTTACATAC CAGTTCATGT CTCCACGAAG AAGAACGAAA GTCTAGGGGG ATACTCGGTT ATTGTTGTGA AGCCGTTCTC GTCCGCGGCT AAGGAAATGA GCAGTCGCGT TTCCCTACTG CTTAAGGACA AATTGCTCAG GACATCCCAT CAGGTATACT TCAATTTAGA TCCAAGGATA CTAGCCATTC TACTCAAACT CACACCTCCC GCTTCCAACG TTACAATCTC AGATCTATAT GTGAAGAAAC TCAGAGCCTA CCTGGAGGGA CTTCTACCCT ATAACTCTGT CGCGGAGGCT GTTCACCTTC TAGGGGAGAA GTACTTCAAG GAATTACGAT TTGACGTGGA CGACGTTTCC CTGGCCACGC TAATCTCAAG GACATTCCAG GGCAAGAGCT GGTATCATGC GGGGGTATCT CTGGGCCTAA CGAGCTCTCA GGTCGAGCAG AGACTTAAGG ACACGATCGC ATTGATCTTA GAAAAATATC AGTTATCATA G
|
Protein sequence | MKSNYDIQEI FRDAVSGNYR NLVVIEGDYR EPLLEFIHEY LKVERNPSTG YYFHPWAQGS KERLNWIRSV LTTVTDIDYS SSERYLGSTF DLSIIDAVDD FRPSYAARAV ETVRGGGLII LYTNRLEEGK LYRDTLTRDG KVNNLFEERF RHKLLSHRGI LYWNDGELIV RPYSSSEVSK PKRSRKGKYP ELAKLCKTDD QVKVLDEIDF LLEEGKKLLV VTAPRGRGKS ASVGLALPLL ISQSKYPLSI VVTSPTYWSG AEIMRFSEMS LKALHKRFRK VISRDGKILS LEIGESRIRW LPPELARDQH GDILVVDEAA ALGKEFADYV LRRWNKVALV TTVHGYEGSG KIFLKIFDSY DGEHDVQRLK LDFPVRYGKG DPVERFLNDA FLLNADATEG EDLNKVAEVD VRSLFEDEER LASVYGILVT AHYRNTPDDL MMLGDMAFQR LFVAENNAGV AQVVEEGGLS QEAIKSIAQG EENLGHLIPH RLVKYWRLFE FGELRGWRVM RIAVAPALQG RGIGSKLLRE IEIKGEGEGI DWIGSSFLAS YNVIKFWAKN GYIPVHVSTK KNESLGGYSV IVVKPFSSAA KEMSSRVSLL LKDKLLRTSH QVYFNLDPRI LAILLKLTPP ASNVTISDLY VKKLRAYLEG LLPYNSVAEA VHLLGEKYFK ELRFDVDDVS LATLISRTFQ GKSWYHAGVS LGLTSSQVEQ RLKDTIALIL EKYQLS
|
| |