Gene Information |
Locus tag | Msed_2073 |
Symbol | |
ID | 5105053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1990376 |
End bp | 1991869 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640507963 |
Product | ATPase-like protein |
Protein accession | YP_001192137 |
Protein GI | 146304821 |
COG category | [R] General function prediction only |
COG ID | [COG1106] Predicted ATPases |
TIGRFAM ID | |
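The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the record for Msed_2073.
start_bp, end_bp = 1990376, 1991869
length = gene_length(start_bp, end_bp)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the listed gene length (1494 bp) and protein length (497 aa).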
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.672394 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.790362 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGATACTGT TGAGGCTAGC TGAGTTCTAC GCTAATAACT TCAGGAGTTT GGAGTCAGTT GAATTAAGGG ATGTTGGAGG CTTCAATGTG ATTGTGGGGT TTAACGGTTA CGGCAAAACT AACCTGCTGA CGGCAATCTA CCTATACATA AAGAACCTGT CAGCGGGTAT TGAGAAGAGG TCAATAGAAG ATAAGAACCA GGAATATCTG TTAATGTGGA ACAGTTATGA TACGAGCAAA CCCATTCTTT TAGGAGGCAG GTTACAGTTT AGTGAGAAGG AAGTTGAGAA AGTCGTGGGA AAGAGTATGC CCATGAACAT CGACCTAGTG AACAAGTTGA GGTACATTAA TGGATACCTT GAGTGGGACC TGGACCTCAT CAGAATAAAT GGATCTGTCC CATCCAAGGA CGAAATAGAT CAGGCCAAAA AGCTGTTGGA CTACGCCTCA AGTCAAGTTG AGTATGTTCC CATTTTCGAT CAGAACTACT TTGATGACGT ACTCAACAGA ATTATTGGAC TGAACAGATC GCCCATCAGC TTGAGAAAGT ATTGGTATGA CTTTGCGAAC CTGGTGAGTA ATACTATTCC AGAGGTAAAG GGAATAGAGA TCTGGGATTC AAAGAAGCTA GTTCTAAACG TGTATAATTT GCCAATCTAC ATTGACTTAG CTGCTAGTGG ATTTCAGAGA ATAATTCTTA TCCTTTTCGT AATATGGTTG AGCGGAAACA AGATACTCCT GATCGAGGAA CCAGAGGTGA ACATGCATCC CATCATGCAG TATAAGATGG CGAAGCTCCT GAAGTCATGG ACTGAAAGCG ATGTTCTACA GGTTTTCATG ACAACACACT CTCCCTTTAT TGTATCCTCC GATGTGGATA GCTTCATAGT TCTGAGAAGG GGTCAGACCG CGTCTAAGGC TGTCAACTTC CAGCCCACAG AGGATGTAAA ATCGGCCTTC TCCATTCTTA ACGTAAATAT CAGTGATCTT CTCTTTAACA AGACTATTAT AGTAACGAGT GAAATGGCCG AGCCAAACGT AATCCTGAAT TGGCTCAGGA AACTGAACGT GAATCCAGAG TACAACGGAA TTGTTATATA CACGGTGAGG AACGAACTGG AGTTGCAGAC CTGGCTTAAG TTAAGGAACA TGCTGAAACT TGACATGCTA TTCCTGGGCC TTTGTGACAA AATAGACATA GAGTTAAAGG ACTCCTGTCT TCCCCTTACC AAGGAGGTAG AGTCATTTTA CAGTAAGAGT GGAATGTTAG AGGCACTCAA GAGAATAGGC ATTTACCCAG ATGAAAAGGA AATGAGGGAT CTCTCTCGGG AAGACAACGC CAGATGGTTG ATAAACGTTC TTAAGAGGAG AGGATTGGAT TACGGTACCA TGAGATCGTC TATAGGTGAC ATAATATCTA GAATAGATTC CATTGAGATC CCCAAGGAGA TGGAAATCCT CGTGAATAAA ATTAAAACCG CGCAGGTTAT CTAG
Protein sequence | MILLRLAEFY ANNFRSLESV ELRDVGGFNV IVGFNGYGKT NLLTAIYLYI KNLSAGIEKR SIEDKNQEYL LMWNSYDTSK PILLGGRLQF SEKEVEKVVG KSMPMNIDLV NKLRYINGYL EWDLDLIRIN GSVPSKDEID QAKKLLDYAS SQVEYVPIFD QNYFDDVLNR IIGLNRSPIS LRKYWYDFAN LVSNTIPEVK GIEIWDSKKL VLNVYNLPIY IDLAASGFQR IILILFVIWL SGNKILLIEE PEVNMHPIMQ YKMAKLLKSW TESDVLQVFM TTHSPFIVSS DVDSFIVLRR GQTASKAVNF QPTEDVKSAF SILNVNISDL LFNKTIIVTS EMAEPNVILN WLRKLNVNPE YNGIVIYTVR NELELQTWLK LRNMLKLDML FLGLCDKIDI ELKDSCLPLT KEVESFYSKS GMLEALKRIG IYPDEKEMRD LSREDNARWL INVLKRRGLD YGTMRSSIGD IISRIDSIEI PKEMEILVNK IKTAQVI