Gene Information |
Locus tag | Msed_1102 |
Symbol | |
ID | 5103576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1029592 |
End bp | 1030827 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506997 |
Product | hypothetical protein |
Protein accession | YP_001191190 |
Protein GI | 146303874 |
COG category | [R] General function prediction only |
COG ID | [COG1106] Predicted ATPases |
TIGRFAM ID | |
|
|
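The coordinate and length fields above are mutually consistent. A minimal sketch of the arithmetic, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon (neither is stated in the record; the function names are illustrative only):

```python
# Values copied from the Gene Information fields above.
START_BP = 1029592
END_BP = 1030827


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the computed values match the listed Gene Length (1236 bp) and Protein Length (411 aa).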
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.89522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATACA GGGTAATTCC CTCGAGGCCA GTGACAGTGA TAGAGAGCTT ACCGAACAAA AGTGTATCCT CTGACGATCC GAGCAGTTTC CTTGTGGTTG ATCAGGAGGA TAATCCCCTG GACTCTTATT GGGACGGAAA GAGACTCTAC GTCGTTCCTA AGGAGAACAC CACTGCTATT TACCTCATAG AGGCCTCACC TGGTTCCACG GAGTTCAAGG GATCCAACCC GCCCATGACG GAGAGACCCT CATCACCCGG TATTCCACGG GAAGTTGGGG TTGATAGGTT GAGGTTCTCC AACTTCAAGG GGATCGCTGA GGGTGAGCTT GATCTTAGGG ACGTGGCGAT CATATTAGGA GGAAATAACG CAGGCAAAAC AACCGTCCTG GAGGCCATCT ACCTGCTCCT TAATCCCGAC ATAAGGGAGG CGTTCAACGT CCTTCCATAT CTTCGTCAGG TCGATGAATT GAGTGTAAAG GCTGGCATTA ACGAGATCCA GAATTGGGTA AACCTGTTCC GATACTATCA GAGGGGAAAT TTTAGGATTG AGTCTGGATC GAATTTCGTG GAAGGTCGTT ACGATAACCA GAGAATTCTC CTGAAGCATC CGGATTATGA GGCTGGCATG ACTCCTGGTG AAGGATTTAG CATCATGAAG GGAGTTTCCC CCGAGTCTAG AATTGAAACC TTGTTATTCA GTCCAAGACT TGCATACGTT TATTTCTCGA GAATAGCTCA AAATTGGGAG GAAATATCCA ATCTAACCGA AGCCGTAAAC TCTATTCTCG ACGAGTTGAA TGAAATTAGC AATGAAAAGT ATCAATTTAT CACCTTCGAG CCCTTCAGAG GTATTCAGAC GCTGTACCTA GTGAAGGATG ATAAGAAGAG AGTTAGGATT GCTGATGTAG GTGAGGGGTT CAAGATATAT GTAATCCTGA GGCTAATGTT TGAGTATTAT AAGCCAAGGG TTCTCCTATG GGATGATATA GAGAGTCACA TGAATCCCTT CACCCTAGCC TCGATTTCAG GTTGGCTCTA TAGGATATCC AAGACAAGGC AGATCCTTGT GTCTACCCAT AGTCTCGAGG CTGCGAAGAT CGTGATGAAC GCCACAGGAA AGGACAGTAT AATAGTGGAT GTTATCGATG GTAAAATGAC ATACAGGAGA CTTTCCCTCT CTGAGTTAGA GAGATATGAG GATTTGGGAG TTGATCCCAG GACCCTGAGG GTTTAA
|
Protein sequence | MIYRVIPSRP VTVIESLPNK SVSSDDPSSF LVVDQEDNPL DSYWDGKRLY VVPKENTTAI YLIEASPGST EFKGSNPPMT ERPSSPGIPR EVGVDRLRFS NFKGIAEGEL DLRDVAIILG GNNAGKTTVL EAIYLLLNPD IREAFNVLPY LRQVDELSVK AGINEIQNWV NLFRYYQRGN FRIESGSNFV EGRYDNQRIL LKHPDYEAGM TPGEGFSIMK GVSPESRIET LLFSPRLAYV YFSRIAQNWE EISNLTEAVN SILDELNEIS NEKYQFITFE PFRGIQTLYL VKDDKKRVRI ADVGEGFKIY VILRLMFEYY KPRVLLWDDI ESHMNPFTLA SISGWLYRIS KTRQILVSTH SLEAAKIVMN ATGKDSIIVD VIDGKMTYRR LSLSELERYE DLGVDPRTLR V
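Both sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display could be normalized before further analysis, and how a GC fraction could be computed from it, is shown below. The helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, not the full gene.

```python
def normalize_sequence(blocked):
    """Remove the whitespace used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the gene sequence above (a fragment only).
fragment = normalize_sequence("ATGATATACA GGGTAATTCC CTCGAGGCCA")
fragment_gc = gc_fraction(fragment)
print(len(fragment), round(fragment_gc, 3))
```

Run on the full gene sequence, the result of `gc_fraction` can be compared against the GC content field listed in the record.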