Gene Information |
Locus tag | Msed_1024 |
Symbol | |
ID | 5104327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 948514 |
End bp | 949539 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640506923 |
Product | hypothetical protein |
Protein accession | YP_001191116 |
Protein GI | 146303800 |
COG category | |
COG ID | |
TIGRFAM ID | |
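
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the gene span includes the terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(948514, 949539)
print(length, protein_length_aa(length))
```

With the values from this record, the span 948514 to 949539 gives 1026 bp, which is 342 codons, or 341 residues once the stop codon is excluded.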
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.687262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.15221 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG GAACTTCTAC TCTAAATATT AAATCTTTGG GAGTTAGGAG AAGGAAGATA AGATCTATAG GTGCTTCCTC GACCATTGTT CTTGTCACGG CCATAATTGT AATTGGAGTT GTTGCAGTAA CAGGATTATA CTTGTTCTCC CCGGCATCTC ACTCCATTCA ACAACCTTCA TCTACTCCTC CATCTACAAC CACATCTCCT CCATCTACAA GTACGTCAAG TAACGGTGGA TCCTCTGGAA GCACATCTAC CAATCAGGGG GGTCAACAGA GTACTTCCAC TGGTACATCG ACACAAGGCA CACAGTCCTC ATCATCTACA GGCGGATCCT CGTCTGGAAG TACAACCTCC TCCAGCTCCG GTAGTTCAGG CTCATCGACC TCTTCCACAA TTACCGCGGT GGAAGTCCTT GATGTGTGCG GATCTATCTC AATCGTGTCT GGTCAATTCC AGGTGAATTC AAACGTGCAA TTATCTACCT CCTATAGTGG GAGCGAAGCC GTAATTCAGG GTTCCGCCTC CGAAGTAGGG AACACCACGA TCACTGTTCC TCCAACAGTG ACGCAGATAG TCATTGAGAA ATCCAATGCT AACATCTACA TCTCGAACAA CTACGTAACC AGTATTACGG CCATAACCAG TAACGGTAAC ATCGAGATAA TATCCAATTC TGCAACTAGT GTTGAGGCGG AAACATCAAA TGGTGGAGTA ATTCTTCAGT TAAACTCTCC AACCTCTGTG AGTGTGACTG CCTCTAATGG GGCAATACAA GCCCAGTTCT CCACTCTAGA CGGCGGATCT ATTTCCCTTA TTACTTCCAA TGGTAATATC TACTTCACGA CTCCAACATC CTCAAGCATA GAGGTGTCTG CCATGACCAG TAACGGTGCC ATCTCTTACA CCTTACCACT AACAAACGTA CAAAATATCA ACGATCAAAT ACTCACAGGA TCTATGAACG GAGGAGCAAC ACAGGTATCC CTTGAAACTT CTAACGCAGA CATTCAAATA AACTAA
|
Protein sequence | MKKGTSTLNI KSLGVRRRKI RSIGASSTIV LVTAIIVIGV VAVTGLYLFS PASHSIQQPS STPPSTTTSP PSTSTSSNGG SSGSTSTNQG GQQSTSTGTS TQGTQSSSST GGSSSGSTTS SSSGSSGSST SSTITAVEVL DVCGSISIVS GQFQVNSNVQ LSTSYSGSEA VIQGSASEVG NTTITVPPTV TQIVIEKSNA NIYISNNYVT SITAITSNGN IEIISNSATS VEAETSNGGV ILQLNSPTSV SVTASNGAIQ AQFSTLDGGS ISLITSNGNI YFTTPTSSSI EVSAMTSNGA ISYTLPLTNV QNINDQILTG SMNGGATQVS LETSNADIQI N
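
The relationship between the gene sequence, the GC content field, and the protein sequence can be checked with a short script. The sketch below is an illustration only, not the pipeline that produced this record. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; table 11's alternative start codons are not handled. All helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAAAAAAG GAACTTCTAC TCTAAATATT"))
```

Applied to the first 30 bases of the gene sequence, this yields MKKGTSTLNI, matching the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein length fields to be compared against the record in the same way.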