Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0862 |
Symbol | |
ID | 5105221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 795279 |
End bp | 796616 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506766 |
Product | hypothetical protein |
Protein accession | YP_001190959 |
Protein GI | 146303643 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.207625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATGCA CTGTACCCAA GGTTACCGTA ACCACGTGGA ACGCTAAGAA CGTGGTTCTG GCCGGGCCAA CTTACAGGAG ATATCTCGAG AAGACCTTAC TTGCGGTCAA GAATAAGGAC ATCGCCTCGG TGATAGGGCA ACCTGGAATG GGAAAGACTA CCATCCTCAG GAAGACTCAG GAGGAAGCTC CAGGTCTCAC CTTCTTCCTA GACCTAGCTA GTAAGGCAGA GATTGAGGAC GAGTTCTGGG GTAAGGTGGA CCAATTCAAG ATAAGGGAAC TGGTTCTCCC CACTCTCAGG GGCAAAGCTA ACAAGCTAGG TTACGGTTTC CTTAGAAGGC TAACCGGTGT TAAGCTAGAG GATTGGCTAC TCAAGGTCTG CAACAAGTAT GACGACATAC ATCTCAGGCT ATTCTGTTCG AATTACCCCA AGGACTTTGA CGGAATGTTG AAGTTCCTCG GGGACCTGAA GAACGTGATT GACGTTAACC TCATGGTAGA TGAGGTGAGG GACTCTCACA TACCCAAGAT CCACAGGCTG ATTAACGCTG GTCTCGGTAT CCCAGTAATC ATGGCAATTC CCACGGATTC CTACAGTAAG GTTACCGATT TGGCCGTGAG AAGAAGATTG GATGAAAGCA GGGTATCTCT CGATACGGTG CTTACCCAGG AGGACATCAA GGAAATCATA GATGCTTACT GTCATCCCCT AGCAGAGGAC CTCTTCCCCA TCATATACTC CCTATGGAGT GGGGGAGAAC TCAACACAGT CAGTTCCATG TTGCAGTACG TGAAATCACA AGTTGAGAAT TTTGAGAGGG AATGTGGGGA TAACCTGGAT TGTTTCAGGG AGAAACTGAG AAGCTCACAC TCCCTTAAGA ACCCTGAAGA AGACTCAAGG GAGATGGAGA AGATGGTGAG GGAAGTTCTG TCATCCGAGG GAAAGGAGAT GGGTATCTCT TACGTCCATC CAAGGGGGAA AAGGGTTGAG GCCAACGGCA AGTTCATGGT GGTGGGAATA TTCTTCATAA AGGACGAACA GGCGGTACTG GGTCAGGTGA AGCTCATGAA AGATGACAGG GAGAGCGATG ACGAGATCAA CCTTCTGCCT GAGGTAAGGA CGGTGGAGCA TGAGAAGAGG AACTATCCCG TGGGTAAGAG GTTCGTGATC ACGAACTCAG CCAAGTTAAA GGTACCCAAC TCCGTGAACA AAATAGAGAT CTCAACCTTC GAGGCCGTGC GTATACTTAG AGGGGATGGT GAAATTCTCA GGGAGATAGT CAGACCCCTT CAAGATTTAC CTGGTGCAGG GCGAACCCCT GTAGAGAGCA CAGCTTAG
|
Protein sequence | MICTVPKVTV TTWNAKNVVL AGPTYRRYLE KTLLAVKNKD IASVIGQPGM GKTTILRKTQ EEAPGLTFFL DLASKAEIED EFWGKVDQFK IRELVLPTLR GKANKLGYGF LRRLTGVKLE DWLLKVCNKY DDIHLRLFCS NYPKDFDGML KFLGDLKNVI DVNLMVDEVR DSHIPKIHRL INAGLGIPVI MAIPTDSYSK VTDLAVRRRL DESRVSLDTV LTQEDIKEII DAYCHPLAED LFPIIYSLWS GGELNTVSSM LQYVKSQVEN FERECGDNLD CFREKLRSSH SLKNPEEDSR EMEKMVREVL SSEGKEMGIS YVHPRGKRVE ANGKFMVVGI FFIKDEQAVL GQVKLMKDDR ESDDEINLLP EVRTVEHEKR NYPVGKRFVI TNSAKLKVPN SVNKIEISTF EAVRILRGDG EILREIVRPL QDLPGAGRTP VESTA
|
| |