Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1460 |
Symbol | |
ID | 5104830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1429812 |
End bp | 1431572 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507348 |
Product | thermopsin |
Protein accession | YP_001191541 |
Protein GI | 146304225 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTGT ATGAGTCTAA CACCTCTACC ACCCTTAGCG CAGGTCAATA CGAGTATTTC CCGCTCAACG TTAACACAAC AGAGATACTT TCCTACTCCT GGAATTCAAC AGGTAGTGTT GCAGTGATGG TAATGAACCA GACCCAACTT CAAGAATTTC TCAATGGTAC AGGAAGCCCC TACAAGGGGT TGGTTATCCT GAATTCATCG TTCAGCAATC AAGTTCTGCT AACTCCCGGT AAGTATTACT TTGTGCTCTA CGCCTACCTA CAACAGGTAA CCCTGCAGTA CAGCCTGAAA CTGGTTCCAG CTCAGGTGTC CTATACCCTT CCGGTGGGGT ATCAGGAAAA TTATCAGTTG AACCTAAGCT ATCCCTTTCA TCTTTACCTC TATCTAGTTT CCAATAACTC GTTCTCTGTG AGGGTCACTT CAGGCAACGT TACTTACTTC AGTGCAGCTC CCTCCAGGGA TACTCCTCTT ACTTTCGTCA ATCACACGTT AACCCTCAGC CCAGGCAACT ATTCCATTAC TGTGGTTAAT CCTGGATCCT CAACGATAGC CGTTTATTCC TCAGTGCTGT ACGCCTCCAC TTATCCAGAT CCCTTATCCT TGAATAGGAC AGATTACCCT ATGGGAGTGG CAAGTTATGG CCTATTTAAC AGGTCAGGGG TACCAGTCCC GTACGTGGTC AAGGCATCCT CCGTAGTGGG GTTTGCAAAT ATTTCCTCTA TCTTTGCTTA TAATCAGACT GCTGAGAAAC TGAATGTCTC CCCCTATTCG GCGAGTTTAC AGTTAAATGT CCCACTGGTC GTGATAAACG GAAAGCAGAA CCAGACTTAC TGGGTACAGA ACGTGATAGT TTTCATGACT AATGAGTCGA CCCTATGCTA CGAGTCCTCC GTCTTAAACG TGACTAACGC GAATGCCACC TTAACAAACA TCTCTATACA AGGGAGAGGC GGAGTGTATC CGCCCTTCAA TAATGGAATA TACTATACCT ACAAGACTAA GGGAGTGCAG TACAAGACGC CCCTATCTCT CTTGATCTCC ATAAATGTAT CAGTGATAAA GAAACTTGGA GTGAGAATAG GCTTCGATTA TAAGGTGCTT GAGAACGGTT CGGTAGTTAA CGGTAGCTGG AACCAGTTTG ATTCTCCTCT CATTCTAGAC TCGGGAGTGT CACAGGCCTA TCTTTACGTG GATGGATATA ACTCCCCATC TACCCTGAAT TTCTATGATG CTGAACTAGT TTTTGGAGGC GGAGGAAACG GAGAGGTGGC ATATTTCCAG AACCTTTCGG CTACCCTTGC CATCTTCTAC TATAACGGAT CTCTTCATCC CTTCCCAAGT GTGTATAGCT TCGGCGCAGA TACTGCCGAG GGCACGAGCG ACTTACACGT GTCATTAATG AATGGTCTGG TTTCCGTTTC TAAGGGTCAG GACAACCCAG TCTTTCTCAC GAACCAGTTT AATGCGTCCA TACCGGTATT GCGAGTTGTC GTTAATCATG TTCAAAACAA GAGCTCTGTG TCTAACGTTA CCACTACTAC AACTCATACG AACACATCAA CTTCCACCAG TTCCAATGTT ACCAAGAATA CTGTACCTCC ACCAAGTAAC ACTAGCCAGA CTTCTAGCGC CCCTACGAAG AAGGGTGGGC TTCCCCCTTA CCTTCTACCT GGGCTAGTGA TTGCCGTAAT CGTCGTGATA GTAATATGGG TGCTCATTAA CAGGTTCAGA AAGCCCGACC TGAATATATG A
|
Protein sequence | MNLYESNTST TLSAGQYEYF PLNVNTTEIL SYSWNSTGSV AVMVMNQTQL QEFLNGTGSP YKGLVILNSS FSNQVLLTPG KYYFVLYAYL QQVTLQYSLK LVPAQVSYTL PVGYQENYQL NLSYPFHLYL YLVSNNSFSV RVTSGNVTYF SAAPSRDTPL TFVNHTLTLS PGNYSITVVN PGSSTIAVYS SVLYASTYPD PLSLNRTDYP MGVASYGLFN RSGVPVPYVV KASSVVGFAN ISSIFAYNQT AEKLNVSPYS ASLQLNVPLV VINGKQNQTY WVQNVIVFMT NESTLCYESS VLNVTNANAT LTNISIQGRG GVYPPFNNGI YYTYKTKGVQ YKTPLSLLIS INVSVIKKLG VRIGFDYKVL ENGSVVNGSW NQFDSPLILD SGVSQAYLYV DGYNSPSTLN FYDAELVFGG GGNGEVAYFQ NLSATLAIFY YNGSLHPFPS VYSFGADTAE GTSDLHVSLM NGLVSVSKGQ DNPVFLTNQF NASIPVLRVV VNHVQNKSSV SNVTTTTTHT NTSTSTSSNV TKNTVPPPSN TSQTSSAPTK KGGLPPYLLP GLVIAVIVVI VIWVLINRFR KPDLNI
|
| |