Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1640 |
Symbol | |
ID | 5104166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1581387 |
End bp | 1582727 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640507531 |
Product | hypothetical protein |
Protein accession | YP_001191719 |
Protein GI | 146304403 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTCAG TGGTGGTAAC TCTTCTAGTG GTCCTCGTTA CATTAATGCT AGCGATAGGT GTATTTGGTC TTTACAATTC TTACTTTTCC ACCCAAGGCT CAACAATTGC GGGAAATGAG ATTGTAGTAT CTCAATCGAA GCAAACCGAG ATCTTCGTGT CACCTATAAC ATTCAAGGGG ATAGGGCCCA GCTACACTTA TTTTAATGTA TCGTTTCTAC TATGGTACTC TGCTCCAGTG AAGAATGTGG TTCTAGTTCC ATTCGTGGTT AATCCTTTGC CTGGTCTGGG AACTTACCTG TATAGTCCCC AAGGAAATCC TAACGCAACT ATTCTCATGA ATTACTCGGG AGGCCTTAAG ATATTGCCCC ACTTCACGCT TAATTCGCCT GTTTATACTC CACAGGGGCA ACAACTTAAG TCAGGAGTAC CAGCATACAA CGCGTCAAGC ACCGGGACAT ATATTGTACA TTTCGTTGTG AAGCCAGGTC AGATCGTTGT GGTCTGGGTC TTAACGCATG AGTTCGGGAA GTGGTATAGG CTCGGCTACT TCTTCGTGAA CCCAGCTGAT GCTGGTTTAG GACTTTATGT TGTAACGCAC ACAGGCAATT ATCTAGGAAA TAGTAAACAA GTGAATTTCC AAGCACCTCA TCTATTCACC AACAACCAGG GAGTTCAGAC AGGTCTATGG TTTGAACCGC TGGGTAATGC TACAACGAAT TCTACCATAT TTTATGCCAC ATTAAATACT ACTAATAACA ATTTTTACTA TCTAAAAATA TATCAAAGCG GATATAATAT TTACGTTAGT TCCAATTACA GTAAGAGCAA TGGAAATCCC CAACTTCTAG GTTCAGTATC TCCATTTAAT TGGTATTTCC TAAACTTTAC TTACGGTCAT CAAACAGGTA ATAATGTAAT ACTTTTTAGT GATACTGGAA AAATTATTGG AAGTGTTGCT TTTCCTATTG GTCAAACTAA TGGGTCCGAG TTGAAAATTA GTTTTGGTTC CAATAGTTTT ACAGACGCTA TATCTCAAGC GTTCTTAGTG ACAAAACAAA GCAATAGTCC AGGCACGGAT ACATCTTTCT ATAACGTATC TACCACTATG CTTACTCATG GTCCATATTA TAATAATACT ATGGCGTATA ATTGGACTAT CAATCATGCT CAACAGTCTC TAAACGGAAT AGTCTATTGG AACTTTGTGT ATCCATCCTC CTCTCCACCA GCTATATTAT CTGCTATAGT TTGGTATTGG CCGTCGGGAT CAGGTCATTA TAAATATGCT AGCATTACAT ACTTGATGGA ATCTGGGCCT AATACCTGGA TAATAGGATA G
|
Protein sequence | MPSVVVTLLV VLVTLMLAIG VFGLYNSYFS TQGSTIAGNE IVVSQSKQTE IFVSPITFKG IGPSYTYFNV SFLLWYSAPV KNVVLVPFVV NPLPGLGTYL YSPQGNPNAT ILMNYSGGLK ILPHFTLNSP VYTPQGQQLK SGVPAYNASS TGTYIVHFVV KPGQIVVVWV LTHEFGKWYR LGYFFVNPAD AGLGLYVVTH TGNYLGNSKQ VNFQAPHLFT NNQGVQTGLW FEPLGNATTN STIFYATLNT TNNNFYYLKI YQSGYNIYVS SNYSKSNGNP QLLGSVSPFN WYFLNFTYGH QTGNNVILFS DTGKIIGSVA FPIGQTNGSE LKISFGSNSF TDAISQAFLV TKQSNSPGTD TSFYNVSTTM LTHGPYYNNT MAYNWTINHA QQSLNGIVYW NFVYPSSSPP AILSAIVWYW PSGSGHYKYA SITYLMESGP NTWIIG
|
| |