Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0664 |
Symbol | |
ID | 5103824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 607828 |
End bp | 608739 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506568 |
Product | hypothetical protein |
Protein accession | YP_001190763 |
Protein GI | 146303447 |
COG category | [S] Function unknown |
COG ID | [COG4034] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0181311 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00974404 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATTCAC TAGTTCTGGG AGCAGGCGGA GGTGGAGATG TAGTATCTGC ACTTCTCCCT TACGAGAGGA TCAGAAGAAG AGGTGAGAGA GTAACTCTGG GAGCTGTGCT ATGGGAGAGA AGAGTGGAGG ATCCAGTTCC GGGCCCCATC TGCACTAACG ATCTCAGAGA AGCACAGGTA ATAAACGAGA GAGTGGCGAT CCTTGGGCCA AACTCCTTCG CTGTAAGGGG AGGAAGGAAA GTGAAACCGC AGGCTGCAAG AGTAGCGCAA GTATTGGGGA TTAATGTTGT TTCATTCTGT ATATCTGGAG GTGTTAGTGG TCTATACAGG GATATCATGG AGATAGCCAA TATGCTAGGG ATCGATCAGG TCATAGGGGT GGACGCTGGA GGTGATATTC TTGCGGAGGG AACAGAAAAT AATCTCCTAA GTCCTCTTGC AGATTCCATT ACCCTTGCCT TGCTATCCAA GTTAGAGGAA AGTTCCATGA ACGTACAACT ATCGGTAATC GGTTTAGGTG CTGATGGGGA ATTGAAGCCG GATTACCTTT TACAGAGGAT AAGCAAAATA GCATCACTGA ACGGATTAAT AGAAATCCTT GGATTTGATG AAAGTTCTGC CTCTGTAATA GAGAGGGTAC TAGAAGTTAC CGTTACAGAG GCCTCGGCCC TGCCCTTGAA GGCCTTCAGA GGGCTTTATG GTGAGGTTCC GATCAGAAAC GGTAAGCGTA GGGTATTCGT AAGCCCGCTC ACATCGGTCA TGTTCAGCTT CAAGCCCATA GTTGTAGCAT CAATCTCTAA ACTGTATGAG GCTGTGAGAG ATTCCTCTTC CCTTGAGGAA GCCAACGAGG CACTTCACAG GTTAGGTTTA ATCACAGAGC TAGATAGAGA AAGAGGCCTC AAGGACGAAT AA
|
Protein sequence | MNSLVLGAGG GGDVVSALLP YERIRRRGER VTLGAVLWER RVEDPVPGPI CTNDLREAQV INERVAILGP NSFAVRGGRK VKPQAARVAQ VLGINVVSFC ISGGVSGLYR DIMEIANMLG IDQVIGVDAG GDILAEGTEN NLLSPLADSI TLALLSKLEE SSMNVQLSVI GLGADGELKP DYLLQRISKI ASLNGLIEIL GFDESSASVI ERVLEVTVTE ASALPLKAFR GLYGEVPIRN GKRRVFVSPL TSVMFSFKPI VVASISKLYE AVRDSSSLEE ANEALHRLGL ITELDRERGL KDE
|
| |