Gene Information |
Locus tag | Msed_0013 |
Symbol | |
ID | 5105152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 9918 |
End bp | 10883 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640505906 |
Product | AsnC family transcriptional regulator |
Protein accession | YP_001190114 |
Protein GI | 146302798 |
COG category | [K] Transcription |
COG ID | [COG1522] Transcriptional regulators |
TIGRFAM ID | |
|
|
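The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming (this is not stated in the record) that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose span includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 9918, End bp 10883.
length_bp = gene_length(9918, 10883)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (966 bp) and Protein Length (321 aa) fields listed in the record.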
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000293296 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0126394 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCTTG CCGATACCCA GATTAAGCTT CTCATGGAAC TTCAATACAA CTTCCCTCTC GATGAGAAAC CCTTCGATAT AGTAGCTGGC AAGCTGAATC TAAGAACTGA CCTAGTCCTC AAGGAAACAC TGAATCTTAT AGATTCTGAA ATTATAAAAA GAGTTGGAAT GTATGTCAAT TTTAGATCTA AGGGAATGGA AGGAGCGTTA ATTGCCGCAT CCATACCCCT AGATCAACTG GATAAGTTCA GGAGAGAAGC GCTTCACATA AGGGAACTCA CGCATAACTT CATCAGGAAC CATCCAAGAT ATAACGTCTG GTACGTCCTT AAGGCTGAAT CGAAAGAAGC CTTAGAGAAG AGGGTCAGGG ATCTGATGGA GGAGGTAAAA GCTGAAGACT ATGTAATACT TTTCTCCAAG AGGAATCTCA AATTAAGTGT AAAGTACGAT TTAATACGCG GGATCTCATG GAGCAAAAAT GAGAAAACAC CTGAGAAAAT TCCCACGGCT GATGAACTAG GGCTAAACAT GGAATTCCTG AAGGCTTTGT CCTATCCCCT TCCCATAGTA GAGAGGCCTT TCAAGGCACT TGCCGAGAGA TTTGGGTATA GAGAAGCGGA ACTCGTGGAC TTGATTTCTG AATTAAGATC AAAGCACGTT ATCAAGGATT ACGGAGCCAC AGTAAATGGA GAAAAGGTAG GAATTACTGA GAACGCTATG TTACTCATCA ATACTGATAA TATCGAAGAA TCCTGTAACA GGATAGCTGA AAATCTGAAC GAGGCCACGC ACGTGGTGTT AAGGGAAAGC AATAAGCCCT GGGATTACCT GTGCTACTGC ATGCTGCATG GTAGGAGCAA GGCAGTTATA AGGGAAGCCT CCATGAAGGC CTTGGGGATA ACCGGAGCTA AAAGCTACAT GCTCCTATAC AGTCTTGACA ATTTAAAGCC CGGAATAGTT ATGTGA
|
Protein sequence | MDLADTQIKL LMELQYNFPL DEKPFDIVAG KLNLRTDLVL KETLNLIDSE IIKRVGMYVN FRSKGMEGAL IAASIPLDQL DKFRREALHI RELTHNFIRN HPRYNVWYVL KAESKEALEK RVRDLMEEVK AEDYVILFSK RNLKLSVKYD LIRGISWSKN EKTPEKIPTA DELGLNMEFL KALSYPLPIV ERPFKALAER FGYREAELVD LISELRSKHV IKDYGATVNG EKVGITENAM LLINTDNIEE SCNRIAENLN EATHVVLRES NKPWDYLCYC MLHGRSKAVI REASMKALGI TGAKSYMLLY SLDNLKPGIV M
|
| |
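The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch, using only the Python standard library, of how one might strip the block spacing, compute GC content, and translate the gene sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the amino acid assignment shared by NCBI translation tables 1 and 11. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter for this gene, since it begins with ATG.

```python
from itertools import product

# Codon-to-residue map in the conventional TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display spacing (blocks of 10) and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence from the record.
print(translate("ATGGATCTTG CCGATACCCA GATTAAGCTT"))
```

Applied to the first 30 bases of the gene sequence, the translation reproduces the first block of the protein sequence (MDLADTQIKL). Passing the full Gene sequence field to `gc_percent` and `translate` would allow the GC content and Protein sequence fields to be cross-checked in the same way.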