Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2115 |
Symbol | |
ID | 5104408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 2036162 |
End bp | 2037064 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640508004 |
Product | signal transduction protein |
Protein accession | YP_001192178 |
Protein GI | 146304862 |
COG category | [K] Transcription |
COG ID | [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.548765 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAATT TATCATCAAC ACAAAGGGAG ATACTCTTAG CGTTAATTGA GCTCTATAAT AGGCAAAAGA AAATGATCAA AAGCAAGGAA GTTGCCGACA TGATAAGTAA GGACGAAGGT ACTGTTAGAA ATATCATTCT AAGCCTGAAG GTCCTTGGAC TGGTGGACTC TAAGCCAGGT CCAAATGGAG GATACTTGCC AACATTGAAG GCCTACGAGT TCGTAAAGAA CCCGAGCATA TTGCCAATAC TGGATAAGTT AAGTCTCTAT CGTGGAAACG TGGAAACCGA TGTTAAAGTG GAAAACATTG AACTCCTGGA CATAACTAAT CCCTCAGGGA ATAGGGTCAT CCTGAAAATC TCAGGAGATC CCAGGAAGAT CAGGCCGGGT GATAGCGTTA GGGTGGGACC AACGCCTTAC AGCAGGCTTG TTATAGAGGG TGTTGTGCTC AATGCAGAGG AAGAACGTAG GGAAATTATC ATTGATGTAA AGAGAATGAT AAGCATCCCA AAGGAAAAGG TAAAGAACAT AATAGGGAAG AGGCTCGTCT CCTTGAAACC TAACATGTCT TTAAGGGATG CCTCGAGGAT ACTTCACAAG GAGGGGATAA GGGGAGCCCC TGTTCTAGAC GAGTCAGGGA ACGTCATAGG GATAATTACA ACTGCAGACT TAATGAGGGC ATTTTATGAG GGTAACTTCG ACGCAACGGT CTCAGATTAC ATGAAGAGAG ATGTGATAAC CATAAAGGAG GAGGATGATA TAATGGAGGC CGTTAAAAAG ATGGTGACAT ATAATGTCGG AAGATTAGTG GTTATGGATG CCATTAACAG GGTCACAGGA ATGGTCACGA GAACCGATAT TCTGAAATCC ATCGCTGGAC TAGAGGGGTT ATGGTCCATC TGA
|
Protein sequence | MQNLSSTQRE ILLALIELYN RQKKMIKSKE VADMISKDEG TVRNIILSLK VLGLVDSKPG PNGGYLPTLK AYEFVKNPSI LPILDKLSLY RGNVETDVKV ENIELLDITN PSGNRVILKI SGDPRKIRPG DSVRVGPTPY SRLVIEGVVL NAEEERREII IDVKRMISIP KEKVKNIIGK RLVSLKPNMS LRDASRILHK EGIRGAPVLD ESGNVIGIIT TADLMRAFYE GNFDATVSDY MKRDVITIKE EDDIMEAVKK MVTYNVGRLV VMDAINRVTG MVTRTDILKS IAGLEGLWSI
|
| |