Gene Information |
Locus tag | Msed_0190 |
Symbol | |
ID | 5103934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 153774 |
End bp | 154904 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506095 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001190291 |
Protein GI | 146302975 |
COG category | [E] Amino acid transport and metabolism; [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
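The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon; both are assumptions, not statements made by the record, and the function names are illustrative.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(153774, 154904)
print(length)                     # 1131
print(protein_length_aa(length))  # 376
```

Under those assumptions, the values agree with the Gene Length (1131 bp) and Protein Length (376 aa) fields.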
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.557885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.116027 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTCTCAA GGATAGGAAG AGAAATAGAA CTCTCGCCCG TGGAGATGGG ATCTAGGCTC GGGCGTAACG TAAAAATTAA CATGGCCAGC GGGTCACCCG ATCCGTCAAC TATTCCAGTT GATGAGATAG GAAGAGCCTA CGAGGAAGTG CTGGCCGACC TGGGCCCAAG ATCACTTTTC TACCCAGGTG CTGGAGGTCA GCAGGAGCTA ATTGAGGAGG TGAACAAATA TCTTCCTGCC ATAGGCTTGA GAAGTAAGGA TCCGATAGTC ATAACCAGCG GTGCTCAACA CGCCATAGAG TTGCTGTCGA AGTACTTCCT CGAGAACGGG ACAGTTGTGG TGGAGAACCC AACCTTCGTG GAGACTTTTT CAGCCTTTAA GTTAAGGGCT TCGGTCACGA TACCCGTCAC TGTGGATGGA AAGGGTATTT CCACAGATGA GCTGGAGCTC GTTACCAAGA TAGTTAAGCC AGATCTAGTC TACGTGATAC CGGACTGTCA TAACCCTGCT GGAGTGAACT TGAATGAGGA AAGGAGGAAG ATACTGGTTG AGTTGGCTGA GGAAAGGGAC TTCTATGTGA TAGAAGACGA CCCTTACAGA CCCATAGCTG GGTGCGTTCC AGCTCCATTA AAAAATTACG ATCGTAGTGG CAGGGTCATA CACGTCAGCA GTTTCAGTAA GATCTTGGCA CCGGGTCTAA GGATAGGTTT CGTGGTAGCC CCTCCAGAAA TAGCGGAGAA GTTGAGCCTC ATGGAACAAC TGGACTTTTC CACTTCAACT CTAAATCAGT ATGTCGTCTC GCGCCTTTTG AGATCTGGAT TCATTTTATC TAGAACGAAG ATTCTTCCAG AGCACTACAG GAAGAAAATG AAAGTCCTTG TGGACTCCCT AACGGATGCA GGGATATCAG AGTTTAATCA GCCCAGTTGC GGGTTCTTCC TTTTGCTTGA CCTTAAGAGG GATGCCCATA GAGTTTTGGA GGAAGCGGTA AGGCAAGGCC TAGCTTTCGT TCCTGCTAAG GACTTCTTCC TACGGGGCGG AGAGACAATG GCTAGGCTGA GTATCACAGT TCCCAATGAG GAGCAGATCA AGGCCGGAGT TGAGATACTG AAGAGGGTTA TTCGAGGCTA G
Protein sequence | MVSRIGREIE LSPVEMGSRL GRNVKINMAS GSPDPSTIPV DEIGRAYEEV LADLGPRSLF YPGAGGQQEL IEEVNKYLPA IGLRSKDPIV ITSGAQHAIE LLSKYFLENG TVVVENPTFV ETFSAFKLRA SVTIPVTVDG KGISTDELEL VTKIVKPDLV YVIPDCHNPA GVNLNEERRK ILVELAEERD FYVIEDDPYR PIAGCVPAPL KNYDRSGRVI HVSSFSKILA PGLRIGFVVA PPEIAEKLSL MEQLDFSTST LNQYVVSRLL RSGFILSRTK ILPEHYRKKM KVLVDSLTDA GISEFNQPSC GFFLLLDLKR DAHRVLEEAV RQGLAFVPAK DFFLRGGETM ARLSITVPNE EQIKAGVEIL KRVIRG
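The sequences above are printed in 10-character blocks, so the spaces must be removed before any analysis. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes the sequence is already in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand). It also relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code, from which it differs only in the start codons it permits; alternative start codons are not modeled. The helper names `clean`, `gc_fraction`, and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored; characters
    other than A, C, G, T raise KeyError.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGGTCTCAA GGATAGGAAG"))  # MVSRIG
```

Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared with the GC content field and with the protein sequence once its spaces are removed.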