Gene Information |
Locus tag | Msed_2120 |
Symbol | |
ID | 5104413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 2039472 |
End bp | 2040677 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640508009 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001192183 |
Protein GI | 146304867 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
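
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; neither convention is stated in the record, and the helper names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(2039472, 2040677)
print(length)                     # 1206
print(protein_length_aa(length))  # 401
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.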
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.265686 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAAT GGGAAGGCTT CTTCTCAAAT GAAACCAAGG GGCTTAAGTC CTCAGAGATA AGGGACTTAC TAAAGCTCAC CGAGGGAAAA AATGTAATAA GTTTCGCTGG CGGTTTACCT GATCCCTCCA CTTTCCCGGT AGAGGACATA AAAAAGATAA CAGATGAGAT ACTTGAAACT AAGTCCTCGT CTGCACTACA ATATACCGCA ACAGCAGGAG TTGCGGAGCT TAGGAAACAG TTAGTATCTT TCTCGGGACT TAGGGGAATC ACCGGGATAA GTGAAAATAA TGTTTTCGTG AGCGTGGGGA GTCAGGAGGC CCTCTTCATC CTATTTAATG TCCTCGTAGA TCCAGGAGAT AACGTGATTG TGGAGAAACC CACTTATCTT ACCGCCCTCA ATGTCCTTCG TACCAGAAAG CCCCAATTTC ATGGAGTTAC TGTCACAGAT AAGGGACCTG ACCTACACGC TCTCGAGTCC CAGCTAAAGA AGCTGAAATC GGAAGGGAAG AGGATTAAAC TCATGTATGT TATTCCCACA GCTCAGAACC CTGCTGGCAC AACCATGTAC CTGGATGATA GGAAGTATCT TATGGAACTT GCCTCCTCAT ATGACTTCTT GGTAGTGGAG GATGATGCGT ACGGATTTCT GGTGTTCGAG GGAGATAACC CTCCCCCTCT CAAGGCATTA GACAAGGAGG GTAGAGTAAT CTATCTTGGT ACCTTTAGCA AGATCCTATC TCCAGGTCTT AGATTAGGAT GGATAGTGGC CGACGAAGAA ATAATAAGGG AAGTGGAGCT CTTCAAGCAA AACGTGGACC TTCACACGCC TTCATTGAAC CAGTTTATAG CTGCAGAGGC CATAAGGAGG GGGGTAATAC AATCTAACTT GCCCAAAACC AAGGCCCTCT ACAAACAGAA GAGGGACTAC ATGATACAGG CAATGGACAA GTACTTCCCA TCTGTAGTGA AGAGAACTAA GCCGGTGGGT GGAATGTTCG TGTTCACTTG GTTACCTGAG AAGTTCAATA CCACTCAACT ACTGCAAGAG GCTATGTTAA AGGGAGTGGC CTACGTGCCA GGTAATAGCT TCTATTACGA CTATAGTGGG GCAAATACCA TGCGAATAAA CTTCAGTTTC CCCAGTAAGG AGGAGATAGA GAAGGGTATA GAAATCTTAG GTAATCTAAT AAAATCTAAG CTATGA
|
Protein sequence | MGKWEGFFSN ETKGLKSSEI RDLLKLTEGK NVISFAGGLP DPSTFPVEDI KKITDEILET KSSSALQYTA TAGVAELRKQ LVSFSGLRGI TGISENNVFV SVGSQEALFI LFNVLVDPGD NVIVEKPTYL TALNVLRTRK PQFHGVTVTD KGPDLHALES QLKKLKSEGK RIKLMYVIPT AQNPAGTTMY LDDRKYLMEL ASSYDFLVVE DDAYGFLVFE GDNPPPLKAL DKEGRVIYLG TFSKILSPGL RLGWIVADEE IIREVELFKQ NVDLHTPSLN QFIAAEAIRR GVIQSNLPKT KALYKQKRDY MIQAMDKYFP SVVKRTKPVG GMFVFTWLPE KFNTTQLLQE AMLKGVAYVP GNSFYYDYSG ANTMRINFSF PSKEEIEKGI EILGNLIKSK L
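
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such blocks could be joined, a GC percentage computed, and the gene translated is given below. The function names are hypothetical. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in the start codons it permits). The gene sequence is taken to be already in coding orientation, since it begins with ATG.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 18 bases of the gene sequence above.
prefix = clean_blocks("ATGGGCAAAT GGGAAGGC")
print(translate(prefix))  # MGKWEG
```

Applying `clean_blocks` and `translate` to the full Gene sequence field should give a string that can be compared with the cleaned Protein sequence field, and `gc_percent` can be compared with the GC content field after rounding.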
|
| |