Gene Information |
Locus tag | Msed_0892 |
Symbol | |
ID | 5103538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 827861 |
End bp | 828964 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506795 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001190988 |
Protein GI | 146303672 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
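The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated gene length) and that the protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 827861
end_bp = 828964

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the final codon is the stop codon
# and does not contribute a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1104 367
```

Both results agree with the Gene Length (1104 bp) and Protein Length (367 aa) fields.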
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAATAA ACATAGGAGG AGGATTGCCT GATCCCAGAA CGTTCCCCTG GGAGCGTATG GGTGAGATTG TAGATTACCT TATCAGAGAA AGGAGCGAAA CAACCCTGCA GTATGCCCCA AGTGAAGGTA TAGAGGAGGT CAGGAAAGAG ATATCCAATT TTGTTAGGAA GAGAGGATTC TCCTTGGAGG AGGATCAGAT ACTTATAACG GGAGGAGCTA AGGAGGCCAT ATACTTACTC TCTGAACTTT TCTCGCAGAA CATGGTAGCC TCTGAGGAAC CCACGTTTCA GGGATTCATA AGTACCATGA GTTACAGAGG GTTAAGGGCA TATCCAATCC CTTGGGATGA ATACGGTCCC ATGACCGACG TCCTCGAGAA GAGGTTAAAG GCACTTCGAA TGTGGGCAGA CCCAGTGAAG TACTTTTACG TAGTCCCAGT TCACAACCCG ACGGGGAGAG TCATGACCAA GGATAGGCGC AAACACCTCC TTGAGTTGGC CAGTGACTTC AACTTTCAGA TCATTGAGGA TGACATATAT GGGTTCTACA TGTATGACGA TCCTCCCTAT CCTGCACTTA AATCCCTTGA TAAGGAAGGA AGAGTAATCT ACATCTCAAG TTTTAGTAAG ATCATTTCTC CAGGGCTCAG GGTAGGCTTC ATAGGCTATG AGGGAAGGGA GATCGAAAAG TTAGCTACTA TCAAGAGCGA AATTAATCAT CAAGTTTCTA CACTGGATCA ACTTATCGTG GGGGAAATGC TCAGGAGAGA CCTCGTGGAC GCCGTAGTCG AGAACTCCGT ACTCCTTTAC AGGAAAAAGA GGAACGTCAT GCTCGACGCA ATAGAGGAAT ATTTCCCGTC CAGCACTGGG TGCAGTTACA CAGAGGGAGG TTTCTTCACT CTATGCAGAA AAGAGGCACT AGACTCGTCA TCCCTGCTCA AGGAGGCCTT GAAAAGGGAC GTTAAGTTCA TTCCTGGAGA GAAGTTCTTC TACTCTAGCG AACAGGGAAG AAATTCCTTT AGACTTAGTT TCAGTTTCGC TAAGGAGGAA GAAATAGTGG AAGGTGTGAG GATACTTGGT GAGCTGTTGA AGGGAATTAA ATGA
|
Protein sequence | MPINIGGGLP DPRTFPWERM GEIVDYLIRE RSETTLQYAP SEGIEEVRKE ISNFVRKRGF SLEEDQILIT GGAKEAIYLL SELFSQNMVA SEEPTFQGFI STMSYRGLRA YPIPWDEYGP MTDVLEKRLK ALRMWADPVK YFYVVPVHNP TGRVMTKDRR KHLLELASDF NFQIIEDDIY GFYMYDDPPY PALKSLDKEG RVIYISSFSK IISPGLRVGF IGYEGREIEK LATIKSEINH QVSTLDQLIV GEMLRRDLVD AVVENSVLLY RKKRNVMLDA IEEYFPSSTG CSYTEGGFFT LCRKEALDSS SLLKEALKRD VKFIPGEKFF YSSEQGRNSF RLSFSFAKEE EIVEGVRILG ELLKGIK
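The relationship between the gene and protein sequences can be spot-checked by translating the first and last blocks of the gene sequence. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons). The helper name `translate` is hypothetical, and the function assumes the input contains only A, C, G and T.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators used in the record)
    is removed before translation.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three and last three blocks of the gene sequence in the record.
first_blocks = "ATGCCAATAA ACATAGGAGG AGGATTGCCT"
last_blocks = "GAGCTGTTGA AGGGAATTAA ATGA"

print(translate(first_blocks))  # MPINIGGGLP
print(translate(last_blocks))   # ELLKGIK
```

The two outputs match the first ten and last seven residues of the protein sequence in the record, and the gene ends on the stop codon TGA.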