Gene Msed_0190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0190 
Symbol 
ID5103934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp153774 
End bp154904 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content49% 
IMG OID640506095 
ProductGntR family transcriptional regulator 
Protein accessionYP_001190291 
Protein GI146302975 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.557885 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.116027 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCTCAA GGATAGGAAG AGAAATAGAA CTCTCGCCCG TGGAGATGGG ATCTAGGCTC 
GGGCGTAACG TAAAAATTAA CATGGCCAGC GGGTCACCCG ATCCGTCAAC TATTCCAGTT
GATGAGATAG GAAGAGCCTA CGAGGAAGTG CTGGCCGACC TGGGCCCAAG ATCACTTTTC
TACCCAGGTG CTGGAGGTCA GCAGGAGCTA ATTGAGGAGG TGAACAAATA TCTTCCTGCC
ATAGGCTTGA GAAGTAAGGA TCCGATAGTC ATAACCAGCG GTGCTCAACA CGCCATAGAG
TTGCTGTCGA AGTACTTCCT CGAGAACGGG ACAGTTGTGG TGGAGAACCC AACCTTCGTG
GAGACTTTTT CAGCCTTTAA GTTAAGGGCT TCGGTCACGA TACCCGTCAC TGTGGATGGA
AAGGGTATTT CCACAGATGA GCTGGAGCTC GTTACCAAGA TAGTTAAGCC AGATCTAGTC
TACGTGATAC CGGACTGTCA TAACCCTGCT GGAGTGAACT TGAATGAGGA AAGGAGGAAG
ATACTGGTTG AGTTGGCTGA GGAAAGGGAC TTCTATGTGA TAGAAGACGA CCCTTACAGA
CCCATAGCTG GGTGCGTTCC AGCTCCATTA AAAAATTACG ATCGTAGTGG CAGGGTCATA
CACGTCAGCA GTTTCAGTAA GATCTTGGCA CCGGGTCTAA GGATAGGTTT CGTGGTAGCC
CCTCCAGAAA TAGCGGAGAA GTTGAGCCTC ATGGAACAAC TGGACTTTTC CACTTCAACT
CTAAATCAGT ATGTCGTCTC GCGCCTTTTG AGATCTGGAT TCATTTTATC TAGAACGAAG
ATTCTTCCAG AGCACTACAG GAAGAAAATG AAAGTCCTTG TGGACTCCCT AACGGATGCA
GGGATATCAG AGTTTAATCA GCCCAGTTGC GGGTTCTTCC TTTTGCTTGA CCTTAAGAGG
GATGCCCATA GAGTTTTGGA GGAAGCGGTA AGGCAAGGCC TAGCTTTCGT TCCTGCTAAG
GACTTCTTCC TACGGGGCGG AGAGACAATG GCTAGGCTGA GTATCACAGT TCCCAATGAG
GAGCAGATCA AGGCCGGAGT TGAGATACTG AAGAGGGTTA TTCGAGGCTA G
 
Protein sequence
MVSRIGREIE LSPVEMGSRL GRNVKINMAS GSPDPSTIPV DEIGRAYEEV LADLGPRSLF 
YPGAGGQQEL IEEVNKYLPA IGLRSKDPIV ITSGAQHAIE LLSKYFLENG TVVVENPTFV
ETFSAFKLRA SVTIPVTVDG KGISTDELEL VTKIVKPDLV YVIPDCHNPA GVNLNEERRK
ILVELAEERD FYVIEDDPYR PIAGCVPAPL KNYDRSGRVI HVSSFSKILA PGLRIGFVVA
PPEIAEKLSL MEQLDFSTST LNQYVVSRLL RSGFILSRTK ILPEHYRKKM KVLVDSLTDA
GISEFNQPSC GFFLLLDLKR DAHRVLEEAV RQGLAFVPAK DFFLRGGETM ARLSITVPNE
EQIKAGVEIL KRVIRG