Gene Msed_0892 details

Gene Information

Locus tag: Msed_0892
Symbol:
ID: 5103538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 827861
End bp: 828964
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 46%
IMG OID: 640506795
Product: GntR family transcriptional regulator
Protein accession: YP_001190988
Protein GI: 146303672
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
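
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the database record: the variable names are illustrative, and it assumes that the start and end coordinates are inclusive and that the stated gene length includes the stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 827861
end_bp = 828964
gene_length_bp = 1104
protein_length_aa = 367

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the last codon is the
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinate span vs. gene length
print(remainder == 0)                                # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop vs. protein length
```

Under those assumptions, the three checks tie the Start bp, End bp, Gene Length and Protein Length fields to one another.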


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAATAA ACATAGGAGG AGGATTGCCT GATCCCAGAA CGTTCCCCTG GGAGCGTATG 
GGTGAGATTG TAGATTACCT TATCAGAGAA AGGAGCGAAA CAACCCTGCA GTATGCCCCA
AGTGAAGGTA TAGAGGAGGT CAGGAAAGAG ATATCCAATT TTGTTAGGAA GAGAGGATTC
TCCTTGGAGG AGGATCAGAT ACTTATAACG GGAGGAGCTA AGGAGGCCAT ATACTTACTC
TCTGAACTTT TCTCGCAGAA CATGGTAGCC TCTGAGGAAC CCACGTTTCA GGGATTCATA
AGTACCATGA GTTACAGAGG GTTAAGGGCA TATCCAATCC CTTGGGATGA ATACGGTCCC
ATGACCGACG TCCTCGAGAA GAGGTTAAAG GCACTTCGAA TGTGGGCAGA CCCAGTGAAG
TACTTTTACG TAGTCCCAGT TCACAACCCG ACGGGGAGAG TCATGACCAA GGATAGGCGC
AAACACCTCC TTGAGTTGGC CAGTGACTTC AACTTTCAGA TCATTGAGGA TGACATATAT
GGGTTCTACA TGTATGACGA TCCTCCCTAT CCTGCACTTA AATCCCTTGA TAAGGAAGGA
AGAGTAATCT ACATCTCAAG TTTTAGTAAG ATCATTTCTC CAGGGCTCAG GGTAGGCTTC
ATAGGCTATG AGGGAAGGGA GATCGAAAAG TTAGCTACTA TCAAGAGCGA AATTAATCAT
CAAGTTTCTA CACTGGATCA ACTTATCGTG GGGGAAATGC TCAGGAGAGA CCTCGTGGAC
GCCGTAGTCG AGAACTCCGT ACTCCTTTAC AGGAAAAAGA GGAACGTCAT GCTCGACGCA
ATAGAGGAAT ATTTCCCGTC CAGCACTGGG TGCAGTTACA CAGAGGGAGG TTTCTTCACT
CTATGCAGAA AAGAGGCACT AGACTCGTCA TCCCTGCTCA AGGAGGCCTT GAAAAGGGAC
GTTAAGTTCA TTCCTGGAGA GAAGTTCTTC TACTCTAGCG AACAGGGAAG AAATTCCTTT
AGACTTAGTT TCAGTTTCGC TAAGGAGGAA GAAATAGTGG AAGGTGTGAG GATACTTGGT
GAGCTGTTGA AGGGAATTAA ATGA
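
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before any counting. A minimal Python sketch of that step follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of the database, and the sample uses only the last displayed line of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Short sample: only the last displayed line of the gene sequence.
sample = clean_sequence("GAGCTGTTGA AGGGAATTAA ATGA")
print(len(sample), sample.endswith("TGA"), gc_fraction(sample))
```

Applied to the full gene sequence, the resulting length and GC fraction can be compared with the Gene Length and GC content fields of the record.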
 
Protein sequence
MPINIGGGLP DPRTFPWERM GEIVDYLIRE RSETTLQYAP SEGIEEVRKE ISNFVRKRGF 
SLEEDQILIT GGAKEAIYLL SELFSQNMVA SEEPTFQGFI STMSYRGLRA YPIPWDEYGP
MTDVLEKRLK ALRMWADPVK YFYVVPVHNP TGRVMTKDRR KHLLELASDF NFQIIEDDIY
GFYMYDDPPY PALKSLDKEG RVIYISSFSK IISPGLRVGF IGYEGREIEK LATIKSEINH
QVSTLDQLIV GEMLRRDLVD AVVENSVLLY RKKRNVMLDA IEEYFPSSTG CSYTEGGFFT
LCRKEALDSS SLLKEALKRD VKFIPGEKFF YSSEQGRNSF RLSFSFAKEE EIVEGVRILG
ELLKGIK
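
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this in plain Python under stated assumptions: `translate` is a hypothetical helper, the codon table is built from the usual TCAG-ordered amino acid string (translation tables 1 and 11 share these amino acid assignments and differ in the permitted start codons), and only the first three blocks and the last line of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every three-base codon to its amino acid letter.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace.

    Codons containing characters other than A, C, G, T become "X";
    a trailing partial codon is dropped.
    """
    seq = "".join(text.split()).upper()
    usable = len(seq) - len(seq) % 3
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First three blocks and last line of the gene sequence shown above.
print(translate("ATGCCAATAA ACATAGGAGG AGGATTGCCT"))
print(translate("GAGCTGTTGA AGGGAATTAA ATGA"))
```

The first call yields MPINIGGGLP, the first ten residues of the protein sequence, and the second yields ELLKGIK followed by `*` for the TGA stop codon, matching its last seven residues.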