Gene Msed_1981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1981 
Symbol 
ID5103368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1915197 
End bp1916537 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content53% 
IMG OID640507869 
Productamidophosphoribosyltransferase 
Protein accessionYP_001192045 
Protein GI146304729 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0034] Glutamine phosphoribosylpyrophosphate amidotransferase 
TIGRFAM ID[TIGR01134] amidophosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.568591 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCAA GGGAACACTG TGGAGTTTTC GGGGTTGTCG GGCCTGACTC CACGAAGCTT 
ACCTTTGAAG GACTAAAGCT CCTCCAACAT AGGGGTCAGG AATCCGCAGG GATTTCCTGG
ATAGATGGGG ACAGGATACA AACTAGGAAG GGCCTTGGTC TTGTGGGTGA GGCCCTAGAT
CCCAAGGAGA TAGGGGAGTC GCTCTTTTCC ATAGGCCACG TGAGGTACTC AACCACAGGC
TCCACTACCC TACAGGAGGC TCAGCCCCTC GACGACGGCT TCATTGCTGT TTCCTTTAAC
GGTACCATAA CCAACCACTT CCAGAACGGA GACTTTTCCA CTGACACCGA GTTTATCTTG
TCCTTCCTTA GGAACCAGCT GTCTCAAGGG AGAAGCCTGG AGAGCTCCGC GAGGGCATTC
ATGGATGTGG CCGATGGAGC CTTCTCCCTC CTGGTTCTCT CCTCCAAGGG CGAAATTTTG
GCAATGCGGG ATCCGCGTGG CTTCAGACCT CTCGTTATAG GCGAAATAGG GGATAACAAG
GTGGTCTCCT CTGAGGACTC AGCAATCAAG CAGTTGGGTG GAAGGGTCAT CGGTTTCGTT
CACCCAGGGG AGATAATTAA GATAACCAGG GATACAGTGG TCAGGGAGAG GGTCTCCTCC
CTTCCGACGA CTACGTGCGC CTTTGAATAC ATTTATTTCT CTAGAGCCGA TTCAGAGATA
GACGGAATCT CGGTCTATGC ATCCAGAATT AAGTTAGGGG AGTTGCTGGC TAGAAATCAC
CCAGCCAACG GAGATGTGGT AGTGCCTGTG CCAGACTCGT CCAGGCCCAT AGCCCTGGGC
TTCTCTAGGA CCAGCGGTAT ACCCCTAGAG GAGGCACTGG TAAGGACCAT CTCTTCAAAG
AGGTCCTTCA TCATGCCTTC CGACGAGAAA AGGAACGAGG TGCTTAAGGA GAAGTTCGGC
ATTGTGGAGT GGGCCGTCAG GGGAAAGAGG GTTGTACTCG TTGACGATTC CATAGTCCGC
GGGAATACCA TGAAGAGGAT AGTGAACTCG CTTAGAAGCG CGGGAGCGAG GGAGGTTCAC
ATTAGAATTG GATCCCCCAT GATTAGGTTT CCCTGTTACA TGGGAATCGA CTTCCCTAGA
AGATCTGAGC TTGTGGCAAA CATAGGAGAT GAGAGGGCCA TAGCCCGTGA GCTTAACGCG
GATAGCGTCG AGTACCTTTC TGTGGAGGAA ATGGTTCAGG CCATAGGCAG AACTACCCTA
TGTAAGGCCT GTTTCACAGG GGAGTATCCC CTCAAGGGGA AATATGACCT CTCCGCTCTG
GAGAGCGTTT TCGCAAGGTG A
 
Protein sequence
MNAREHCGVF GVVGPDSTKL TFEGLKLLQH RGQESAGISW IDGDRIQTRK GLGLVGEALD 
PKEIGESLFS IGHVRYSTTG STTLQEAQPL DDGFIAVSFN GTITNHFQNG DFSTDTEFIL
SFLRNQLSQG RSLESSARAF MDVADGAFSL LVLSSKGEIL AMRDPRGFRP LVIGEIGDNK
VVSSEDSAIK QLGGRVIGFV HPGEIIKITR DTVVRERVSS LPTTTCAFEY IYFSRADSEI
DGISVYASRI KLGELLARNH PANGDVVVPV PDSSRPIALG FSRTSGIPLE EALVRTISSK
RSFIMPSDEK RNEVLKEKFG IVEWAVRGKR VVLVDDSIVR GNTMKRIVNS LRSAGAREVH
IRIGSPMIRF PCYMGIDFPR RSELVANIGD ERAIARELNA DSVEYLSVEE MVQAIGRTTL
CKACFTGEYP LKGKYDLSAL ESVFAR