Gene Msed_1983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1983 
Symbol 
ID5103370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1917748 
End bp1919181 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content53% 
IMG OID640507871 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_001192047 
Protein GI146304731 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.601261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTTC TACTTGTGGG AGACGGAGCC AGGGAACACG CGATGGCAGA GGCCCTAGCC 
AACTCTCCGC AGGGGTATAG GGTGTACGCC CTATCCTCCT ACGTGAACCC AGGGATCAGG
GAGGCCGTGA ACAGGACTGG TGGAAAGTAC GTTCAAGGCA ACATAAACTC AAGGGAGGAC
GTAGCTAAGG CCATTCGCGA GTTCAACCCT GACTTTGGAG TGGTCGGGCC AGAGGATCCA
CTCTTCCACG GAATAGCTGA CGAGTTTAGG AGGAATGGAA TACCAGTTGT AGGCCCCAAC
AGGGCAGGGG CTGAGATAGA GCGGTCCAAG GTGTGGATGA GACAACTCAT GTGGAAGTAC
AAGATAGATG GTAGGTTGAG GTTTAGGAGC TTCACCAGTC TCGAGGAGGC CTCACGGTTC
ATTGTGGAAT ACGGTGGATC AGTTGCCGTA AAGCCTGCAG AACAGGTTGG AGGAAAGGGA
GTTAAAGTAG TTGCGGATAT TCAGGCTTAT CTCTCTAACG AGAAGAGGAG GGCATTAAGC
AAGAGTGTGG ATGAGATAGG CTCCCTCGTT AAGAACGAGG TTAAGATTAT TATTGAAGAG
AAAGTGGACG GACCAGAGTA CACCCTCCAC GTTCTCACTG ACGGACATAC CTTTCTACCG
CTCCCGCTGG CACAGGATTA CAAACACGCG TACCAAGACG GAATAGGCCC CGAGACTGGC
GGTATGGGAT CAGTGTCAGG TCCAGGGAGA CTCCTTCCCT TCATTACCGA GGAAGAGTAC
GAGAAAACGC TCAAGATAGT TCAGGACACG GCAAGGGCCA TTCGCGAGGA GACTGGGGAG
CCCTACAGGG GATTCATCTC CGGGCAGATG ATGCTCACAG AGTTATGGGG ACCAACCGTG
ATAGAGTTCT ACTCCAGGAT GGGAGATCCA GAGACCTCTG CGATCATACC CAGGATATCC
TCAGACTTCG GTTACTTACT ACAGTTAACT GCTGAGGAAA AGCTGTCCCA GGCTAAGCTT
GAGGTGAGGG AGGACCCCAC GGTGGTGAGA GCTATCGCCC CCCTTGGCTA TCCACTGAGA
AGGGAAATGG CCACTGGGAA GGAGATTTTC CTCGACGTTA ACTCCATGAG GGAGAAGGGA
TGTCTAGTGT ACTTCGGCTC CGTGTCATAT GAAGGTGGGA AACTGATCAC GAGGGGATCT
AGGGCACTGG AGCTCGTGGC CTTCGGGGAC TTTGAGGAGG CCTCCAGGAA GCTGGACACA
TGCACCAAGA TGATTTCAGC CAACACCGAG CTGGTGTATA GAACTGATAT TGGCAGAACC
CTTGGGGAGC AGGCCGAGAA GGCGGAGATC GTGAGATACT CCTACCAGAA CAGGATTAGG
AGGAACTCCC TGGGCGTATC AGCGGACTGG GCACCAAATG GTGGTTTATG GTGA
 
Protein sequence
MKVLLVGDGA REHAMAEALA NSPQGYRVYA LSSYVNPGIR EAVNRTGGKY VQGNINSRED 
VAKAIREFNP DFGVVGPEDP LFHGIADEFR RNGIPVVGPN RAGAEIERSK VWMRQLMWKY
KIDGRLRFRS FTSLEEASRF IVEYGGSVAV KPAEQVGGKG VKVVADIQAY LSNEKRRALS
KSVDEIGSLV KNEVKIIIEE KVDGPEYTLH VLTDGHTFLP LPLAQDYKHA YQDGIGPETG
GMGSVSGPGR LLPFITEEEY EKTLKIVQDT ARAIREETGE PYRGFISGQM MLTELWGPTV
IEFYSRMGDP ETSAIIPRIS SDFGYLLQLT AEEKLSQAKL EVREDPTVVR AIAPLGYPLR
REMATGKEIF LDVNSMREKG CLVYFGSVSY EGGKLITRGS RALELVAFGD FEEASRKLDT
CTKMISANTE LVYRTDIGRT LGEQAEKAEI VRYSYQNRIR RNSLGVSADW APNGGLW