Gene Information |
Locus tag | Msed_1983 |
Symbol | |
ID | 5103370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1917748 |
End bp | 1919181 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640507871 |
Product | phosphoribosylamine--glycine ligase |
Protein accession | YP_001192047 |
Protein GI | 146304731 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0151] Phosphoribosylamine-glycine ligase |
TIGRFAM ID | [TIGR00877] phosphoribosylamine--glycine ligase |
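The coordinates and lengths listed above agree with each other, which a short check can confirm. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1917748, 1919181)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths match the Gene Length (1434 bp) and Protein Length (477 aa) fields.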
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.601261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTTC TACTTGTGGG AGACGGAGCC AGGGAACACG CGATGGCAGA GGCCCTAGCC AACTCTCCGC AGGGGTATAG GGTGTACGCC CTATCCTCCT ACGTGAACCC AGGGATCAGG GAGGCCGTGA ACAGGACTGG TGGAAAGTAC GTTCAAGGCA ACATAAACTC AAGGGAGGAC GTAGCTAAGG CCATTCGCGA GTTCAACCCT GACTTTGGAG TGGTCGGGCC AGAGGATCCA CTCTTCCACG GAATAGCTGA CGAGTTTAGG AGGAATGGAA TACCAGTTGT AGGCCCCAAC AGGGCAGGGG CTGAGATAGA GCGGTCCAAG GTGTGGATGA GACAACTCAT GTGGAAGTAC AAGATAGATG GTAGGTTGAG GTTTAGGAGC TTCACCAGTC TCGAGGAGGC CTCACGGTTC ATTGTGGAAT ACGGTGGATC AGTTGCCGTA AAGCCTGCAG AACAGGTTGG AGGAAAGGGA GTTAAAGTAG TTGCGGATAT TCAGGCTTAT CTCTCTAACG AGAAGAGGAG GGCATTAAGC AAGAGTGTGG ATGAGATAGG CTCCCTCGTT AAGAACGAGG TTAAGATTAT TATTGAAGAG AAAGTGGACG GACCAGAGTA CACCCTCCAC GTTCTCACTG ACGGACATAC CTTTCTACCG CTCCCGCTGG CACAGGATTA CAAACACGCG TACCAAGACG GAATAGGCCC CGAGACTGGC GGTATGGGAT CAGTGTCAGG TCCAGGGAGA CTCCTTCCCT TCATTACCGA GGAAGAGTAC GAGAAAACGC TCAAGATAGT TCAGGACACG GCAAGGGCCA TTCGCGAGGA GACTGGGGAG CCCTACAGGG GATTCATCTC CGGGCAGATG ATGCTCACAG AGTTATGGGG ACCAACCGTG ATAGAGTTCT ACTCCAGGAT GGGAGATCCA GAGACCTCTG CGATCATACC CAGGATATCC TCAGACTTCG GTTACTTACT ACAGTTAACT GCTGAGGAAA AGCTGTCCCA GGCTAAGCTT GAGGTGAGGG AGGACCCCAC GGTGGTGAGA GCTATCGCCC CCCTTGGCTA TCCACTGAGA AGGGAAATGG CCACTGGGAA GGAGATTTTC CTCGACGTTA ACTCCATGAG GGAGAAGGGA TGTCTAGTGT ACTTCGGCTC CGTGTCATAT GAAGGTGGGA AACTGATCAC GAGGGGATCT AGGGCACTGG AGCTCGTGGC CTTCGGGGAC TTTGAGGAGG CCTCCAGGAA GCTGGACACA TGCACCAAGA TGATTTCAGC CAACACCGAG CTGGTGTATA GAACTGATAT TGGCAGAACC CTTGGGGAGC AGGCCGAGAA GGCGGAGATC GTGAGATACT CCTACCAGAA CAGGATTAGG AGGAACTCCC TGGGCGTATC AGCGGACTGG GCACCAAATG GTGGTTTATG GTGA
|
Protein sequence | MKVLLVGDGA REHAMAEALA NSPQGYRVYA LSSYVNPGIR EAVNRTGGKY VQGNINSRED VAKAIREFNP DFGVVGPEDP LFHGIADEFR RNGIPVVGPN RAGAEIERSK VWMRQLMWKY KIDGRLRFRS FTSLEEASRF IVEYGGSVAV KPAEQVGGKG VKVVADIQAY LSNEKRRALS KSVDEIGSLV KNEVKIIIEE KVDGPEYTLH VLTDGHTFLP LPLAQDYKHA YQDGIGPETG GMGSVSGPGR LLPFITEEEY EKTLKIVQDT ARAIREETGE PYRGFISGQM MLTELWGPTV IEFYSRMGDP ETSAIIPRIS SDFGYLLQLT AEEKLSQAKL EVREDPTVVR AIAPLGYPLR REMATGKEIF LDVNSMREKG CLVYFGSVSY EGGKLITRGS RALELVAFGD FEEASRKLDT CTKMISANTE LVYRTDIGRT LGEQAEKAEI VRYSYQNRIR RNSLGVSADW APNGGLW
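The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration only, not the pipeline that produced this record. It builds the codon table from the standard NCBI amino-acid ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. The `translate` helper is a hypothetical name, and it assumes the input is an in-frame coding strand containing only A, C, G and T.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position), paired with the
# amino acids of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGGTTC TACTTGTGGG AGACGGAGCC"))  # MKVLLVGDGA
```

The first ten residues obtained this way match the start of the listed protein sequence, and the final eight codons of the gene sequence translate to APNGGLW followed by the TGA stop codon.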