Gene Msed_1980 details


Gene Information

Locus tag: Msed_1980
Symbol:
ID: 5103367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1913092
End bp: 1915200
Gene Length: 2109 bp
Protein Length: 702 aa
Translation table: 11
GC content: 54%
IMG OID: 640507868
Product: phosphoribosylformylglycinamidine synthase II
Protein accession: YP_001192044
Protein GI: 146304728
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain
TIGRFAM ID: [TIGR01736] phosphoribosylformylglycinamidine synthase II
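
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the annotated CDS includes its stop codon; the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1913092, 1915200)
print(length)                  # 2109
print(protein_length(length))  # 702
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of the record.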


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCTCA CGTTATCCAA GACTGAGATG GAGGTAGTGA GGAGGGTTCT CGGGAGGGAA 
CCCAAGGAAG CTGAGTGGAA GCTTGTGGAT GCACTTTGGT CGGAGCATTG TTCCTACAAG
AGCTCGAAGG TATTCCTTAG GAGCTTGCCC AGTACAGGTC CCAACGTGAT AATGAGCGTT
GAGGACTGGC AGGACGCCGG TGCCGTGGAC GTGGGTGATG GTCTAGCCCT GGTCCTCAAG
GTTGAGAGCC ACAACCATCC CTCAGCCATA GATCCCTTCA ACGGTGCAGC TACGGGAGTT
GGGGGAATCC TAAGGGATAT CATAAGTAAG GGAGCGAAAC CCATAGCCCT CCTCGACATG
ATAAGGGTTG GCAAACTGGA TGACAAGGGG AAGTGGCTAC TCAAGAACAT AATCTCAGGG
ATAGGCTTTT ACGGGAACAG CGTCGGGGTA CCTGTAGTCG GAGGAGAGCT GTCCTTCGAC
GAAACGTATA ACGATAATCC GCTCGTGGAC GTGGCCTGCG CAGGTATAGT GAGCAAAGAC
TCCATAGTCC CCAGCGTGGT TCGGGAGCCA GGTCTCAGGC TCGTGTTGGC AGGCTTCACT
GGACTTGACG GTCTAGGGGG AGCCTCGTTT GCCTCTAGGA AATTGAGCGG AATGGATGAG
ATAGGGGCAG TTCAGATAGC TGATCCCTTT GCGGGGAAGA TAATCATTGA CGTAACCCTT
GAGATCGCCA AGGAGGTTGA GGCAATAAAG GACCTCGGTG GTGGGGGACT AGCAGTTGCC
GTTACTGAGA TGGCCAACGG ATTGGGGGCA GTGGTTAACC TGGAGAGGGT ACCACTGAGG
TTTGAGTACC TCTCGCCAGA GGACATCATA ATATCCGAGA CCCAGGAGAG AATGCTCTTT
GCGGTGAAGC CCGAGAAAGT GGAGTCCGTG TGCGCCAAGT TCAGGGAGTA CAACTACCCC
TGTGAGGACA TAGGCGAGAT CACGGCCGAA CCCAAGGTGA GGTTCCTGTA TAATGGTGAG
CCTGTGGCGG AGCTACCCTC AGACCTTCTC CTATCTCCCC CACTCAATGT CTGGCCAGCT
GAGAGGAAGC CGAGGAAAAG AGAGCCTGAA AGGGAAGTCG CCCTAGGCGA GGCACTGAGG
GCAGTCCTCA CACATCCTGA CCTTGTGAGC AAGGAGTGGG CCTACTCCCA GTTCGATTAC
GAGGTGGGGA CGTCCACCGT GGTGAAGCCT GGACAGGGTG ACAGCGCGGT TGTGGAGTTG
CCCAATGGGA AATACCTGGC GCTTAAGGGG GATGCAAATC AGGACCTATG TGCCGAGGAC
GCTTACGAGT GCGGTAAAGC CATAGTAGCA GAGGCCTACA GGAATCTGGC AACAGTTGGG
GCGAGAGGAA TAGCCCTTGT GGACCATCTG CAGTTTGGCG ATCCAAGGAA GCCTCACGTC
TATCAGGACT TCATTGACGC TGTGAGGGGA ATATCGGAGG CTTCCAAGTT CTTCTCGATA
CCCGTTGTGG GCGGAAAGGT CTCGTTCCAC AACGAGGATA AGAACGGTAA TCCCATAAAG
CCTACACCCC TGGTGGTGAT GGCTGGGCTA GTTGAGGGCA AGCTGGCCAG AAATAGGGTA
GAGGAAGGAG ATCTAGTGCT AATTGGGGAG ACTAGGAACG AGCTGGCTGG AACCCTTTTC
TCCAGGATAT TTGGAGGAGG AGGCGAGGTG GCCAAAACCA GATTAATGGA GGACCTAATA
GCGTCGAACC TGGTAATAAA GGGAATAAAC GAGGGTAAGA TAACTTGGAA CAAGGATGTC
TCCAAGGGCG GTCTAGCCGG AGCACTTCTC CCAATCCTAG CTAAGGGATT CTCCGTGAGG
ATCTCCTCCT CCCAGGTAAT CGGTACCTCA AATCTCCTGG GCAAGATGTT CTCGGAGAGC
GGTGGAAGGT TTCTAGTCCT CACCAGTGAT CCGCAGTGGT TCATGTACCA GGCTGGCAGG
ATGGGAATAC AGGCGTTGGC CATAGGGAAG GTGACCAAGG ATGGGAGCAA GCTCGTTCTA
GATTACGAGA CCTTTCCCAT GGACTCTATC GTGGAGAACT ACTACTCCTT TCTGGAGGGA
TCGCTATGA
 
Protein sequence
MILTLSKTEM EVVRRVLGRE PKEAEWKLVD ALWSEHCSYK SSKVFLRSLP STGPNVIMSV 
EDWQDAGAVD VGDGLALVLK VESHNHPSAI DPFNGAATGV GGILRDIISK GAKPIALLDM
IRVGKLDDKG KWLLKNIISG IGFYGNSVGV PVVGGELSFD ETYNDNPLVD VACAGIVSKD
SIVPSVVREP GLRLVLAGFT GLDGLGGASF ASRKLSGMDE IGAVQIADPF AGKIIIDVTL
EIAKEVEAIK DLGGGGLAVA VTEMANGLGA VVNLERVPLR FEYLSPEDII ISETQERMLF
AVKPEKVESV CAKFREYNYP CEDIGEITAE PKVRFLYNGE PVAELPSDLL LSPPLNVWPA
ERKPRKREPE REVALGEALR AVLTHPDLVS KEWAYSQFDY EVGTSTVVKP GQGDSAVVEL
PNGKYLALKG DANQDLCAED AYECGKAIVA EAYRNLATVG ARGIALVDHL QFGDPRKPHV
YQDFIDAVRG ISEASKFFSI PVVGGKVSFH NEDKNGNPIK PTPLVVMAGL VEGKLARNRV
EEGDLVLIGE TRNELAGTLF SRIFGGGGEV AKTRLMEDLI ASNLVIKGIN EGKITWNKDV
SKGGLAGALL PILAKGFSVR ISSSQVIGTS NLLGKMFSES GGRFLVLTSD PQWFMYQAGR
MGIQALAIGK VTKDGSKLVL DYETFPMDSI VENYYSFLEG SL
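
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they relate, the Python below strips that whitespace, computes GC content, and translates the nucleotide sequence. It is an illustration rather than the database's own method. The helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as start codons; this gene begins with ATG).

```python
from itertools import product

# Standard codon assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as printed in the record.
first_line = "ATGATCCTCA CGTTATCCAA GACTGAGATG GAGGTAGTGA GGAGGGTTCT CGGGAGGGAA"
print(translate(first_line))  # MILTLSKTEMEVVRRVLGRE
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.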