Gene Msed_0534 details


Gene Information

Locus tag: Msed_0534
Symbol: purT
ID: 5103694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 491129
End bp: 492316
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 50%
IMG OID: 640506438
Product: phosphoribosylglycinamide formyltransferase 2
Protein accession: YP_001190633
Protein GI: 146303317
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0027] Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase)
TIGRFAM ID: [TIGR01142] phosphoribosylglycinamide formyltransferase 2
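
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon, if counted in the gene length, does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(491129, 492316)
print(length)                              # 1188
print(expected_protein_length_aa(length))  # 395
```

Both results match the Gene Length and Protein Length fields of the record.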


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0760232
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTTG GAACCCCATT GGTGGGAAAC GGAAAGAAGA TTCTCCTTCT TGGTAGCGGT 
GAACTGGGTA AGGAGATGGT CATAGAGGCA CAGAGGATGG GAATAGAGAC TGTAGCTGTG
GATAGATATG ATATGGCACC AGCCATGCAT GTTGCACATA GGAAATACGT TGTGGACATG
CTCAATGGTA GCGCAATTAG GGCAATAATC AAGAGAGAGA ACCCAGACGC GGTGATAGCG
GAGATAGAGG CCATTGACAC TGACGCATTG CTTGACCTTG AGGATCAGGG AGTCAGGGTG
ATACCCAACG CAAACGCTGT GAAGACATGT ATGAACAGGA TGCAGTTAAG GAAGCTGGCT
GCGGAAAAGG TAGGCGTGCC CACAACTAGG TACGCCTTTG CGAGCGACGA GGAGGAGGCG
AGAAGGGCGT GTAAAGAGGT TGGATTTCCG TGCCTCCTGA AACCCGAGAT GAGCTCCAGC
GGTCATGGTC ACGTTCTGGT GAAATCAGAG GATGAGGTGG AGAAGGGCTT CAGGGAATCG
GTATCCCATG CTAGAGGTAA GAGCAGAACT GTAATAGTTG AAGAGTACGT CAAGGTGGAC
ACCGAGCTCA CCGTTCTCAC CTATCGTCAC ATGAATAACG GATCCATAGA GACCAGAACC
ATTGAACCCA TAGAGCATCA AAGGCCCAGC TACTATTACG TCGAGTCATG GCAACCATCC
ACGGTGAGCC AGGAGGTTAT TGCAAGGTCA AGGGAATACG CCACTAGGGT GGTGAACGAG
TTGGGTGGTC TCGGGATATT TGGGGTGGAG ATAATTGTCT CAGGGAACAG GGTACTTTTC
AGTGAAGTAT CGCCGAGGCC ACATGATACA GGCCTCGTCA CCCTGGCCAG TCAAGACATC
AGTGAGTTTC AGATTCATGT TAGGGCAGCA TTGGGTTTAC CTATACCTCA GGTGAGGGTA
TTAACGCCAG CAGCCTCCCA TGTGATCCTC GCCCAATATG AGACTTGGGC TCCATCCTAC
CTGAACGTGG AGAAGGCCCT CTCTATTCCA GGTGTTCAGG TTAGATTCTT CGGCAAACCT
TCAACCTATG ACAAGAGGAG AATGGGAGTG GTACTAGCAA ACGGAAATGA TGTGAATGAG
GCAAGGGACA AGGCGAGAAA GGCTTCCTCC CTCATCCTTG TTAAGTAA
 
Protein sequence
MEFGTPLVGN GKKILLLGSG ELGKEMVIEA QRMGIETVAV DRYDMAPAMH VAHRKYVVDM 
LNGSAIRAII KRENPDAVIA EIEAIDTDAL LDLEDQGVRV IPNANAVKTC MNRMQLRKLA
AEKVGVPTTR YAFASDEEEA RRACKEVGFP CLLKPEMSSS GHGHVLVKSE DEVEKGFRES
VSHARGKSRT VIVEEYVKVD TELTVLTYRH MNNGSIETRT IEPIEHQRPS YYYVESWQPS
TVSQEVIARS REYATRVVNE LGGLGIFGVE IIVSGNRVLF SEVSPRPHDT GLVTLASQDI
SEFQIHVRAA LGLPIPQVRV LTPAASHVIL AQYETWAPSY LNVEKALSIP GVQVRFFGKP
STYDKRRMGV VLANGNDVNE ARDKARKASS LILVK
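
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing might be cleaned and how the gene sequence relates to the protein sequence. It assumes the amino acid assignments of NCBI translation table 11 (which are the same as those of the standard code) and builds the codon table by hand rather than relying on any library; the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGGAATTTG GAACCCCATT GGTGGGAAAC GGAAAGAAGA TTCTCCTTCT TGGTAGCGGT"
dna = clean_sequence(first_line)
print(translate(dna))    # MEFGTPLVGNGKKILLLGSG
print(gc_fraction(dna))  # GC fraction of these 60 bases only
```

The translated fragment matches the first 20 residues of the protein sequence. Applying the same functions to the full gene sequence would give the complete translation and the overall GC fraction for comparison with the GC content field.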