Gene Msed_1449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1449 
Symbol 
ID5104819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1416100 
End bp1417185 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content48% 
IMG OID640507337 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_001191530 
Protein GI146304214 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTACT CTCCACGGAA CTCGTTTAAG TTCTGTATCC TTGGGGGAGG ACAACTTGGC 
TGGATGATGG TTCTTGAGGG ACTGAAGTTT CCAATCTCCT TCCACGTATA CGGAGAGAAA
GAGGATCCAG CCTGCAAGAT TGCCAACTGC TTCAAGGAAG AGTACAGGAA GGTTATTGAG
GAGTGCGACG TCGTCACATA CGAGTTTGAG CACGTGGATG ATAAGCCACT TGAACTAGCT
AGGGACCTTA ACAAGCTAAT GCCTGGAATG AATGCGGTCG AGCTCAAGAG GGTGAGACAT
CTAGAGAAGG AATTTCTTAG AAGAGAAGGA TTACCGGTAC CGCGTTTCGT CACTGTGAGG
GGTGGAGATG AGGCACTTAG GGTTCTCAAG AACGAGTTCA ATGGCACGGG AGTTATAAAG
AGATCCAAGG GTGGTTACGA CGGAAAGGGG CAGTTCTTCG TGAGGGGAGA CCCTGAAAAA
TACTCTTTCC TTAGGGATGA GAACGATTAC TTCGTTGTTG AGGAACTGGT CAACTTCGAC
TATGAGGCCT CAATAATAGC TGTGAGGAGG GGGAACGAGT TCAGGGCCTA TCCTCCGACG
TTCAATTACA ACGAGAAGGG AATCCTCGTC TATAATTACG GGCCCTTCGG TAACGAGGAG
ATGGTGAAGA TTGCCGAGGA ACTCACGAGA AAGTTAAACT ACACTGGCGT AATAGGAATA
GAGTTCTTCG TTAAGGATGG AAAGGTTCTC ATCAACGAGT TTGCTCCCAG GGTTCATAAC
ACGGGCCATT ACACCTTGGA CGGAGCTGAG GTATCCCAGT TTGAACAACA CGTTAGGGCC
CTGGCTGGAT TGGAACTAGG GAGTACCAAA GTGCTTACCT TCTCGGGGAT GATAAACATA
CTGGGCATAG CCTCTCCTCC CATGGAGATC CTAAAGCTCG GAACCCTATA CTGGTATGGA
AAAAGCGAGG CTAGGAAGAG GAGGAAGATG GGGCACGTGA ACGTTCTAGG AGATGATCTG
GCTGAAGTTA AGGAAAAGAT TGAAAATGTT ATGAATATAT TATATCCCAA TGGGCCTGAT
CTATGA
 
Protein sequence
MTYSPRNSFK FCILGGGQLG WMMVLEGLKF PISFHVYGEK EDPACKIANC FKEEYRKVIE 
ECDVVTYEFE HVDDKPLELA RDLNKLMPGM NAVELKRVRH LEKEFLRREG LPVPRFVTVR
GGDEALRVLK NEFNGTGVIK RSKGGYDGKG QFFVRGDPEK YSFLRDENDY FVVEELVNFD
YEASIIAVRR GNEFRAYPPT FNYNEKGILV YNYGPFGNEE MVKIAEELTR KLNYTGVIGI
EFFVKDGKVL INEFAPRVHN TGHYTLDGAE VSQFEQHVRA LAGLELGSTK VLTFSGMINI
LGIASPPMEI LKLGTLYWYG KSEARKRRKM GHVNVLGDDL AEVKEKIENV MNILYPNGPD
L