Gene Msed_1984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1984 
Symbol 
ID5103371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1919175 
End bp1920146 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content51% 
IMG OID640507872 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001192048 
Protein GI146304732 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0608348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGTG AGTACAAGGA TGCCGGGGTT GACCTAAATA AGTTGAAGGA GATACACAGG 
GATATAGCTT CCGCGATCTC CTCAACGTAC AGGAGAACTG TGCTGGGGGC AGGGCACTAC
TCTGGTGTTG TGGAGATAAA CGGGTTGAAA CTCGCAATTC ACGTGGATGG GGTTGGCACC
AAGGTTATCC TTGCCAAAAG GGCCAGAAAA TATCGGAGCG TCGGGATAGA TTGTGTTGCC
ATGAACGTGA ACGATCTCAT TAGCATAGGT GCTAAGCCCA TTGCCCTAGT GGACTACATT
GCCATGGACC AGCCATCCGA AGAGGTGATA TCAGAGATAG TCCAGGGACT GGTGCAGGGA
GCCAAGGAGT CTGACACTGA GATAGTGGGA GGAGAGACGG CAGCGATGAA GGATGTGGTG
AACGGCTTCG ATCTGTCCTG TACCGCGCTG GGCGTTGTGG ATAAACTGAA GACTGGGGAA
GAGGTTTCTC CCGGGGACGT GATTATTGGG CTAGCTAGTA ACGGAGTTCA CGCCAATGGC
TACTCCTTGG TCAGGAAGCT CCTTGATGAG GGGAAGCTAT CGTGGAAGGA TTGGGAGGAG
GAGCTCCTGA AACCCACCAG GATCTACGTT AAGCCTGTCC TCGAGGTTCT GGAACTCATC
AAGGCAGCTG GACACATCAC GGGGGGTTCC TTCAGTAAGC TCAGGAGGAT AACCAACTAC
TCACTGGAGT TGACCCTCCC AGATCCACCC CTGATCTTCA AGACCATTGA ACAAGCTGGT
ATTTCGCACG AGGAAATGCA CAGGGTCTTC AACATGGGTA TTGGCATGGT AGTCTTTGTG
GATAGAACCA ACGCCGAGGA TGTTCTTAGG AAATTAAACC CCTATGTCCC ATCACAGATT
ATTGGCGAGG TTAAGGACAA CGTTGGTCAG ATCAAAATTA CCACGTATAA GTCCCAGGTT
CTTTATTTAT AG
 
Protein sequence
MVSEYKDAGV DLNKLKEIHR DIASAISSTY RRTVLGAGHY SGVVEINGLK LAIHVDGVGT 
KVILAKRARK YRSVGIDCVA MNVNDLISIG AKPIALVDYI AMDQPSEEVI SEIVQGLVQG
AKESDTEIVG GETAAMKDVV NGFDLSCTAL GVVDKLKTGE EVSPGDVIIG LASNGVHANG
YSLVRKLLDE GKLSWKDWEE ELLKPTRIYV KPVLEVLELI KAAGHITGGS FSKLRRITNY
SLELTLPDPP LIFKTIEQAG ISHEEMHRVF NMGIGMVVFV DRTNAEDVLR KLNPYVPSQI
IGEVKDNVGQ IKITTYKSQV LYL