Gene Msed_0018 details

Gene Information

Locus tag: Msed_0018
Symbol:
ID: 5105157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 15575
End bp: 16951
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 46%
IMG OID: 640505911
Product: adenylosuccinate lyase
Protein accession: YP_001190119
Protein GI: 146302803
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0015] Adenylosuccinate lyase
TIGRFAM ID: [TIGR00928] adenylosuccinate lyase
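
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 15575
end_bp = 16951
gene_length_bp = 1377
protein_length_aa = 458

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span agrees with Gene Length
print(span_bp % 3 == 0)                          # whole number of codons
print(expected_protein_aa == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions all three checks agree with the listed values.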


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.146335
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000415583
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGTATGG ATGTGGTTTG TCCCATAGAC TGGAGATATG GTAGCAAGGA AATGAGGAGG 
ATATTCACTA GAGAGGGAAT CATTGAGTAC AGAATTAAGG TTGAATTAGC GTTACTTTAC
GCCCTAAAGG AACTGGGCTA CGTATCTGAG GAGGAATATG GTAAGGTGGA GAAAGCCTCA
GAGAGCGTCA CGCCCGAAGA GGTGGATAAC CTAGAGGCCA AACTTGGACA TGATGTGATG
GCCCTTGTGG TATCCATGGC AGAAAAGGCT GGGGATGCGG GTAGATTCGT ACATTTTGGT
GCAACTAGTT ACGATATAGT GGACACTGCC TATGCCTTGA TGTTCAGAGA TGCGCTATCC
ATCCTGAAGG CTAAGTTCGT CAAATCTTTA GATAAATTAA AAGATCTTAG CCTGAAGTAT
CAGGACCAGG TTATGGTGGG CAGGACTCAC GCTCAACATG CAGTACCCAT CACCTTGGGA
TTCAAATTCG CGAACTACTT GTATGAGATG ACAAGGTCAT TGGAAAGGTT GGTAGAGACC
GAGAAGAGGG TAGTCCTGGG TAAAATGAGC GGAGCTGTCG GTACTATGGC GAGTTGGGGA
AAGGATGGAC TCAAGGTGGA AGAGTTGGTT ATGAGGAGGC TTAACCTGAG GCCTCACTCC
ATATCCACTC AGGTTGCACC TAGGGACGGT TTTGCCGAGT TGATCAGCGA TCTAGCCATA
GCTGGATCGG TGATGGACAG GTTCGCGATA GAGGTTAGGG AGCTCATGAG GCCTGAGATC
GGGGAGATGG CCGAGGGAGT AGGGGACAGG GTCGGTAGCA GTACCATGCC CCACAAGGAG
AACCCGGTAA CTGCGGAGAA GATTAGTGGT CTCTCCAAGT TACTAAGAGG AATGGTCGTA
TCTGAACTGG AAAATATTCC ACTCTGGCAT GAGAGAGATT TAACCAATAG CTCAAGTGAG
AGATTTATCA TATCCCACGC CTTCCTTGTT ATGGACGAGA TGCTAGACAG CTTTAATGAA
TTATTATCTA ACTTGAGAAT TAATTTAGAC GCCATGGAAA GGAATCTTCG GCTAAGTAAA
GGATTGAATA TGGCTGAAAG CTTAATGATT AACCTCACAT TGAAGGGTTT ACCTAGGCAT
AAGGCCCATG AGATAGTTAG CAAGATCTCC AGAGAGGCCA GAAAAGGAAA TTCCAGTCTC
CTGGAGGAGG CCATCAAAGA CAAAACAATC TCGTCGATGT TCAGTCAAGA GGAGCTTGAA
AGAATATTGA ATCCCAAGGC CTATCTGGGC CAGTATAAAA CACTTATACA ACGGGCTATA
GATTACTATG AGAACCTAGT AAGAGAGCTT AATGAGACCA GGGGAGTAAT ACATTAA
 
Protein sequence
MSMDVVCPID WRYGSKEMRR IFTREGIIEY RIKVELALLY ALKELGYVSE EEYGKVEKAS 
ESVTPEEVDN LEAKLGHDVM ALVVSMAEKA GDAGRFVHFG ATSYDIVDTA YALMFRDALS
ILKAKFVKSL DKLKDLSLKY QDQVMVGRTH AQHAVPITLG FKFANYLYEM TRSLERLVET
EKRVVLGKMS GAVGTMASWG KDGLKVEELV MRRLNLRPHS ISTQVAPRDG FAELISDLAI
AGSVMDRFAI EVRELMRPEI GEMAEGVGDR VGSSTMPHKE NPVTAEKISG LSKLLRGMVV
SELENIPLWH ERDLTNSSSE RFIISHAFLV MDEMLDSFNE LLSNLRINLD AMERNLRLSK
GLNMAESLMI NLTLKGLPRH KAHEIVSKIS REARKGNSSL LEEAIKDKTI SSMFSQEELE
RILNPKAYLG QYKTLIQRAI DYYENLVREL NETRGVIH
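
The sequences in this section are printed in space-separated blocks of ten characters, so they need to be joined before any further use. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence in this record, used as an example fragment.
fragment = clean_sequence(
    "ATGAGTATGG ATGTGGTTTG TCCCATAGAC TGGAGATATG GTAGCAAGGA AATGAGGAGG"
)

print(len(fragment))   # six blocks of ten bases
print(fragment[:3])    # first codon of the fragment
print(round(gc_fraction(fragment), 2))
```

Applied to the full gene sequence, the same two helpers could be used to compare the computed length and GC fraction with the Gene Length and GC content fields of the record.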