Gene Msed_1522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1522 
Symbol 
ID5104050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1483754 
End bp1484866 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content47% 
IMG OID640507409 
Productcitrate synthase 
Protein accessionYP_001191602 
Protein GI146304286 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01800] 2-methylcitrate synthase/citrate synthase II 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.640515 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.251629 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAGTTAA GGAAGGGACT TGAAGACATT GCAATAAAGG AGACTTCAAT AACCTATATT 
GACGGTGAAC TTGGCAGACT TTACTACAGG GGTTACTCCA TCTTTGATCT GGCCAGCTTC
TCGAATTTCG AGGAAGTCGC ATATCTCCTG TGGTATGGTA AATTACCCAC TAGACACGAG
TTGGACGATT TCAAGTCAAG GTTGGCTGAG GAGAGATCCA TCTCTGAGGA CATCTCTACC
TTCGTTAAAA GAACCGCAAA GTTTGGCAAC CCCATGGATA TACTTAGAAC TACCGTTAGC
ATGATGGGTC TAGAGGATAG GAGTGAGGGA GACCTCATAG GAAAGGCAAT AAAGATGACT
GCTAAGATCC CAACCATAAT ATCTCTCATC CAGAGGACTA GAAGGAACCA GGAGTTCGTT
GAGCCTGATC CCTCTCTTTC CCACTCCGAA AATTTCCTTT ACATGATTAG GGGAGAGAGG
CCATCCCCCT CAGACACCAG GGTGTTGGAC GTTTCCTTGA TGCTACATAT GGACCATGAA
ATGAACGCCT CAACAATGGC ATGCCTGGTT GTAGCCTCTA CCCTTTCCGA TATCTACTCC
TCAGTCGTTG CTGGAATTTC TGCGTTAAAG GGCCCCCTTC ACGGTGGAGC CAACTCGGAG
GCCTTGAAGC AGTTCATGGA GATAGAAACC CCAGACAACG TGGAGAAATA CGTGATGAAC
AAGCTCAGTT CTGGGCAGAG GTTGATGGGA TTTGGACACA GGATTTACAA GACCATGGAT
CCAAGGGCCA AGATACTCAA GGAGTACGCG AATCAGCTCT CCAAGAACGA GGAAATCAAG
AGGTTATTTG AGATCGCGAA TAGGGTTGAG GAAATTGGTA TAAAAATACT GGGTAAGAGG
GGAATCTATC CCAACGTGGA CTTCTACTCT GGACTCGTGT TTTACGCCAT GGGTTTTGAC
CCTGACCTGT TCCCTACGAT ATTTGCATCT GCCAGGGTCA TAGGATGGAC AGCCCACGTG
GATGAATACC TGAAGGACAA CAAGCTCATA AGGCCCAAGG CCATATACGT GGGGGATCTA
GGAAAAAGGT ATGTTCCCAT AGAAGAAAGG TAA
 
Protein sequence
MELRKGLEDI AIKETSITYI DGELGRLYYR GYSIFDLASF SNFEEVAYLL WYGKLPTRHE 
LDDFKSRLAE ERSISEDIST FVKRTAKFGN PMDILRTTVS MMGLEDRSEG DLIGKAIKMT
AKIPTIISLI QRTRRNQEFV EPDPSLSHSE NFLYMIRGER PSPSDTRVLD VSLMLHMDHE
MNASTMACLV VASTLSDIYS SVVAGISALK GPLHGGANSE ALKQFMEIET PDNVEKYVMN
KLSSGQRLMG FGHRIYKTMD PRAKILKEYA NQLSKNEEIK RLFEIANRVE EIGIKILGKR
GIYPNVDFYS GLVFYAMGFD PDLFPTIFAS ARVIGWTAHV DEYLKDNKLI RPKAIYVGDL
GKRYVPIEER