Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0281 |
Symbol | |
ID | 5104917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 240332 |
End bp | 241468 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506187 |
Product | citrate synthase |
Protein accession | YP_001190382 |
Protein GI | 146303066 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01800] 2-methylcitrate synthase/citrate synthase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.160338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.143922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA TAAGTAGAGG ATTGGAAAAC GTATTCATTA AGACAACCTC CTTAACCTAC ATAGATGGAG AGAACGGGAT ACTGAGGTAC GGAGGATATG ATATAGAGGA TCTAGTTGAA CACACCAGTT TTGAGGAAGT AGTACATCTT ATGCTATATG GGGACCTGCC CACTAAACTT CAACTCCAGA GATTGAAGAG TGCATTAGAC GAGGCATATG AAGTACCGCA GCAGGTCATT GACATGATAT ATTCTCTGCC AAGGGATTCG GATGCTGTGG GTATGATGGA AACAGCCTTT TCAGCCCTTT CCTCCATCTA CGGAATGCCA TGGAACAAGG CCACAAATAG GGATAACGCG GTTAAGCTGG TGGCAAGGGC CTCCACCGTG GTCGCAAACG TGTTAAGGGC AAAGGAGGGA AAGAAACCTG CCATACCTGA GCCCTCTGAG AGCTTCGCTA AGAGCTTTCT AAAGGCCTCA TTCTCCAGAA CGCCCACAGA GGAAGAGGTT AAGGCAATGG ATGCTGCATT AATACTCTAC GCAGACCATG AAGTTCCGGC ATCCACAACG GCAGCCCTAG TCACCTCGTC AACTCTCTCT GACATTTACT CCTGTGTAGT AGCAGCCCTT GCAGCACTGA AGGGACCCCT ACATGGTGGT GCCGCAGAGG AGGCGTTCAA GCAGTTTGTG GAGATTGGAG AGCCTGACAT GACTGAGTCA TGGTTTAAGA GAAAGATAAT CGAGGGCAAG TCCAGGCTAA TGGGGTTCGG ACACAGGGTA TACAAGACCT ACGATCCTAG GGCAAAGATA TTCAAGAAGT ACGCTAAAGT TATCTCGGAG AGGAACAGTG ACGCCAGAAA ATATTTCGAA ATAGCCCAGA AGTTAGAGGA GTTAGGAGTG GAAACCTTCG GTGCCAAGCA CATCTACCCG AACACAGATT TCTACTCGGG TGTAGTGTTC TACGCTTTAG GATTCCCAGT CTATATGTTC ACTTCCCTGT TCGCCCTTTC CAGGACTCTG GGATGGACAG CTCATGTCAT AGAATACGTG GAAGATCAGC ATAGACTCAT AAGACCTAGG GCCTTGTACG TTGGTCCTCT GAAGAGGGAT GTCGTGCCTA TAGAATTGAG AGGATAA
|
Protein sequence | MSQISRGLEN VFIKTTSLTY IDGENGILRY GGYDIEDLVE HTSFEEVVHL MLYGDLPTKL QLQRLKSALD EAYEVPQQVI DMIYSLPRDS DAVGMMETAF SALSSIYGMP WNKATNRDNA VKLVARASTV VANVLRAKEG KKPAIPEPSE SFAKSFLKAS FSRTPTEEEV KAMDAALILY ADHEVPASTT AALVTSSTLS DIYSCVVAAL AALKGPLHGG AAEEAFKQFV EIGEPDMTES WFKRKIIEGK SRLMGFGHRV YKTYDPRAKI FKKYAKVISE RNSDARKYFE IAQKLEELGV ETFGAKHIYP NTDFYSGVVF YALGFPVYMF TSLFALSRTL GWTAHVIEYV EDQHRLIRPR ALYVGPLKRD VVPIELRG
|
| |