Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1219 |
Symbol | |
ID | 5103833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1192442 |
End bp | 1193614 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640507111 |
Product | amidohydrolase |
Protein accession | YP_001191304 |
Protein GI | 146303988 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.82253 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAAAT TTAGCGGTAA AATTTTCGAC GGGACTAAAC TCATTGAGGG GACAGTCTTG GTTGAGGGTA ATCAAATCGT TGAGGTCGAG GAGGGAAAGA TGGAGAGTGA CTCCACGAAG GGTTTCATCA TCCCTGGAAT GATTGATGCC CACCTCCACT TCTTCGGAGT TCATGAGGAC AACGTAATGT CTTGGAACCT CGTGAACGAG ATCGACGTGG CCATTAGGAG CACCAGGGAT ATGGAGAGAC TTCTTAGGTC AGGGTTTACA ACGGTCAGGG ATCTGGGAAG TAAGGTGGCA ACTAGGCTCT CCAACCTGGA GAGATCTGGG GAGATCATAG GCCCTAGAGT AATAGCCTCA GGTTACTCCT TGGCCATCAC GGGAGGAGAC GACGATCCGC GGGATTTGCC CCTTGAGATG GCACAAAAGC TGTCGTACTC CTTCTATTGT GATTCTCCCT ATGAGTGCAG GAAGGCCGTG AGGTTAGCCG TGAGGCAGGG AGCTGGAGTC ATAAAGGTCT ACGCCTCGGG AGCGTTTTCC CAGGGCGGAA AGATCCTTCC AGGGCTGGGA CCATACGAGC TGAAATCCAT CGTGGAGGAG TCTCATAGGT TTGGGCTTAA GGTAGCGTCC CACGCCTATG GAAGGGAGGC AATCCTCAAC TCAGTTGAGG CAGGAGTGGA CACCATTGAA CACGGGCTTG GCCTCGATAA GGACACGGCA TCCATGATGT TAGACAGGGG GACATGTTAT ATCCCTACCC TAGCTACGTA TGAGATCCCA TTTCACGTGG CAAACCCAGA GGTAAGGAGA TACAGGGAAG AGGCGGTCTC AAGGCATATG AAGGAAGACG TTAAGTTAGC CAAGTCCGTG GGACTTAAGA TCGCCACCGG GACGGATTAC GTGGGTTCAG ATGCTAGACC ACATGGCAAA AATTACAGGG AAGCGGTCCT CCTCTCGCAG TTCATGGGAA ACGACGAAGT TCTTGCATCA ACAACTTCCG TGGCGGCGGA GTGTCTGGGA ATAAGGGCTG GTAGGATAGA GAAGGGATAT CTGGCAGACC TCGTGGTTCT GAGGAATGAT CCTCTCCAGA ACGTGGAGAA CCTTTCGCCC GAGAACGTGC TTTACGTCGT TAAGGACGGG AAAATGTATC GAGGAGTAGG AAGGGAGGAC TAA
|
Protein sequence | MLKFSGKIFD GTKLIEGTVL VEGNQIVEVE EGKMESDSTK GFIIPGMIDA HLHFFGVHED NVMSWNLVNE IDVAIRSTRD MERLLRSGFT TVRDLGSKVA TRLSNLERSG EIIGPRVIAS GYSLAITGGD DDPRDLPLEM AQKLSYSFYC DSPYECRKAV RLAVRQGAGV IKVYASGAFS QGGKILPGLG PYELKSIVEE SHRFGLKVAS HAYGREAILN SVEAGVDTIE HGLGLDKDTA SMMLDRGTCY IPTLATYEIP FHVANPEVRR YREEAVSRHM KEDVKLAKSV GLKIATGTDY VGSDARPHGK NYREAVLLSQ FMGNDEVLAS TTSVAAECLG IRAGRIEKGY LADLVVLRND PLQNVENLSP ENVLYVVKDG KMYRGVGRED
|
| |