Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1484 |
Symbol | |
ID | 5104731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1451562 |
End bp | 1452818 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507372 |
Product | hypothetical protein |
Protein accession | YP_001191565 |
Protein GI | 146304249 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.555419 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTCTA TTCAAGACCT CATCGAGTTT GCCAGGATAG ACACGACCTC GGCCAAGGGG AAAGGAGAGG AGGGCGCGAA GTTCATAAGA GATTACATGC AGGAGCACGG AATCGAGGCA AGGATAATCA GGCACAGATC TAAGAATCCC TATGTCTATG GGGAGATCAA CGTTGGGGCC ACTAAGACAT TGCTAATCTA TAACCATTAC GATGTACAGC CGGCTGAGCC ACTCGATAAG TGGAACAGCG ATCCCTTCGA TCCCGTGATT AAGGACGGTA AGCTAGTGGG AAGAGGGGTT GGAGATGATA AGGGCTCGCT CATGGCTAGA CTGCAAGCGT TGATCGAGAT GGGGAAGCCT CCGATGAATA TTAAGTTCAT TTATGAGGGG GAAGAGGAGA TAGGTAGTCC CAACATAGAC TTATTTCTAG CAGAGCACAG GGAATTACTA GCCTCAGATT ACGTTCTCTG GGAAGGTGCA GGAAGGGGAT CCAGTGGCGC ACCCACCATA GTCCTTGGAG TAAAGGGCCT ACTATACGTT GAAATCTCGG TAAGAACACA AAAGGACCTT CACTCCATGT ATGCCCCTGT GGCCAAGAAC CCGGCCTGGG AGTTAGTTTA CTTGCTGTCT TCCCTAAAGA GTGGTGGTAG GGTTAACCTT CCTGGTTTCT ATGACAAGGT CAAATGGCTC ACGGAAGAGG AAAGGAGATA TCTTAAGGGA AATAAGAGGT CCATGGAGGA CGCACTGGGA CAGGAACTCC CAGACGATTT TCAGAGGAGA CTAGTGGAGG AACCTACGTG TAACATTGCG GGACTATACT CGGGATACAC AGGTGAGGGA TCTAAGACGG TGATTCCATC CTATGCCATG GCAAAACTGG ACTTTAGACT TGTACCAGAC CAGGACCCTG ACGAAATCCT GAAGATACTT CAGGACCATC TAAAGGGAGT GGATATCAAG GTCTGGGGTA AGGTAAGGCC CTATAGGACC TCAATCAATA GTAAGATTGC TAAATCGCTG ATGGATTCCG CCAAGAAGGT ATATGGTGTT GATCCTGAGG TCATACCCAA CAGTTACGGT ACTGGGCCCA TGGAGTCCTT CGCAAGGATT TTGCAGAATA ATCAAATAGC CGATGGAGTT GGAGTGGAGC ATCCGGGCTC AAATATTCAT TCCTTTAACG AGAACATATA CGTAGATGAT TATATGAAGG CTAAGGAGTG GATGAAGAGT TTCGTGAGGC ACTTAGCTCA GCCTTAG
|
Protein sequence | MNSIQDLIEF ARIDTTSAKG KGEEGAKFIR DYMQEHGIEA RIIRHRSKNP YVYGEINVGA TKTLLIYNHY DVQPAEPLDK WNSDPFDPVI KDGKLVGRGV GDDKGSLMAR LQALIEMGKP PMNIKFIYEG EEEIGSPNID LFLAEHRELL ASDYVLWEGA GRGSSGAPTI VLGVKGLLYV EISVRTQKDL HSMYAPVAKN PAWELVYLLS SLKSGGRVNL PGFYDKVKWL TEEERRYLKG NKRSMEDALG QELPDDFQRR LVEEPTCNIA GLYSGYTGEG SKTVIPSYAM AKLDFRLVPD QDPDEILKIL QDHLKGVDIK VWGKVRPYRT SINSKIAKSL MDSAKKVYGV DPEVIPNSYG TGPMESFARI LQNNQIADGV GVEHPGSNIH SFNENIYVDD YMKAKEWMKS FVRHLAQP
|
| |