Gene Information |
Locus tag | Msed_0050 |
Symbol | |
ID | 5105189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 45010 |
End bp | 46110 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640505945 |
Product | amidohydrolase |
Protein accession | YP_001190151 |
Protein GI | 146302835 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
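The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming 1-based inclusive coordinates (which is consistent with the listed values) and an unspliced CDS ending in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(45010, 46110)
print(length)                     # 1101
print(protein_length_aa(length))  # 366
```

Both results agree with the Gene Length and Protein Length fields of this record.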
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.352004 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.512598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGGAAGCTT CTTCAAGAAT AGTAAACGTA GAATATGCTC TACTGGGTCA AGATTTAGAA CTGGTACACA AAATTCACCT GGAGATCAGT GACGGGATAA TCTCCCACAT AGGAAAGGGA TGGGATGTTA AGGGCGAATC CTACCCTAAT TCCCTTCTCA TGCCTGGGCT GGTAAACTCC CACGTTCATA CCTTCGATGG GATCGCACCT GAGTTAGGAT GGAACCTAAC CCTTAAGGAG GTGGTAGGCG ACCCTCATAG CGAGAAGTAT AGAGTACTCT CCCTGAGGAG CCTCCCTGAG TTAAGGGCCT CAACCCTTAA CTTCCTTAAC AGGTCCTTGG AGCTTGGGAT TTTCACTGTA ATCGACTTCA AGGAAATGGA TATCAATGGT GCAAAAATCT CGAAAGAGGT TAAGGAAATC TCACCCATTA ACTACGTGAC TCTGGGGAGA TTGGACGGTG AGATAACAAG GGATAGACTT GAAATCCTCA AGGAGTTGGT GGACGGGTAT GGGGTAAGTA GCGTATCCAT AGGGATGGAA AAGCTTAGCC TCATTCGAGA GGTATTTAGG GATAAGATGA CCGCTATCCA TGTTTCTGAA ACGCTTAGGC ATAACTTGGC CTCAGACCTT GAGACCTCTC TTTCCACTCT TAAACCTGAT ATGGTGGTTC ATGGCATCCA TCTATCTGAG GAGGAGATGG AACTCTTGGC AGAAACGGAC ACCAAACTGG TTATATGTCC TCGAAGTAAC CTATGGTTCT CAACGGGTAT TCCCAACATT CCCATGATGA TAAGAAAAGG GGTTAGACTA CTAATCGGAA CCGACAACGC CGGGATCACC GATCATGATC TCTGGAAGGA GTTAGAGGTT GCATTACTTC TATCGCGACT TCTGGATCCA GGAAGTGATT TTTCCAGGGA TATCCTTAAG TCTGCTACCG TTAACCCTGG AAAAGGGGTT TATCCAATAG AGGAGGGAAA TAGGATGACA GGAATAATCA TGGGGCCACT CCCCAGGTTC GAAGTCTCAA ATAACAGGTA TATGGCACTA ATCAAGGAAC CTGGAAAAAT AATTAGAGTC TTGGGGCTAC CCAAAATCTA A
Protein sequence | MEASSRIVNV EYALLGQDLE LVHKIHLEIS DGIISHIGKG WDVKGESYPN SLLMPGLVNS HVHTFDGIAP ELGWNLTLKE VVGDPHSEKY RVLSLRSLPE LRASTLNFLN RSLELGIFTV IDFKEMDING AKISKEVKEI SPINYVTLGR LDGEITRDRL EILKELVDGY GVSSVSIGME KLSLIREVFR DKMTAIHVSE TLRHNLASDL ETSLSTLKPD MVVHGIHLSE EEMELLAETD TKLVICPRSN LWFSTGIPNI PMMIRKGVRL LIGTDNAGIT DHDLWKELEV ALLLSRLLDP GSDFSRDILK SATVNPGKGV YPIEEGNRMT GIIMGPLPRF EVSNNRYMAL IKEPGKIIRV LGLPKI
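The sequences above are printed in space-separated blocks of 10 characters. As a rough sketch of how such a field might be cleaned up and checked against the Gene Length and GC content fields, the Python helpers below strip the grouping whitespace and compute the G+C fraction. The helper names are hypothetical, and the short input is only the first two blocks of the gene sequence, used for illustration.

```python
def normalize(seq: str) -> str:
    """Remove the whitespace used for 10-character grouping and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


# First two blocks of the gene sequence, for illustration only.
head = "TTGGAAGCTT CTTCAAGAAT"
print(len(normalize(head)))  # 20
print(gc_fraction(head))     # 0.35
```

Passing the full Gene sequence field to the same helpers gives a length and GC fraction that can be compared against the Gene Length and GC content values listed in the record.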