Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2026 |
Symbol | |
ID | 5105248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1954244 |
End bp | 1955467 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507914 |
Product | amidohydrolase |
Protein accession | YP_001192090 |
Protein GI | 146304774 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.761901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGAGTAT CAAATAGAAC GTATACACTG AGGAATTGCG CCTTTGCCGT TGATTACAGT CACGTCGAGG GGCCAACGAA CATAGTCGTC GAAGACGGTT TCATTAAGCA CGTTGGTAAG GAGGTTGAAG GAGACGAGTT GGAGTGCAGT GAGTACGTGG TAATGCCTGG TTTGGTGAAT GCCCATACTC ACTCAGCCAT GACAGTCCTG AGGGGTGTAT TTGACGACGG GGAACTTCAC GAGTGGTTAG CTCGAATGTG GGACGAAGAA AGGAAGTTAA CCAGGGAGAT TATGGCTGTA GGTTCAGAAA TCGCAGTGAT TGAGATGATC TCCTCTGGGA CAACGGCCTT CGTTGATATG TACTTCAACC CTGACCAGAT AAGGGATATC TCCACTCAGT ATGGGATTAG GGCGAGAGCG GGCCCAACTC TCATGAAAGA CAAGAGTGTC GATGAAACCG TAAGGGAACT ACGTGCACTG GGGGAAAGTG AGTTCTTTAG GCCCATCGTT AACGTCCACA GTCTCTATGC CACGGACCTT CAGAAGCTTA GGGAGCTAAG AGATAACCTG AACCGAGGGT ATCATCTCCA CATTCATCTT TCAGAAACAA GGGAAGAGGT CTTCCAGATA AAGAGAAGAT ATGGGATGTT CCCGGTGGAG CTCATTCACA GGGAAGGTCT AACGGAACGT GTGCATGGTG TACATCTAGG CTGGATAACC TCATGGGAAC TCAACTATCT GAGAAGTTCC ATTGCGGTCA CTCATTGTCC AACGTCTAAC ATGAAGCTTG CCACCGGAGG GGCTTTTCCC ATGAAGGAGG CGTTGACTCA AGGACTTAAT GTAACCATTG GGACAGATGG TGCAGCGAGC AATAACTCCC TCAACATGTT TCAAGAGATG AAAATGGCAG TCTTGCTACA GAGACATAAT TACTGGTCCA CAGGAATAAC TGCAGTTGAC GTGTTTAGGG CATCATCAGT TAACGGGTAT AAGATGCTGG GCATACGCGG AGGGGAGATT AGGCCCGGAT ACGTGGCTGA TCTAGTCCTG CTAAGTAAGT ATGAGGTTTA TCCATTAACC AAGGAGAGAC TTCTATCTCA TCTAGTTTAC AATCCGCCAA AGGAAGTTGA GAAGGTTATA ATCCAAGGAA AGATTGTTTA TCAAAAGAAT GACTTTAGGG ATAGGTTGAA GAAACTTTTA GAGAAGTTAA GCCTTTACCT CTAA
|
Protein sequence | MGVSNRTYTL RNCAFAVDYS HVEGPTNIVV EDGFIKHVGK EVEGDELECS EYVVMPGLVN AHTHSAMTVL RGVFDDGELH EWLARMWDEE RKLTREIMAV GSEIAVIEMI SSGTTAFVDM YFNPDQIRDI STQYGIRARA GPTLMKDKSV DETVRELRAL GESEFFRPIV NVHSLYATDL QKLRELRDNL NRGYHLHIHL SETREEVFQI KRRYGMFPVE LIHREGLTER VHGVHLGWIT SWELNYLRSS IAVTHCPTSN MKLATGGAFP MKEALTQGLN VTIGTDGAAS NNSLNMFQEM KMAVLLQRHN YWSTGITAVD VFRASSVNGY KMLGIRGGEI RPGYVADLVL LSKYEVYPLT KERLLSHLVY NPPKEVEKVI IQGKIVYQKN DFRDRLKKLL EKLSLYL
|
| |