Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0454 |
Symbol | |
ID | 5705451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 522140 |
End bp | 523321 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641269979 |
Product | molybdopterin biosynthesis-like protein MoeZ |
Protein accession | YP_001535374 |
Protein GI | 159036121 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.427252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000325534 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGTCGCTGC CCCCGCTCGT CGAGCCCGCC GCCGAGCTGA CCGTTGACGA GGTCCGTCGC TACTCACGCC ACCTGATCAT CCCGGATGTC GGGGTCGAGG GCCAGAAGCG GCTGAAGAAT GCCCGGGTGC TCTGTGTCGG TGCCGGTGGT CTCGGCTCGC CGGCGCTGAT GTACCTGGCC GCCGCCGGGG TGGGTACGCT CGGCATCGTC GATTTCGACA CTGTCGACGA ATCCAACCTC CAGCGTCAGA TCATCCATGG CGTGTCCGAT GTCGGCCGGG CCAAGGCGGA ATCGGCCGCC GCGACCATCC GAGAGATCAA CCCGCTGGTT GCGGTGGAGA TCCACGACGT GGCGTTGGAT CGGGACAACG TCAAAGACAT CTTCGCCCGA TATGACCTGA TCGTGGACGG CACCGACAAC TTCGCCACCC GGTACATGGT CAACGACGCG GCTGTGCTGC TCGGAAAGCC GTACGTGTGG GGCTCGATCT ACCGCTTCGA CGGGCAGGCG TCGGTGTTCT GGGCCGAGCA CGGTCCCTGC TATCGCTGCC TCTACCCGGA GCCTCCGCCG CCCGGTATGG TGCCGTCCTG CGCGGAGGGC GGCGTGCTCG GCGTGCTGTG CGCGTCGATC GGCTCGATCC AGGTGAACGA GGCGATCAAG CTACTCGCCG GCATCGGTGA GCCGCTGGTC GGTCGGCTCA TGGTCTACGA CGCACTGGAG ATGAGCTACC GCAAGATCAA GGTGCGGAAG GACCCGAACT GCGCCCTCTG CGGCGAGAAC CCCACGGTCA CCGACCTGCT GGCGGACTAC GAGGACTTCT GCGGCGCGGT GTCGGCTGAA GCCCAGGAGG CGGTGATCGA CGCGACGATC ACCGCGGGTG AACTGAAGGC GTGGCAGGAT GCCGGCAAGG ACTTTCTCCT GGTTGACGTG CGGGAGCCGG CCGAATTCGA GATCGTGCGG ATTCCGGGCG CCACGCTGAT TCCCAAGGGC GAGATCATCT CTGGCGAGGC GCTGGTCAAG CTTCCGCAGG ACCGCCAGAT CGTGCTGCAC TGCAAATCAG GTGTCCGCTC CGCGGAGGCG CTCGCCGCGC TGAAAGCCGC CGGCTTCCGG GACGCGGTGC ACGTCCAGGG CGGCGTTCTC TCCTGGATCA AGCAGGTCGA TCCGTCGCTG CCCGCGTACT GA
|
Protein sequence | MSLPPLVEPA AELTVDEVRR YSRHLIIPDV GVEGQKRLKN ARVLCVGAGG LGSPALMYLA AAGVGTLGIV DFDTVDESNL QRQIIHGVSD VGRAKAESAA ATIREINPLV AVEIHDVALD RDNVKDIFAR YDLIVDGTDN FATRYMVNDA AVLLGKPYVW GSIYRFDGQA SVFWAEHGPC YRCLYPEPPP PGMVPSCAEG GVLGVLCASI GSIQVNEAIK LLAGIGEPLV GRLMVYDALE MSYRKIKVRK DPNCALCGEN PTVTDLLADY EDFCGAVSAE AQEAVIDATI TAGELKAWQD AGKDFLLVDV REPAEFEIVR IPGATLIPKG EIISGEALVK LPQDRQIVLH CKSGVRSAEA LAALKAAGFR DAVHVQGGVL SWIKQVDPSL PAY
|
| |