Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SYO3AOP1_0026 |
Symbol | |
ID | 6331637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfurihydrogenibium sp. YO3AOP1 |
Kingdom | Bacteria |
Replicon accession | NC_010730 |
Strand | - |
Start bp | 28269 |
End bp | 29567 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642656306 |
Product | amidohydrolase |
Protein accession | YP_001930231 |
Protein GI | 188995980 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGATT TAGTAATTAA AAATGCTTGG GTATTAACAA TGGATGAAAA CTTTACAGAA TATAAAAATG GATACATTGC CATAAAAGAT GGAAAAATTG CAGAAGTAGG AGAAAACAAA GAGAATTTAA AAAGTAGAGA AGTAATTGAT GCTAATGGAA ATATTGTACT ACCCGGATTT ATAAACACAC ATACCCATGC AGCAATGACT CTGCTTAGAG GATATGGAAG CGATAATCCT TTAAAAGTGT GGCTTGAACA GTATATTTGG CCAGTTGAAG GAAAGTTTGT TAGTTATGAG TTTGTAAAAG ATGGTACAGA TATAGCATGT TATGAGATGT TAAGAAATGG TATCACATGT TTTGTTGATA TGTATTTTTA CGAAAATGCA GTTGCTGATG CTGTAAAATC CGCACATATG AGAGCTGTAT TGACCACCGG AATTCTTGAC TTTCCTACCC CCGGAGCAAA AACACCAGAC GAAGGAATCC AAAAAACCAT AGATTTTATA AGAGAATATA AAAATGATGA GTTTATATAT CCAGCAATAG GACCACATGC ACCTTACACG TGCAGCCCTT CAACGCTACA AAAATCCATG CAGGTTGCAG TAGATTATGA CGTTGTATAT CACATACACG TTGCAGAAAC TTTACATGAA GTTGAAGATA TTAAAAACAG ATATGGAGAT ACACCTGTAA AACATCTAAA CAATATCGGG GTTTTAAATG ATAGAGTTTT AGCAGCTCAT ATGGTTCATC CGACAGATGA AGAAATTGAA CTATTGGCAG AAAAAAATGT AAAGATTGCC CACTGCCCAG AGAGTAATTT AAAATTAGCA TCAGGAATTG CACCTGTTCC AAAAATGTTA GAAAAAGGAG TTATTGTTTC TTTTGGAACT GATGGAACAG CTTCAAACGA TGACCTTGAT ATTATTGGTG AGCTTTCTAC TGCTGCAAAA TTACACAAAG GATATAACTT AAATCCAACA GTTTTACCTG CAAAGCAAGT TTTAGCAATG GCAACAAGAG ATGCAGCAAA AGCAGTTAGA TTAGACAAAA AAATAGGAAG TATAGAAGTT GGTAAGTATG CAGATTTAGT AATAATTGAT ATAAATCAAC CACACTTACA GCCACTTTTT GACCCATACA TACAGATAGT CTATTCATCA AGAGGCAGTG ATGTTGATAC TGTATTAATA AATGGAAAAG TTGTAGTTAA AAACAAGGAA GTTTTAACTG TTGAAAAAGA AAGAGTGCTA TCAATAGCTA AAAAGTGGAA GGAAAAAATT TTAAGTTAA
|
Protein sequence | MFDLVIKNAW VLTMDENFTE YKNGYIAIKD GKIAEVGENK ENLKSREVID ANGNIVLPGF INTHTHAAMT LLRGYGSDNP LKVWLEQYIW PVEGKFVSYE FVKDGTDIAC YEMLRNGITC FVDMYFYENA VADAVKSAHM RAVLTTGILD FPTPGAKTPD EGIQKTIDFI REYKNDEFIY PAIGPHAPYT CSPSTLQKSM QVAVDYDVVY HIHVAETLHE VEDIKNRYGD TPVKHLNNIG VLNDRVLAAH MVHPTDEEIE LLAEKNVKIA HCPESNLKLA SGIAPVPKML EKGVIVSFGT DGTASNDDLD IIGELSTAAK LHKGYNLNPT VLPAKQVLAM ATRDAAKAVR LDKKIGSIEV GKYADLVIID INQPHLQPLF DPYIQIVYSS RGSDVDTVLI NGKVVVKNKE VLTVEKERVL SIAKKWKEKI LS
|
| |