Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1007 |
Symbol | |
ID | 4069772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1271077 |
End bp | 1272033 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983014 |
Product | putative sulfite oxidase subunit YedY |
Protein accession | YP_590084 |
Protein GI | 94968036 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0460692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0531219 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGATCA AGAAGCAGGG TGAGATTCCG TCGTCGGAAA TTACGGACAA AAAGGTGTAC CTGAACCGTC GTGCGTTTAT CGGCGGGGCG GCTGCGGCTG GGGCGGCGAT TGCGGTGGGA TTCAAGGCGG CGGGGCTATT CGATCCGGCG CTGCACGCGA GCGCGAATGC GAAGTTGCAG TTCAAGCCGA GCAGCTTCAG CACGAACGAG AAGCAGACGC CGCTGAACGA CGTGACCCAC TACAACAACT ATTACGAGTT CGGCACCGAC AAAACCGATC CGGCAGACGA GGCCAAAAAT TTCAAACCAA CGCCGTGGAA GGTGAAGGTG GAAGGCCTGG TCAAGAAGGC GCAGACCTTC GACATTGACA CGTTGCTGAA GATCCCGCTG GAGGAGCGCG TGTATCGCAT GCGCTGCGTC GAGGGATGGT CGATGGTGAT TCCGTGGATC GGATTTCCGT TGTCGGCGCT TTTGAACCAA GTGGAAGTGC AGCCGAAGGC GAAGTTCGTG GAGTTTACCT CGCTGCTCGA TCCGAATCGC ATGCCGGGGC AGCGGAGGGC GGTGCTGGAA TGGCCGTATG TGGAGGGGTT GCGGCTGGAT GAGGCGATGC ATCCGCTGAC GACGATGGTG GTAGGCCTGT ATGGCGAGAC GCTTCCGAAC CAGGATGGCG CGCCGTTGCG ATTAGTGGTG CCGTGGAAGT ATGGGTTCAA GGGGATCAAG GCGATCGTGA ACATCAAGCT GGTGGAGAAA CAGCCGACCT CGACGTGGAC GCAGGCGGCG TCAAACGAAT ATGGTTTCTA CTCCAATGTG AATCCGAACG TGGACCATCC GCGATGGAGC CAGGCGAAGG AGCGGAGGAT CGGGGAGTTT TTCAAGCGTC CGACGCTGAT GTTTAACGGG TACGGCGACC AGGTGGCGAG TTTGTATTCG GGCATGGATT TGAAGAAAAA CTTCTAA
|
Protein sequence | MLIKKQGEIP SSEITDKKVY LNRRAFIGGA AAAGAAIAVG FKAAGLFDPA LHASANAKLQ FKPSSFSTNE KQTPLNDVTH YNNYYEFGTD KTDPADEAKN FKPTPWKVKV EGLVKKAQTF DIDTLLKIPL EERVYRMRCV EGWSMVIPWI GFPLSALLNQ VEVQPKAKFV EFTSLLDPNR MPGQRRAVLE WPYVEGLRLD EAMHPLTTMV VGLYGETLPN QDGAPLRLVV PWKYGFKGIK AIVNIKLVEK QPTSTWTQAA SNEYGFYSNV NPNVDHPRWS QAKERRIGEF FKRPTLMFNG YGDQVASLYS GMDLKKNF
|
| |