Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3877 |
Symbol | |
ID | 5901339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4197045 |
End bp | 4197965 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641564399 |
Product | putative sulfite oxidase subunit YedY |
Protein accession | YP_001685501 |
Protein GI | 167647838 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCC GCCACGCCCC CGACCTGACC GACAACGACG TCACGGGCCA CGGCCTCTAT CTGCGGCGAC GCGACTTCAT CGGCGGGGCG GCTGGGCTGG GCCTGATGGC GGCGGCCGGG TCGGCCAGCG CGCGCGGGCT GACCTACGGC CCGGGTTTCT CGACCACCGA GGCCCCAACG CCGAAGAAGG ACATCACCAG CTACAACAAC TTCTACGAAT TCGGGGTCAA CAAGGAGGAC CCGTCGGAGA ACGCGGGCTC GCTGAAGACC CGGCCCTGGA CCGTCCGCGT CGATGGCGAA TGCGAGAAGC CGGCGACCTT CGCGATCGAC GACCTGATCA AGGGGAACAA GCTGGAGGAG CGTATCTACC GCATGCGCTG TGTCGAGGGC TGGTCGATGG TCATTCCATG GGTGGGCTTC CCGCTCAAGG ACCTGATCGC CCAGGTGAAG CCGACCTCGA AAGCCAAGTT CGTGGCCTTC GAAACCCTGA TGCGGCCGTC GGAAATGCCT GGTCAGCGGT GGGACACCCT GCAGTGGCCC TATCGCGAGG GCCTGCGCAT CGACGAGGCG GTCCACCCGC TGGCGATCCT GGCCGTCGGC CTGTACGGCG ACGTCCTGCC CAACCAGAAC GGCGCGCCGC TGCGGCTGGT CGTGCCGTGG AAGTATGGCT TCAAGGGCAT CAAGTCGATC GTGCGGATCA GCCTGGTCGA GACGATGCCG GCCACCGCCT GGAACGTGCT GGCGCCGCGC GAATACGGCT TCTATTCCAA CGTCAATCCG GCCGTGGACC ACCCGCGCTG GTCGCAGGCC ACCGAGCGCC GCATCGGCGA GTTCCGCCGC CGCGAGACCC TGCCGTTCAA CGGCTATGGC CAGTATGTCG CCGACCTCTA TCGCGGCATG GACCTGAAAC GGAACTTCTG A
|
Protein sequence | MLIRHAPDLT DNDVTGHGLY LRRRDFIGGA AGLGLMAAAG SASARGLTYG PGFSTTEAPT PKKDITSYNN FYEFGVNKED PSENAGSLKT RPWTVRVDGE CEKPATFAID DLIKGNKLEE RIYRMRCVEG WSMVIPWVGF PLKDLIAQVK PTSKAKFVAF ETLMRPSEMP GQRWDTLQWP YREGLRIDEA VHPLAILAVG LYGDVLPNQN GAPLRLVVPW KYGFKGIKSI VRISLVETMP ATAWNVLAPR EYGFYSNVNP AVDHPRWSQA TERRIGEFRR RETLPFNGYG QYVADLYRGM DLKRNF
|
| |