Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2134 |
Symbol | |
ID | 5734036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2682249 |
End bp | 2683157 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279275 |
Product | putative sulfite oxidase subunit YedY |
Protein accession | YP_001544902 |
Protein GI | 159898655 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA TTCCTTCCTC GGAGATAACA CCCGAAGCGT TGTTCTACTC ACGTCGCCAA TTTATCAAGG GAGCGGCAGC CTTGGTTGGT AGCGCCACCG TTCTAGCCGC CTGTGGCAGC GAGACCAGTG AAACCAGCGA CCTGCCAGCG GGCGTTGATG TGCAAACACC CTATGAATCA ATCATCAATT ACAACAATTT TTATGAATTT ACCACCAACA AAGAAGCAGT TGCCGATGCT TCGAAGAATT TTACGACCAA CCCATGGACA GTCGAAGTCA GCGGCTTGGT CAACAAACCG CAAACCTTTG CAATCGAAGA TTTGCTCAAG CAATTTACCC AAGAAGAACG GGTGTATCGG TTGCGCTGTG TTGAAGGCTG GTCGATGGTC ATTCCATGGA CGGGTTTTAG CCTCGCTGGC TTGTTGAAAC AAGTCGAGCC AACCAGCGCC GCCAAATATG TGCGCTTTGA AACGGTGATG CGCCCCGAAG AAATGCCAGG CCAAAGCAGT AGTTATTACA CATGGCCGTA TGTCGAGGGT TTGCGGCTCG ATGAAGCCAT GCACGATTTA ACCTTGATGG CAACTGGCGT GTATGGCAAG CCAATTTTGC CCCAAAATGG CGCACCCTTG CGGCTGGCAG TGCCATGGAA ATATGGATTC AAAAGCATCA AATCAATCGT TAAAATTGAG CTGGTGGCCG AGCAACCAAC CAGTTTATGG ATGAACGCAG CACCTGATGA ATATGGGTTT TATGCCAATG TTAATCCCGA TGTGCCGCAT CCACGCTGGT CGCAAGCCAC CGAACGCCGC ATTGGTGAGG CTGGTCGGCG ACGAACCTTG GCCTTCAATG GTTATGCCGA TGAAGTTGCT GCGCTCTACA AAGATCTGGA TTTGAAAGCT AACTATTAA
|
Protein sequence | MKKIPSSEIT PEALFYSRRQ FIKGAAALVG SATVLAACGS ETSETSDLPA GVDVQTPYES IINYNNFYEF TTNKEAVADA SKNFTTNPWT VEVSGLVNKP QTFAIEDLLK QFTQEERVYR LRCVEGWSMV IPWTGFSLAG LLKQVEPTSA AKYVRFETVM RPEEMPGQSS SYYTWPYVEG LRLDEAMHDL TLMATGVYGK PILPQNGAPL RLAVPWKYGF KSIKSIVKIE LVAEQPTSLW MNAAPDEYGF YANVNPDVPH PRWSQATERR IGEAGRRRTL AFNGYADEVA ALYKDLDLKA NY
|
| |