Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2864 |
Symbol | |
ID | 5734735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3634876 |
End bp | 3635967 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280007 |
Product | sulfite oxidase |
Protein accession | YP_001545630 |
Protein GI | 159899383 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGATA GTCTGCCTTC ATTTGATGTT GCCAAAAGTG CGACGATGCA GGTGGTTCAG GCCGAGCCAT TTAATGCTGG CACACCCTTA GATGAGCTGG CCAGCAGCTA CATTCAACCC ACCAGCCAAT TTTTTGTGCG TACCCATGGC ACGATTCCTA GCCTCGACCC TGAAACCACC ACAATTCAAC TGCAGGGCTT ATTGGCTCAA CCATTGAGCA TCAGCATTGC CGACATTAAG CAACAATTGC CCTATGTTGA GCAGGTTTCG ACCTTGCAAT GTGCTGGCAA CCGCCGCCAA GAGATGCACG CCTACCAGCC AATTTATGGC GAATTGCCCT GGGGGGCCAA TGGCTTGAGC ACAGCAAATT GGGGTGGTGC ACCCTTGCGA TCACTGCTAG AGCGCTGCGA GATTGATTCG ACTGCCCTGC ATTTGGTGTT TGAAAGCTAC GATCAGGTTG AGCGCCACGG CCAAACCTTT GGCTATGGCG GCTCGATTCC ACTCAACGAG CCGATGATCG AGCATGCCTT GTTGGCCTAC ACGATGAACG GCACAGCCTT GCCAGCGCTG CATGGTGGCC CGTTACGTTT GGTTATTCCT GGAATTGTTG GTGCTCGCAG CGTCAAATGG TTGCGCTCGA TTGAATTCAG TAGCGAGCCA TCACACAACT ATTTTCAGCG CCGCGCCTAT CGTTTGGCCC AAAGCAGCGA GCCTGAAGCT TGGCAAAACG CGCCAATGTT GCACGAATTG CCAGTTAATG CTGTGCTGTA TTTGCCGACA GCTGACCAAA CCTTGCTGGC AGGCACGATC ACCCTAGCGG GCTATGCGAT TACTGGGGGC CAAGCGCTGG TTGAACAGGT CGAAATTTCG CTTGATCACG GCCAACATTG GCAACATGCT CGCTTGATCG ACCCACCAAG GCTTGGTTGT TGGAGTCGCT GGCAAATCGA GCTAGAACTG ACGGCGGGTG AATATCAATG TTGGGTTCGC GCCACCGATT CGCTTGGTCA ACAACAACCA GAGCAGCCAG CTTGGAATGT CAAAGGCTAT CATCACAACG CAATTCAACG AATCCAACTA AGCGTTTGCT AG
|
Protein sequence | MRDSLPSFDV AKSATMQVVQ AEPFNAGTPL DELASSYIQP TSQFFVRTHG TIPSLDPETT TIQLQGLLAQ PLSISIADIK QQLPYVEQVS TLQCAGNRRQ EMHAYQPIYG ELPWGANGLS TANWGGAPLR SLLERCEIDS TALHLVFESY DQVERHGQTF GYGGSIPLNE PMIEHALLAY TMNGTALPAL HGGPLRLVIP GIVGARSVKW LRSIEFSSEP SHNYFQRRAY RLAQSSEPEA WQNAPMLHEL PVNAVLYLPT ADQTLLAGTI TLAGYAITGG QALVEQVEIS LDHGQHWQHA RLIDPPRLGC WSRWQIELEL TAGEYQCWVR ATDSLGQQQP EQPAWNVKGY HHNAIQRIQL SVC
|
| |