Gene Haur_2864 details


Gene Information

Locus tag: Haur_2864
Symbol:
ID: 5734735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3634876
End bp: 3635967
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 53%
IMG OID: 641280007
Product: sulfite oxidase
Protein accession: YP_001545630
Protein GI: 159899383
COG category: [R] General function prediction only
COG ID: [COG2041] Sulfite oxidase and related enzymes
TIGRFAM ID:
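
The length fields in this record are consistent with its coordinates: the gene spans End bp minus Start bp plus one bases, and the protein length equals the codon count minus the stop codon. A minimal sketch of that check, assuming the coordinates are inclusive (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3634876
end_bp = 3635967

# Assumption: both coordinates are inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the final codon is the stop codon
# and does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1092 363
```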


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGATA GTCTGCCTTC ATTTGATGTT GCCAAAAGTG CGACGATGCA GGTGGTTCAG 
GCCGAGCCAT TTAATGCTGG CACACCCTTA GATGAGCTGG CCAGCAGCTA CATTCAACCC
ACCAGCCAAT TTTTTGTGCG TACCCATGGC ACGATTCCTA GCCTCGACCC TGAAACCACC
ACAATTCAAC TGCAGGGCTT ATTGGCTCAA CCATTGAGCA TCAGCATTGC CGACATTAAG
CAACAATTGC CCTATGTTGA GCAGGTTTCG ACCTTGCAAT GTGCTGGCAA CCGCCGCCAA
GAGATGCACG CCTACCAGCC AATTTATGGC GAATTGCCCT GGGGGGCCAA TGGCTTGAGC
ACAGCAAATT GGGGTGGTGC ACCCTTGCGA TCACTGCTAG AGCGCTGCGA GATTGATTCG
ACTGCCCTGC ATTTGGTGTT TGAAAGCTAC GATCAGGTTG AGCGCCACGG CCAAACCTTT
GGCTATGGCG GCTCGATTCC ACTCAACGAG CCGATGATCG AGCATGCCTT GTTGGCCTAC
ACGATGAACG GCACAGCCTT GCCAGCGCTG CATGGTGGCC CGTTACGTTT GGTTATTCCT
GGAATTGTTG GTGCTCGCAG CGTCAAATGG TTGCGCTCGA TTGAATTCAG TAGCGAGCCA
TCACACAACT ATTTTCAGCG CCGCGCCTAT CGTTTGGCCC AAAGCAGCGA GCCTGAAGCT
TGGCAAAACG CGCCAATGTT GCACGAATTG CCAGTTAATG CTGTGCTGTA TTTGCCGACA
GCTGACCAAA CCTTGCTGGC AGGCACGATC ACCCTAGCGG GCTATGCGAT TACTGGGGGC
CAAGCGCTGG TTGAACAGGT CGAAATTTCG CTTGATCACG GCCAACATTG GCAACATGCT
CGCTTGATCG ACCCACCAAG GCTTGGTTGT TGGAGTCGCT GGCAAATCGA GCTAGAACTG
ACGGCGGGTG AATATCAATG TTGGGTTCGC GCCACCGATT CGCTTGGTCA ACAACAACCA
GAGCAGCCAG CTTGGAATGT CAAAGGCTAT CATCACAACG CAATTCAACG AATCCAACTA
AGCGTTTGCT AG
 
Protein sequence
MRDSLPSFDV AKSATMQVVQ AEPFNAGTPL DELASSYIQP TSQFFVRTHG TIPSLDPETT 
TIQLQGLLAQ PLSISIADIK QQLPYVEQVS TLQCAGNRRQ EMHAYQPIYG ELPWGANGLS
TANWGGAPLR SLLERCEIDS TALHLVFESY DQVERHGQTF GYGGSIPLNE PMIEHALLAY
TMNGTALPAL HGGPLRLVIP GIVGARSVKW LRSIEFSSEP SHNYFQRRAY RLAQSSEPEA
WQNAPMLHEL PVNAVLYLPT ADQTLLAGTI TLAGYAITGG QALVEQVEIS LDHGQHWQHA
RLIDPPRLGC WSRWQIELEL TAGEYQCWVR ATDSLGQQQP EQPAWNVKGY HHNAIQRIQL
SVC
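
The protein sequence corresponds to translating the gene sequence with translation table 11, which assigns the same amino acids as the standard genetic code and differs only in its permitted start codons. As a rough sketch, the first and last blocks of the gene sequence can be checked against the protein sequence; the `translate` helper and the compact codon table below are written for illustration and are not part of the record:

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Amino-acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
first_blocks = "ATGCGAGATA GTCTGCCTTC ATTTGATGTT"
last_codons = "AGCGTTTGCT AG"

print(translate(first_blocks))  # MRDSLPSFDV
print(translate(last_codons))   # SVC
```

Both results match the start and end of the protein sequence listed in the record. The final line can be translated on its own because the 18 preceding lines hold 60 bases each, so it begins in frame.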