Gene Haur_1871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1871 
Symbol 
ID5733760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2212472 
End bp2213548 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content51% 
IMG OID641279015 
Product(S)-2-hydroxy-acid oxidase 
Protein accessionYP_001544642 
Protein GI159898395 
COG category[C] Energy production and conversion 
COG ID[COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACCAA TTAACCTCAT GGATTACCAG CCATTGGCCA AGCAACTGAT TGATCGATCA 
GCTTGGGATT ATATTCAGGG TGGCAGCGAT GATGAAATCA CGTTGCAGGC CAACCAAGCC
GCTTATACCA AGTTGAAGCT ACGCCCGCGA GTCTTGGTTG ATGTTAGTCA GTGCACATTA
GAAACAAGCG TGCTTGGCCA AACAATCGCA ATGCCGATTG GCATCGCCCC AATGGGTTGC
CAAGGCTTAG TTCATGCTGA AGGCGAGTGT GCTATGGCGC GAGCAGCCGA AGCAGCTCAG
ACCGTGATGA TTGCTAGTGC CATGGCCAAC TACTCGCTTG AAGCGATTGC CCAAGCCGCG
AACGGCCCCC TGTGGTTTCA ACTGTATGTG TACCGCGAAC GCCAGATTAC TGAAGCGTTA
GTACGACGGG TCGAGGCTGC TGGCTATCAG GCTTTAGTGC TCACTGTTGA TGTCCCATTT
TTGGGCCGTC GCGAGCGTGA TTTACGCAAT GGCTTTGCCT TACCGCAACA CTTGCATTTT
GCGAATTTTG CACCAACTGA TGCCGCAGGC CAACATCAAC AAACGCTTGG AGCTTCAGGG
ATTGCAACGC ATGCGGCTGG ACGCTTTGAC GCAGCATTAA CCTGGGAAGC GATTGACTGG
TTACGCTCAC TTACGCGGTT GCCAATTGTG TTGAAAGGCA TTTTAAGTGC TGAAGATGCC
CAATTGGCAG TGCAACACGG CGTTGATGGG CTGATTGTGT CAAACCATGG TGGACGGCAA
TTGGATACGG TTGCGGCAAC TATTGAATGC CTACCTGCAA TTGTTGATGC CGTCGGCTCG
ACCTGCGAAG TGTATCTTGA TGGCGGTATT CGCCGGGGAA CCGATGTGCT CAAGGCGCTC
GCCTTGGGCG CAAAGATGGT TTTTGTGGGA CGACCCTTGT TATGGGGTTT GGCGGTTGAT
GGGCAGCAAG GCGCACACCA TGTGTTGGAA TTACTCAGAT CTGAATATAG CTTAGCCTTA
GGATTAATCG GCTGTCCCCA CAGTCATCAA TTAAATCGAC ATTATATTAG CTCCTAA
 
Protein sequence
MPPINLMDYQ PLAKQLIDRS AWDYIQGGSD DEITLQANQA AYTKLKLRPR VLVDVSQCTL 
ETSVLGQTIA MPIGIAPMGC QGLVHAEGEC AMARAAEAAQ TVMIASAMAN YSLEAIAQAA
NGPLWFQLYV YRERQITEAL VRRVEAAGYQ ALVLTVDVPF LGRRERDLRN GFALPQHLHF
ANFAPTDAAG QHQQTLGASG IATHAAGRFD AALTWEAIDW LRSLTRLPIV LKGILSAEDA
QLAVQHGVDG LIVSNHGGRQ LDTVAATIEC LPAIVDAVGS TCEVYLDGGI RRGTDVLKAL
ALGAKMVFVG RPLLWGLAVD GQQGAHHVLE LLRSEYSLAL GLIGCPHSHQ LNRHYISS