Gene Sare_0496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0496 
Symbol 
ID5703301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp565768 
End bp566916 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content76% 
IMG OID641270022 
Product(S)-2-hydroxy-acid oxidase 
Protein accessionYP_001535416 
Protein GI159036163 
COG category[C] Energy production and conversion 
COG ID[COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.150259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.194541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCCACG AACCGGCGGT GGCCCACGAA CCGGCGGTGG CCCACGAACC TGCGGTGGCG 
GCCGGGATCG CCAGCGTGGA CGACCTGCGC CGACTGGCCC GCGCGCGGCT CCCCGGCCCC
GTCTGGGACT ATGTGACCGG CGGCGCCGGG GAGGAACGGA CGGTCCGGGC CAACCGCGAC
GCGTTCCGCC GACTCACGCT GCTGCCCCGG GTGCTCGTCG ACGTGGCCGC GCGGGACCCG
CGCACCACCG TCCTGGGCAC CGGGGTGGCC GCGCCGGTCG GCATCGCGCC GACGTCCTAC
CAGAGCCTCG CGCATCCGGA CGGGGAGTTG GCCACCGCCC GGGCAGCCGG CTCCCGGGGC
CTGCTCGACG TGGTGAGCGT CTTCTCCAGC GTGTCGCTGG AGGATGTCGC CGAGGTGGCC
ACCGGACCGC TGTGGTTCCA GCTCTACTGC CTGCGGGACC GGGGGGTGAC CCGGGAGCTG
GTGCAGCGGG CCGCCGCGGC CGGCTACCGC GCGCTCGTGC TCGGCGTCGA CCTGCCGGTG
ATCGGCTACC GCGACCGGGA CATCCGAAAC CGGTTCCAAC TGCCCCCCTC GGTGGCTCCG
GTCAACCTGC CGACCAGAGT CGCCCCCGGC GGCAGCGTCC TGGTCGAGCT CAACCGGGCC
CTGGTGGATC CGGCGCTGAC CTGGCGGGAC GTCGAGTGGA TCCGGGAGAT CAGCCCGCTG
CCGGTCGTGG TCAAGGGGAT CGTCGCGGCC GACGACGCCG ACCGGGCCGC GCGTATCGGC
GCCGACGCGG TGCTGGTCTC CAACCACGGC GGGCGGCAGC TGGACGGCGC TCCGGCGAGC
ATCACCGCGC TGCCGGACGT GGTGAGCGTG GTGGCCGACC GGTGCGAGGT GTACCTCGAC
AGCGGCGTCC GCCGCGGCAC CGACGTGCTG GCGGCGGTGG CCCGTGGCGC CCGGATGGCG
TTCGTCGGCC GCCCGGTCAT GTGGGGGCTG GCCGCCGGCG GAGCGGACGG GGTCCGCGCC
GCCCTCGACC TGTACCTGAC CGAACTCGAC CTGGCCATGG CGGTGTGCGG GTGCCCGGAC
GTGCCGAGTA TCGGGCCGCA CCTGCTCGGG CCGATCGACC GGCCGGGCGA TCGGCCGGCC
GACAGGTAG
 
Protein sequence
MAHEPAVAHE PAVAHEPAVA AGIASVDDLR RLARARLPGP VWDYVTGGAG EERTVRANRD 
AFRRLTLLPR VLVDVAARDP RTTVLGTGVA APVGIAPTSY QSLAHPDGEL ATARAAGSRG
LLDVVSVFSS VSLEDVAEVA TGPLWFQLYC LRDRGVTREL VQRAAAAGYR ALVLGVDLPV
IGYRDRDIRN RFQLPPSVAP VNLPTRVAPG GSVLVELNRA LVDPALTWRD VEWIREISPL
PVVVKGIVAA DDADRAARIG ADAVLVSNHG GRQLDGAPAS ITALPDVVSV VADRCEVYLD
SGVRRGTDVL AAVARGARMA FVGRPVMWGL AAGGADGVRA ALDLYLTELD LAMAVCGCPD
VPSIGPHLLG PIDRPGDRPA DR