Gene Sare_2802 details

Gene Information

Locus tag: Sare_2802
Symbol:
ID: 5706158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3181995
End bp: 3183503
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 68%
IMG OID: 641272258
Product: aldehyde dehydrogenase
Protein accession: YP_001537628
Protein GI: 159038375
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
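
The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch, assuming 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start/end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3181995, 3183503)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1509 bp) and Protein Length (502 aa).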


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.163668
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000224903
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGTTCTCT CGATCAAGTC ACGACGGGAC GGGGTGGGAC TGGGCCGGGG GCACCTGCTT 
GTCGCGGGTG CGTGGCGACC GTCGCGCGAT GGTGGGACGT GGGTGCACCT GCACCCGGCG
ACGGGTGAGG AGGTGGGGGA GTTCGCGATC GCGGACCCCG CCGACGTCGA CGCTGCTGTC
CGGGCTGCCC GACAGGCGTT CGACGAAGGG CCCTGGCCCC GCAGCCGGGC GAGAGAACGC
ATCCGGGTCC TGCGACGCGC CGCAGACCTG ATCCGCGAGC ACTCCGACGA ACTGCTCGCG
CTCCAGGCGC TCGACAACAG CGTCCCGTTG AGCTTCAGCG GTGCCTACGT GATGTCGGCC
GAGTGCGCGG CCGACGTCTT CGACCACCAC GCGGGCTGGA TCGACAAGCT TGGTGGTGAG
ACGCTACCGC CCTACCAAGG GGGTGACCAC CTGGTATTCA CCCTGCGCGA GCCGATTGGG
GTGGTGGCGG CGGTCATTCC GTGGAACGCC CCGCTCTTGT TGGCGGCGCA GAAGCTCGCG
CCGGCGCTGG CCTCCGGGTG CACGGTCGTG CTGAAGCCGT CAGAGTACGC CACCTTCGCG
GTACTGCGGT TGGTGCAGAT TCTCGACGAG GCGGGAGTGC CACCGGGTGT GCTCAACGTG
GTGACCGGGC CCGGCGAATC GACCGGTGAG GCGTTGATCA CCCATCCGAT GGTAGACAAG
ATCACCTTCA CCGGCAGTCG TGCTGTGGGT CGCCGTATCC TGCACGCCGC AGCCGACGGA
ATCACCAAGG TGAGTCTGGA ACTCGGTGGG AAGAGCCCAT CGATCGTATT CGCAGACGCC
GATGTCTACG CGGCGGCGGC GATGACCATG GGCACCGTCA CCGTAGGACT GTCTGGTCAG
GTGTGTGTGG CCCACAGTCG GGCACTGGTC CAGCGCGAGG TTTACGACGA GTTCGTGTCG
ATCGCCACCG GGGCGACCGC GCTCGCGTGC TACGGGGATC CGTTCGACGC CGAGACCACC
GCCTCACCGC TGATCAACGG ACGACAGCTC GACCGGGTGC TCGGCTATGT CGCACAAGGC
CAGGCGGAGG GCGCTCGCCT GGTGTGCGGG GGCGAACGGG TTGGGGGAGA GCTGGCTGCG
GGCAACTTCG TGACCCCGGC GCTCTTCGCC GACGTGGCCA GCGACATGAC CATCGCCCGT
GAGGAGATTT TCGGTCCCGT GCTGGGTGTG ACTCCGTTCA CCGACGAGCA GGAGGCGATA
CGCCTGGCGA ACGACACCGA GTATGGACTC GCCGCCATGG TGTGGACCGC GGATGTGAAG
CGGGCCATGC GCCTGACCCG AGCCGTGCGG GCGGGAACCA TCGGCGTCAA CGGCTACCAG
GTGGAGCCAC ACGCGGCCTT CGGTGGATTC GGTCAGTCCG GGCTCGGACG CGAGGGCGGG
CGAGGCTCGG CAGAGGCTTT CACCGAGGTG AAGACCGTGC TGGTGCCGAC CACCGAGGAG
CTCATGTAG
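
The GC content field can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch; the function name `gc_percent` is hypothetical, and the short input string is an illustrative placeholder rather than the gene itself.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Illustrative input only; paste the full gene sequence to check the record.
print(gc_percent("GCGC GCAT"))
```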
 
Protein sequence
MVLSIKSRRD GVGLGRGHLL VAGAWRPSRD GGTWVHLHPA TGEEVGEFAI ADPADVDAAV 
RAARQAFDEG PWPRSRARER IRVLRRAADL IREHSDELLA LQALDNSVPL SFSGAYVMSA
ECAADVFDHH AGWIDKLGGE TLPPYQGGDH LVFTLREPIG VVAAVIPWNA PLLLAAQKLA
PALASGCTVV LKPSEYATFA VLRLVQILDE AGVPPGVLNV VTGPGESTGE ALITHPMVDK
ITFTGSRAVG RRILHAAADG ITKVSLELGG KSPSIVFADA DVYAAAAMTM GTVTVGLSGQ
VCVAHSRALV QREVYDEFVS IATGATALAC YGDPFDAETT ASPLINGRQL DRVLGYVAQG
QAEGARLVCG GERVGGELAA GNFVTPALFA DVASDMTIAR EEIFGPVLGV TPFTDEQEAI
RLANDTEYGL AAMVWTADVK RAMRLTRAVR AGTIGVNGYQ VEPHAAFGGF GQSGLGREGG
RGSAEAFTEV KTVLVPTTEE LM
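
The protein sequence follows from the gene sequence under translation table 11. The sketch below builds the codon table from the NCBI amino-acid string for that table (its codon assignments match the standard code) and translates up to the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons, which table 11 permits, are not handled because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA (whitespace ignored) until the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGTTCTCT CGATCAAGTC ACGACGGGAC"))  # → MVLSIKSRRD
```

The output matches the first ten residues of the protein sequence listed above.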