Gene Sare_3542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3542 
Symbol 
ID5703923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4086063 
End bp4087061 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content68% 
IMG OID641272969 
Productaldo/keto reductase 
Protein accessionYP_001538335 
Protein GI159039082 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00316128 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAATTTC GACACCTGGG CCGTTCCGGC CTGATGGTCA GCGAGATCTC GTACGGTAAC 
TGGCTCACCC ACGGCTCCCA GGTGGAGGAG GAGTCGGCCT TCGCCTGCGT CCGGGCCGCC
CTGGACGCCG GCATCACCAC CTTCGACACC GCGGACGCGT ATGCGAGCAC CCGCGCGGAG
GACGTCCTCG GTCGCGCCCT GCAGAACGAA CGGCGGGCCG GAGTCGAACT GTTCACCAAG
GTGTTCTTCC CGACCGGTCC GGGTCACAAC GACCGTGGCC TGTCCCGTAA GCACATCATG
GAGTCGATCG ACGGTTCGCT GCGTCGGCTG CGCACCGACT ACGTCGACCT CTACCAGGCG
CACCGCTACG ATCACAGCAC TCCGATCGAG GAGACGATGG AGGCGTTCGC CGACGTCGTC
CGCTCCGGGA AGGCCCTCTA CATCGGGGTC TCCGAATGGA CGGCGACGCA GCTGCGCCAA
GCCCACCAGC TCGCCCGTGA GCTGCGGATT CCGCTGATCT CCAACCAACC GCAGTACTCG
ATGCTGTGGC GGGTCATCGA GGCCGAGGTC ATACCGGCCA GCGAGGAGTT GGGCGTCGGC
CAGATCGTCT GGTCCCCGAT GGCCCAGGGC GTCCTGTCCG GCAAGTACCG GCCGGGCCAC
CCCCCGCCGA CGGGTTCCCG GGCCACGGAC GAGAAGTCCG GCGCGAACTT CATCGCCAAG
TGGCTGACCG ACGACGTGTT GACCCGGGTG CAGCAGCTCA AGCCGCTCGC CGAGCAGGCG
GGGCTGAGCC TGGCCCAGCT GGCCATCGCC TGGGTGCTGC AGAACCCGAA CGTCTCCTCG
GCGATCGTCG GCGCGTCCCG GCCCGAGCAG GTGAACGACA ACGTCAAGGC AGCCGGAGTG
CGGCTGGACG CCGACCTGCT CAAGGCGATC GACGACGTCG TCGAGTCGGT CGTCGAGCGG
GATCCGGCCC GTACCGAGTC CCCCGCGCGA CGGCCCTGA
 
Protein sequence
MEFRHLGRSG LMVSEISYGN WLTHGSQVEE ESAFACVRAA LDAGITTFDT ADAYASTRAE 
DVLGRALQNE RRAGVELFTK VFFPTGPGHN DRGLSRKHIM ESIDGSLRRL RTDYVDLYQA
HRYDHSTPIE ETMEAFADVV RSGKALYIGV SEWTATQLRQ AHQLARELRI PLISNQPQYS
MLWRVIEAEV IPASEELGVG QIVWSPMAQG VLSGKYRPGH PPPTGSRATD EKSGANFIAK
WLTDDVLTRV QQLKPLAEQA GLSLAQLAIA WVLQNPNVSS AIVGASRPEQ VNDNVKAAGV
RLDADLLKAI DDVVESVVER DPARTESPAR RP