Gene Sare_3557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3557 
Symbol 
ID5705050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4105014 
End bp4106021 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content71% 
IMG OID641272984 
Productaldo/keto reductase 
Protein accessionYP_001538350 
Protein GI159039097 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.729958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0108203 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA CACGGACACT GGGCCACAGC GGAATCGAGG TCAGTGCGAT CGGAATGGGT 
TGCTGGGCGA TCGGGGGGCC GCTGTGGGGC GACGGCGGGC AGCCGTTCGG CTGGGGCGAC
GTCGACGACG ACGAATCGGT GCGCACCGTC CACGCCGTAC TCGACCACGG CGGGACCTTC
TTCGACACCG CCAGCAACTA CGGCGCCGGG CACAGTGAGC GGATCCTCGG CCGCGCCCTC
GCCGGCCGCC GGGACCAGGT GGTGATCGCC ACCAAGTTCG GCAACTGCTT CGAGGAGACG
ACCCGCCGGT GGACCGGGAC CGACCACCGT CCCGAGCACG CCGTGACGAG CCTGGAGGCG
TCGCTGCGCC GCCTCGGGAC CGACCACGTC GACCTCTACC AGTTGCACCT CAACGAACTG
CCGACGTCCG CCGCGCTCGA CCTGGTCGAC ACGCTGGAGG ACCTGGTCAG CAACGGCAAG
ATCCGGGCGT ACGGCTGGAG CACCGACAAT CCCGAGTCGG CGGCGGCGTT CGCGGCGGCC
GGCCCGCACT GCGCCACCGT CCAGCACGAC CAGTCGGTGT TGGCGGACAA CGCGGCAGTG
CTGGCTATCT GCGACACGTA CGACCTGGCG AGCATCAACC GGGGCCCGCT GGCGATGGGT
CTGCTCACCG GCTCGACCCG GGCGGTCGGC TCCGACGACA TTCGCGGAGT GGCTCCACCG
TGGCTGGTCT GGTTCACCGA CGGCCAACCC ACACCGCGGT GGTCTCGGCG CGTGGCGGAG
ATCCGGGACG TGCTCACCGC CGACGGTCGC ACCCTGGCGC AGGGCGCGCT GGGCTGGTTG
CTGGCCCGCA GCCCGCGGAC CGTCCCGATC CCGGGCTGCC GCACCGTCGC CCAGGCAGCG
GAGAACATCG GCACGCTCAC CCGTGGTCCG CTCCCAACGG ACGCGTACGC CGAGGTCGAG
CGGCTGCTGT CGGATCTTCG GCAAACGCCA GCCGAACCGG TCAGGTGA
 
Protein sequence
MTMTRTLGHS GIEVSAIGMG CWAIGGPLWG DGGQPFGWGD VDDDESVRTV HAVLDHGGTF 
FDTASNYGAG HSERILGRAL AGRRDQVVIA TKFGNCFEET TRRWTGTDHR PEHAVTSLEA
SLRRLGTDHV DLYQLHLNEL PTSAALDLVD TLEDLVSNGK IRAYGWSTDN PESAAAFAAA
GPHCATVQHD QSVLADNAAV LAICDTYDLA SINRGPLAMG LLTGSTRAVG SDDIRGVAPP
WLVWFTDGQP TPRWSRRVAE IRDVLTADGR TLAQGALGWL LARSPRTVPI PGCRTVAQAA
ENIGTLTRGP LPTDAYAEVE RLLSDLRQTP AEPVR