Gene Sare_4281 details


Gene Information

Locus tag: Sare_4281
Symbol:
ID: 5706993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4856603
End bp: 4857556
Gene Length: 954 bp
Protein Length: 317 aa
Translation table: 11
GC content: 68%
IMG OID: 641273700
Product: aldo/keto reductase
Protein accession: YP_001539053
Protein GI: 159039800
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
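
The start, end, and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Sare_4281.
length_bp = gene_length(4856603, 4857556)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 954 317
```

Both results agree with the Gene Length and Protein Length fields of the record.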


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.198466
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0048015
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACTGAGA TGACGTACCG CCGGTTGGGT GACTCCGGGC TCGTGGTGTC CGTGGTCGGC 
ATCGGCTGCA ACAACTTTGG CCCCAGGGTC GACCTCGAAG GGACTCGGGC GGTGGTGGAG
GCCGCCCTCG ACGTCGGAAT CAACTTTTTC GACTCCGCCG ACATCTACGG GAAGGGCGGT
TCCGAGGAAC TGCTGGGCCA GGTACTCAAG GGACGCTGGG ACGATGTGGT GATCGCCACC
AAGTTCGGCA TGGACATGGA CGGCCGTAAC GGGCCGGACC ACGGCGCCCG CGGTGCCCGT
CGCTACATCA TGCGGGCGGT GGAGGCGTCG TTGCGTCGGC TCGGCACCGA CCACATCGAC
CTGTACCAGT TCCATGAGCC GGACCCGGGT ACTCCGATCG ACGAGACCCT CGCCGCACTG
GACGATTTGG TCACTGCCGG CAAGGTGCGC TATCTGGGTA ACTCCAACTT CAGCGGCTGG
CAGATCGCCG ATGCGGACTG GACCGCCAGG TCGCGGGGGC AGACCCGATT CATCTCCGCG
CAGAATCACT ACTCTCTAGT CGAACGGGGT GTCGAGGCTG AGGTCGTCCC GGCGTGTGAG
CGATTCGGGC TGGGCCTGCT GCCGTTCTAT CCCCTCGCCA ACGGCCTGCT CACCGGCAAG
TACAAGCGGG GCGAGGCCGC TCCGCCGGGC AGTCGCCTTG CCGGTGGCGG CCGGTACACG
GCCCGGCTGG CCGCCGCCGA ATGGGACACG ATCGAGGCGC TCGAGGCGTA CGCCGCCGAC
CGTGGACTCA CTCTGCTTCA GGTGGCCATC GGTGGGCTGG CCGCGCAACC GGCGGTGACC
TCGGTGATCG CCGGCGCCAC CACACCCGAC CAGGTTCGGG CAAACGCGCA GGCGGGTGCC
TGGCAGCCCG GCGTCGACGA CCTGGCGGCC CTGCGTGAGG TGCTCAGCAG GTAA
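
The gene sequence is printed in blocks of ten bases, so whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "GTGACTGAGA TGACGTACCG CCGGTTGGGT GACTCCGGGC TCGTGGTGTC CGTGGTCGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing all 16 sequence lines joined together to `clean_sequence` would give the figure for the whole gene.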
 
Protein sequence
MTEMTYRRLG DSGLVVSVVG IGCNNFGPRV DLEGTRAVVE AALDVGINFF DSADIYGKGG 
SEELLGQVLK GRWDDVVIAT KFGMDMDGRN GPDHGARGAR RYIMRAVEAS LRRLGTDHID
LYQFHEPDPG TPIDETLAAL DDLVTAGKVR YLGNSNFSGW QIADADWTAR SRGQTRFISA
QNHYSLVERG VEAEVVPACE RFGLGLLPFY PLANGLLTGK YKRGEAAPPG SRLAGGGRYT
ARLAAAEWDT IEALEAYAAD RGLTLLQVAI GGLAAQPAVT SVIAGATTPD QVRANAQAGA
WQPGVDDLAA LREVLSR
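
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 uses the same amino-acid assignments as the standard genetic code and differs mainly in permitting alternative start codons such as GTG, which are read as methionine at the first position. `ALT_STARTS` is a simplified subset of the permitted starts, and `translate_cds` is an illustrative helper, not the database's own tooling.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the start codons allowed by table 11.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a recognised first codon as M and stopping at '*'."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # initiator codon is always read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate_cds("GTGACTGAGA TGACGTACCG CCGGTTGGGT"))  # MTEMTYRRLG
```

The output matches the first ten residues of the protein sequence, including the GTG start codon being read as M rather than V.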