Gene Sare_1061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1061 
Symbol 
ID5705674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1189052 
End bp1190524 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content69% 
IMG OID641270577 
ProductUbiD family decarboxylase 
Protein accessionYP_001535961 
Protein GI159036708 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0281996 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCACT TCACCGACCT ACGTGGCTAT CTCGACGCAC TCGACGCGCT CGGTGACCTG 
AGGACCATCG AGCGGTCCGT GAGTGTCGAC CTGGAAGCAG CGGCGATTAC CCGCCGCTCG
TACGAGATCC GCGCCGCCGC ACCGCTGTTC ACCAACATCG CCGAGGACCG GACAGGCATG
CGGATGTTCG GGGCTCCCGC CGGTGTCAGC TCCCGAGCCG ACATGCCGCT CGCCCGGCTC
GCGCTCTCCG TCGGCCTGCC ACCCGAGACC GGTGCGGCGG CACTCGTGGA CCACCTCGTC
CGCGTCCGCG ACGCGGTGCC CGTACCGCCG CGGGCGGTGC CGCGCGAGAA CGCGCCGTGC
AAGCAGAACG TGTTGCTCGG TAGGGAGGCG ACACTGGACC GGTTCGCGGT CCCACGTCTG
CACGAGTCCG ACGGCGGCCG GTACCTCAAC ACCTGGGGTG TGATCGTTGT CAGGACGCCC
GACGGTGCGT GGGTCAACTG GTCGATCTCG CGGATCATGA TGCTCGACGG CAAACGGATG
ACAGGCCTGG TGGTGCCACC GCAGCACCTC GGTCTGGTCT GGCAGGCGTG GGCCGAGCGC
GGTGAGCCGA TGCCCTACGC GCTGGTGCAG GGCGGCGCCC CGGCGATTCC CTTCGTGGGC
GGTATCCCGC TGCCGCGCGG GGTGGACGAG GCCGGGTACA TCGGCGCGCT GCATGGGGAG
CCGGTCGAGG TGGTGCGCTG CGAAACCTCC GACCTGGAGG TACCCGCGCA CGCCGAGGTG
GTCATCGAGG GACACATGTC GGTGGGCCGG GACAGCCGTG AGGGGCCGTT CGGCGAGTAC
GCCGGCTACG CCTCCACCCA GTCCTCCACC CAGCCGGTGT ACTCGGTGGA GGCCATCACC
TACCGCGACG ACCCGATCTG GCCGATCGTC CCGGAGGGCC GGCCGCCCGA CGAGTACCAC
ACCGTGACCG GCACCGGTCG CGCCGCGAAC GTCCTGCACG CGCTGCGACG GGCAGGGCTG
CCGGTGACCA CGGTGTGGAT GCCGTTCCCG GCAGCGATGC ACTGGACCGT GGTGACCGTC
CCGGACGACT GGCGGTCGCA CCTACCCGGG GTGGACTCCG GAGAGTTCGT ACGACGAATC
GGCGAGGTCA TCCACAACAG CGGTGGACCC AGCGCGATGA TGCCGGTCAC CTTCGTTCTG
GATGATGACA TCGACCCCTC CAACGAGGCC GACCTGCTGT GGGCGCTGTC CACCCGGTTG
CATCCGAAGG ACCGACGCTT CGCCTGGGAC GGTGTGGTCC TACCGTTCAT GGCCTGCTAC
ACCGAAGACG AGCGCAAGAC GATGCGTGGT CCGAGTGTCG TCCATGACGG GCTGCTGCCT
GCCTGGGGCG AGGGCCGGCT GCACCACAGT TCCTTCGCCC AGGCCTACCC CGCCGACATC
CGCCGCAGGG TGCTCGAGCA CGAAGACGGT TGA
 
Protein sequence
MSHFTDLRGY LDALDALGDL RTIERSVSVD LEAAAITRRS YEIRAAAPLF TNIAEDRTGM 
RMFGAPAGVS SRADMPLARL ALSVGLPPET GAAALVDHLV RVRDAVPVPP RAVPRENAPC
KQNVLLGREA TLDRFAVPRL HESDGGRYLN TWGVIVVRTP DGAWVNWSIS RIMMLDGKRM
TGLVVPPQHL GLVWQAWAER GEPMPYALVQ GGAPAIPFVG GIPLPRGVDE AGYIGALHGE
PVEVVRCETS DLEVPAHAEV VIEGHMSVGR DSREGPFGEY AGYASTQSST QPVYSVEAIT
YRDDPIWPIV PEGRPPDEYH TVTGTGRAAN VLHALRRAGL PVTTVWMPFP AAMHWTVVTV
PDDWRSHLPG VDSGEFVRRI GEVIHNSGGP SAMMPVTFVL DDDIDPSNEA DLLWALSTRL
HPKDRRFAWD GVVLPFMACY TEDERKTMRG PSVVHDGLLP AWGEGRLHHS SFAQAYPADI
RRRVLEHEDG