Gene Sare_0896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0896 
Symbol 
ID5704231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1006358 
End bp1007638 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content74% 
IMG OID641270414 
Productputative phytochrome sensor protein 
Protein accessionYP_001535804 
Protein GI159036551 
COG category[K] Transcription
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3284] Transcriptional activator of acetoin/glycerol metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0355459 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGACC CGTGGCTCGC CCTGGAAATC GGCGTCGATC CGGCGGAGCG GATCGCGCAG 
GTGGGTGTTG CCCACACGGC GTTCCTCACC GGGGCGACGC CACAGCGGGT ACGGGACGTG
GTGCGCCGAT CGTGGGAACG GTCGATGCCC CTCGACCCGG AGGGCACCCC GCCCGTCGAC
CTCGTCGACG ACACGCTGGA GAGCTACCGT GCCGGGCATC CGCTGGCCCC GGTGCTGCCG
ATCTTCCGGG ACCTGCTCGG TGGGATCGCC CAGGACGGCG CGCACCTGTT GGCGGTCTGC
GACACGCACG GCCGCCTGCT CTGGGTGGAG GGGCACCCGG GGTTGCTACG ACGGGCCGAG
GGGATGAACT TCGTGCCCGG GGCCCGCTGG GACGAGGCGC ACGCTGGCAC CAACGCACCC
GGCACCGCCC TCGCCGTGGA CCACAGCGTG CAGATCTTCG CCACCGAACA TTTCTGCCGC
CGTGTCCAGC GCTGGACCTG CGCCGCCGCG CCCATCCACG ATCCGGCCAC CGGCCGGATA
CTCGGCGCGG TGGACATCAC CGGCGGTAAC CACCTCGCCA CCCCGCAGAG CCTGGCGCTG
ATCCGGGCCA CCGCACGGGC CGCCGAGGCG TTCCTGGCCG CGCACGCGCC GATCGAACCG
GACGTGGTGC AGGTGTCCGC GCTTGGCCGG GACGAGGCGC AACTACAGGT CGGTGGCCGT
CGCATCCGGC TCGGTCGCCG GCACAGCGAG CTGCTGGTGC TGCTGGTGGA CCACCCCGAG
GGGCGCACCG GGGAGCAACT CGCCCTCGAC CTCTACGGCG AGGACCGGCC GCACCCGGTC
ACTGTTCGGG CCGAGCTGTC CCGGCTCCGC CGCGTGCTGG GCCCGGAGTT GCTCGACTCC
CGCCCGTACC GGCTGCGCTG CCGGGTACGC GCCGACTTCC GTACTGTCAC CGACCGACTG
GAGCGGGGCG ACCCGGCGGG CGGGCTCGAC GCGTACCCCG GGAGTCTGTT GCCCGGCTCG
GACGCGCCGG GCGTGGCGAG GCTGCGCCGG CTGGTCGACG GCCAGCTACG CGCGGCCGTG
CTGGCTGCCG CGGATCCGGC CCTGCTCGCG GCGTGGACCG CGACGCCCGC CGGCACCGAC
GACCTGACGG CCTGGGAGGC CCTGGCGCGG GTGTTGCCGC CGGGCGCACC TCGCCGGCCC
CTCGCCCTCG CGCGGGCCCG CCAGCTCGCC CGAGAGTACG GCGTGAGCCG TGCAACGTCG
CTGCAACGTC GACAAAAATA G
 
Protein sequence
MVDPWLALEI GVDPAERIAQ VGVAHTAFLT GATPQRVRDV VRRSWERSMP LDPEGTPPVD 
LVDDTLESYR AGHPLAPVLP IFRDLLGGIA QDGAHLLAVC DTHGRLLWVE GHPGLLRRAE
GMNFVPGARW DEAHAGTNAP GTALAVDHSV QIFATEHFCR RVQRWTCAAA PIHDPATGRI
LGAVDITGGN HLATPQSLAL IRATARAAEA FLAAHAPIEP DVVQVSALGR DEAQLQVGGR
RIRLGRRHSE LLVLLVDHPE GRTGEQLALD LYGEDRPHPV TVRAELSRLR RVLGPELLDS
RPYRLRCRVR ADFRTVTDRL ERGDPAGGLD AYPGSLLPGS DAPGVARLRR LVDGQLRAAV
LAAADPALLA AWTATPAGTD DLTAWEALAR VLPPGAPRRP LALARARQLA REYGVSRATS
LQRRQK