Gene Sare_4700 details


Gene Information

Locus tag: Sare_4700
Symbol:
ID: 5708160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5321924
End bp: 5323099
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 69%
IMG OID: 641274098
Product: DNA integrity scanning protein DisA
Protein accession: YP_001539444
Protein GI: 159040191
COG category: [R] General function prediction only
COG ID: [COG1623] Predicted nucleic-acid-binding protein (contains the HHH domain)
TIGRFAM ID:
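
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 5321924
END_BP = 5323099
REPORTED_GENE_LENGTH = 1176
REPORTED_PROTEIN_LENGTH = 391


def gene_length(start_bp, end_bp):
    """Span length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print(computed_gene, computed_protein)
```

Under these assumptions the computed values agree with the reported gene length and protein length.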


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.018924
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGATCG ACCGCGATGC CACCAAGCCT GCCGGCGCGA CGCCGCATGC CCGCACCGCT 
GCCGTGGGTT CGCCCGCCCG TCCGATCAGC GTGCACGTGA CCGGAGGCGT GGCCAGTGAC
CCGCTGCGCG CCAACCTGGC CCTGATGGCA CCGGGCACCG CCCTACGCGA CGGTCTGGAG
CGGATCCTCC GCGGACGCAC CGGAGCCTTG ATCGTCCTCG GCTACGACAA GGTCGTCGAG
AGCCTCTGCA CTGGCGGCTT CCCGCTCGAC GTGGAGTTCT CCGCCACCCG CGTACGAGAA
TTGTGCAAAA TGGATGGCGC AGTGGTGCTT TCCAGCGACG GTAGCCGGAT CGTCCGCGCG
GCAGTGCACC TGATGCCCGA TCCCGCGATC CCGACCGAGG AGTCCGGCAC CCGTCACCGT
ACCGCCGAGC GGGTCGCCCG CCAGACCGGC TACCCGGTCA TTTCGGTGAG CCAGTCCATG
CGGATCATCA GCCTCTACGT CAACGGTCAG CGGCACGTGC TGGACGACTC GGCCGCCATC
CTCTCCCGAG CCAACCAGGC GCTCGCCACG CTCGAGCGAT ACAAGCTGCG CCTGGATGAG
GTGTCCGGCA CCCTCTCCGC CCTGGAGATC GAGGACCTGG TCACCGTTCG GGACGCGGTC
GCCGTCGTCC AACGACTGGA GATGGTCCGC CGGATCGCGG ACGAGATCGC CGGGTACGTG
GTGGAACTGG GCACCGACGG CCGGCTGCTC GCCCTGCAAC TTGACGAGTT GATGGCCGGC
GTGGACGCCG ACCGCACCCT GGTCATCCGG GACTACCTGC CCACCGGCCG CAAGTCACGC
ACCCTTGACG AGGCCCTGGT CGAATTGGAC CTGCTGACCG CAACCGAACT GATCGATCTG
GTTGCGGTCT CCCGAGCGAT CGGCTATCCG GCGGCCTCCG ACGCGCTGGA CGCCGCGCTC
AGCCCGCGCG GCTTCCGGCT ACTGGCCAAG GTACCGCGCC TGCCGGTAGC GATCGTGGAC
CGTCTGGTGG GGCACTTCGG CAGCCTTCAG CGGCTACTCG GCGCGACCGT GGAGGACCTG
CAGGCCGTCG AGGGCGTGGG AGATGCCCGC GCCAGGGGCG TGCGGGAAGG GCTTTCCCGG
CTCGCCGAGG CATCGATCCT GGAACGCTAC GTCTGA
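
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as the GC content reported in the gene information. A minimal sketch, assuming the sequence is pasted in as a plain string; the helper names are hypothetical and only the first ten-base block is used as sample input here.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First ten-base block of the gene sequence, used only as sample input.
first_block = "GTGCCGATCG"
print(f"{gc_fraction(first_block):.0%}")
```

Passing the full gene sequence to `gc_fraction` gives the value to compare against the reported GC content.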
 
Protein sequence
MPIDRDATKP AGATPHARTA AVGSPARPIS VHVTGGVASD PLRANLALMA PGTALRDGLE 
RILRGRTGAL IVLGYDKVVE SLCTGGFPLD VEFSATRVRE LCKMDGAVVL SSDGSRIVRA
AVHLMPDPAI PTEESGTRHR TAERVARQTG YPVISVSQSM RIISLYVNGQ RHVLDDSAAI
LSRANQALAT LERYKLRLDE VSGTLSALEI EDLVTVRDAV AVVQRLEMVR RIADEIAGYV
VELGTDGRLL ALQLDELMAG VDADRTLVIR DYLPTGRKSR TLDEALVELD LLTATELIDL
VAVSRAIGYP AASDALDAAL SPRGFRLLAK VPRLPVAIVD RLVGHFGSLQ RLLGATVEDL
QAVEGVGDAR ARGVREGLSR LAEASILERY V
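
The gene sequence starts with GTG while the protein sequence starts with M, which fits translation table 11, where certain alternative start codons are read as methionine in the first position. The sketch below is a hand-built illustration rather than the database's own method: the codon table is written out in the usual TCAG ordering, and the start-codon set is an assumed subset of those defined for table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed subset of the start codons used with table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(text):
    """Translate a CDS, reading a start codon as M and stopping at a stop."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base blocks of the gene sequence above.
print(translate("GTGCCGATCG ACCGCGATGC CACCAAGCCT"))
```

The printed residues can be compared with the first block of the protein sequence; running `translate` on the full gene sequence allows the same comparison for the whole protein.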