Gene Sare_0180 details

Gene Information

Locus tag: Sare_0180
Symbol:
ID: 5706337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 192672
End bp: 195074
Gene Length: 2403 bp
Protein Length: 800 aa
Translation table: 11
GC content: 67%
IMG OID: 641269706
Product: XRE family transcriptional regulator
Protein accession: YP_001535106
Protein GI: 159035853
COG category: [K] Transcription
COG ID: [COG3620] Predicted transcriptional regulator with C-terminal CBS domains
TIGRFAM ID:
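The coordinates and lengths in the table above agree with each other, and the short sketch below shows the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 192672
END_BP = 195074
GENE_LENGTH_BP = 2403
PROTEIN_LENGTH_AA = 800


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```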


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.476576
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00129717
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCAGATC AACTCGGATT GCTGTTGCGT CGGCTACGAA ACCAAGCGGG ATTGACACAG 
GAACAGGTCG CGGAACGGTC CGGGGTGAGT GTCCGCACCA TTCGCCGCCT GGAATCGGCG
AGGAACATCG ATCACCGCCT GGGCACCCTG AACCTGTTGG CTGACGCGCT GGAACTTGGA
TCCGAGGATC GCGAACTCCT CGCCACCATG CTCGCGAGAA CGAGCACCCC GCCCACGTTC
GCGGTTCACG CTGCTTCCAC GGAGACTCGG CCAACTGCAT CGGCCAGCGG GCCGCCCAGG
GCGCCATCGA GGCAGACGTC CGTCCTGGTG CCGGCGCGTG CCGTGCTGGA CGCTGCTGAG
ACGTTGGCGA GGGAAGCCAA ACGTCGGTGG CAGCGCGAGG AGGAGCAGCG CCAGGTCCAC
GACCCGTTTC CGTTGCCGCT GTGCTGGCAG CCGGCCCCCG CGCAGCTGGT CGACTACGCG
GCGAACGTCC AGCGCCTTCC ACCAGGAGCT ACCCCACAAC TGGATCTCAG TGGTGATATG
GGCAGTATCG CCGAGGTCTA CCGAAGGATC CGGTCCGGCC GGTTGGTGAT CCTGGGCCGG
GCGGGTTCGG GTAAGTCGAT CATGTTGATC AGGCTCGTCC TGGACTCGCT CGCGGCCTTC
ACCCTCCCCG AGCGGGTGCC GATGATCTTC AACCTCGGAT CCTGGGACGC GACAGCCATC
ACCTTGCGGG ACTGGCTGAT CGGTCAGCTG TTGCGTGACC ACCCGCACCT GGCCCGCAGG
GCACCCGGCG GTCCGACGTT GGCCGCCGCA CTGGTCGACG CTGATCTCAT CCTGCCCATC
CTGGACGGGT TCGACGAGAT CGCCGACGGC CTGCGGCGCG AGGCGCTCGA AGCGTTCAAC
GCGACCTCAC TGCCGCTTGT GCTGACCAGC CGCCGTGACG AGTACGCCGA GGCGGTGCGC
GGAGCCGGGG CCCCGCTGAA CTGGGCGGCC GGTATCGAGC TCGTGGACCT CACCCTCGAT
GACCTCGCGG CCTACCTGCC CCGGACCGCC AGGCAGGCCG GCCGCGACGA CACCGTAGCG
GTGTGGGATC CCGTCCTGAA GCTGCTGCAG GCCACGACGT GCCCGGCGAG CGTGAACCTC
ACGAGAGTAC TGTCCACTCC TCTGATGGTC GTCCTGGCGC GGACGATGTA TAGCGAGACA
CCGGAACGGG ATCCGGCCGA ACTGCTCGAC ATAACGCGGT TTCCCAGCGC GAAGTCCGTC
GAGGAGCATT TACTGGCAGG ATTTGTCCCG ACGGTCTACC GACCGTCCGT CCCCGACCGG
GAGGCTGGCG GCTTCCGGCA GCGGACCTGG AACCCGCACC ACGCAGAGCG TTGGCTCGGC
TACCTCGCCC ATCAGTTGGT ACGGCACGGC CAGGACCGGC GGGATCTCGC GTGGTGGCAG
ATCGGCGACT CCCTTCGTCG TTCGACGCGT ATCGGGACCG CAACGCTGGT TTCTGCGCTG
TGCACTGCCG TGTCGGCCTG GATGACCGGG CTGGTCGCCG GGCAGGTCGA CCCCGAGCAG
ATCCTGGTGG AAGGGGCCAT GATGGGGCTG TCGGCCGGCC TCGCCTTCGG AGCTGTCTAT
GCGGCGATAA CCGCCTTCGG CGGCACCTTC CAGCCGACCC ACGTGCGGCT GCGACTGCGT
CGCCGCCACA GCGTCGTCGC CCGCCCGCCA ATCCAGACAT TCACCGTCAG GTTCGCAGTC
GTCCTGCTGG GCGGGTTCGT CATGGGCGTT GGAAGCGCCT GCGCCACCGC CCTGGTACGC
GCACGGTACT GGGAAACCCC GCTCGCGAGC CTTGAGGTGA TCAGGGCGAC CCTCATCAAC
ATGCTGGTCT TCGGACTGAT CTTCGGTTTG GCGGCCGGAC TGGTGTTCGG GCTCCTGGCC
GCGTTGGAGG TGCCGGTGGA CGTTTCCTTC GTCGCCACTC CGGTCAGCCT ATTGTCCGCG
AACCGTGCGA CCGTGAGCCA ACAGATCCTC TTCCTTGCCC CCGTTCTCGC CCTGACAATC
GCCGTCGGTG GACGGCTGGT CGTCGACCTG CTCGAGGGAG GCGTCCTCGG AGAGCTGAGA
TGGGCCTGGC CCGACGCGTT TCTCATCGGG GCTGCCGGGG GGCTGGGAGG CGCATTGTCG
TACATGTTCT GCTTCACCGC CTGGGGGCAG TGGGTGATCC TGACCCGGGT GTGGCTACCG
CTGACTGGCC GGCTGCCCTG GAACACGATG GCCTTTCTGG AAGGCGCCTA CCGACGGGGC
GTGCTCCGTC AGACCGGCGC GGTTTACCAG TTCCGCCACG TTCGGCTTCA ACAGCACCTG
AGTCACTCGT ACTGCGAGCG GCGGCGGAGA AGTCGACGAA CTCCCGTTCA CTCCGAGGGC
TGA
 
Protein sequence
MADQLGLLLR RLRNQAGLTQ EQVAERSGVS VRTIRRLESA RNIDHRLGTL NLLADALELG 
SEDRELLATM LARTSTPPTF AVHAASTETR PTASASGPPR APSRQTSVLV PARAVLDAAE
TLAREAKRRW QREEEQRQVH DPFPLPLCWQ PAPAQLVDYA ANVQRLPPGA TPQLDLSGDM
GSIAEVYRRI RSGRLVILGR AGSGKSIMLI RLVLDSLAAF TLPERVPMIF NLGSWDATAI
TLRDWLIGQL LRDHPHLARR APGGPTLAAA LVDADLILPI LDGFDEIADG LRREALEAFN
ATSLPLVLTS RRDEYAEAVR GAGAPLNWAA GIELVDLTLD DLAAYLPRTA RQAGRDDTVA
VWDPVLKLLQ ATTCPASVNL TRVLSTPLMV VLARTMYSET PERDPAELLD ITRFPSAKSV
EEHLLAGFVP TVYRPSVPDR EAGGFRQRTW NPHHAERWLG YLAHQLVRHG QDRRDLAWWQ
IGDSLRRSTR IGTATLVSAL CTAVSAWMTG LVAGQVDPEQ ILVEGAMMGL SAGLAFGAVY
AAITAFGGTF QPTHVRLRLR RRHSVVARPP IQTFTVRFAV VLLGGFVMGV GSACATALVR
ARYWETPLAS LEVIRATLIN MLVFGLIFGL AAGLVFGLLA ALEVPVDVSF VATPVSLLSA
NRATVSQQIL FLAPVLALTI AVGGRLVVDL LEGGVLGELR WAWPDAFLIG AAGGLGGALS
YMFCFTAWGQ WVILTRVWLP LTGRLPWNTM AFLEGAYRRG VLRQTGAVYQ FRHVRLQQHL
SHSYCERRRR SRRTPVHSEG
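
The sequences above are printed in space-separated blocks of 10 residues, 60 per line, so they need to be joined before any length or composition check. The following is a minimal sketch of how that could be done; the helper names are hypothetical and not part of the source database, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "GTGGCAGATC AACTCGGATT GCTGTTGCGT CGGCTACGAA ACCAAGCGGG ATTGACACAG"

seq = clean_sequence(first_line)
print(len(seq))        # 60: six blocks of 10 bases
print(seq[:3])         # GTG, the first codon of the gene
print(f"{gc_fraction(seq):.2%}")
```

Applying the same two helpers to the full gene sequence would allow the stated gene length (2403 bp) and GC content (67%) to be checked against the sequence itself.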