Gene Sare_0783 details


Gene Information

Locus tag: Sare_0783
Symbol:
ID: 5705026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 876898
End bp: 878082
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 69%
IMG OID: 641270302
Product: XRE family transcriptional regulator
Protein accession: YP_001535693
Protein GI: 159036440
COG category: [K] Transcription
COG ID: [COG1396] Predicted transcriptional regulators
TIGRFAM ID:
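
The length fields above follow from the coordinates. The sketch below is a minimal consistency check, assuming the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 876898
end_bp = 878082

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not encode a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)         # 1185
print(gene_length % 3)     # 0, the CDS is a whole number of codons
print(protein_length)      # 394
```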


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.500633
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00490863
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCCCACTG ACACGGTCGG CTCGCGCATC AGGTACTGGC GGATCCGCCG GGGCGGCATG 
AGCCAAACCG TGCTCGCCGG ACTGGCTGGG CTCTCACAGC CCTACATCTC CCAGGTCGAG
TCTGGGCACC GCAGCATCGA CCGCCGATCC ACGCTCATCG CCATCGCGGC GGCGTTACAG
GTGACCGTCG CTGACCTGCT CGACCAGCCC GGTGATCCCA CCGATCCGGC GCTACTCGGA
GCTGCCGACG CGGTCCCGGC CATCTGGGCC GCACTCCTGG AGATCAGCGA CGGTGATCGA
CGTGTCACGA CCATGCCCAG CGAGCAGGTC ACCGTCGGAT TGCGGGACAG CAACCAGAGA
CGGTTGGCCG CTGACTACCC GGCTATGGCC CGAGGTCTGC CGGCACTACT CGGCGAGGCC
GCCGGGCACG GCGGGCCGAT CCTCGCCGAG GCCGCCTACC AGGCGGCCAC GTGTCTACGG
CATCTCGGGC ACCGGCACCT TGCCGTCGAT GCTGCCCGGA TTGCTCAGGC GGCGGCAGAC
GATGTCGACC ATCACGCGTG GGCGGGGGCG GCCCGGTTCG CGTACGTGCA GGCGCTGCCG
GTGGAGGCGG CGGGGGTGGC ATCCGGGGTC GCGGCTCGGG CCTTGGGCGA CCTGCAACAG
CAGGCGGCGG ACCCACGAGT ACGGCAGGTT CTTGGTCAGC TGCATCTGGC TGCGGCGCTG
CGAGCAGCGA GCGACGGCCA CCTTGCCGAC GCCGACGGGC ACTTGGTCGA GGCCGAACGG
GAGGCCGCCA CGTTAGGCGA TCCAGGCTCT GGCGGTGGTT TCAACACCAT GTGCTTCGGC
CCCACCAACG TGGTGCTCTG GCGAATGGCA GTCGCCGCCG AGTCCGGCGA GTACGGACGC
GTCATTGAAC TGTCGCGCAC TGTTTCGGTC GATGTGTTGC CGATCGCGAA CCGCCGCCAG
GCATACTGGA TGGATCTTGG CCGGGCGTTG GCGCACTCAG GGCGAACCGA CACCCAGGCG
CTGATCGCGT TCAGCCGCGC TGACCAGATC GCACCCAGCC TGTTCGTGCT GAACCCCCTG
GCCCGAGAGG CCGTAGCGGC GATGGTCCGC CGCGCCCGCC GCCGCGCGGT GTCGAAGGAA
CTTCGAACCC TCGCCCGTCG CCTGTCGATC AGTACAGACG TATAA
 
Protein sequence
MPTDTVGSRI RYWRIRRGGM SQTVLAGLAG LSQPYISQVE SGHRSIDRRS TLIAIAAALQ 
VTVADLLDQP GDPTDPALLG AADAVPAIWA ALLEISDGDR RVTTMPSEQV TVGLRDSNQR
RLAADYPAMA RGLPALLGEA AGHGGPILAE AAYQAATCLR HLGHRHLAVD AARIAQAAAD
DVDHHAWAGA ARFAYVQALP VEAAGVASGV AARALGDLQQ QAADPRVRQV LGQLHLAAAL
RAASDGHLAD ADGHLVEAER EAATLGDPGS GGGFNTMCFG PTNVVLWRMA VAAESGEYGR
VIELSRTVSV DVLPIANRRQ AYWMDLGRAL AHSGRTDTQA LIAFSRADQI APSLFVLNPL
AREAVAAMVR RARRRAVSKE LRTLARRLSI STDV
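
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which GTG can act as a start codon and is then read as methionine. The sketch below is a minimal illustration, assuming the given sequence is the coding strand. The helper names `codon_to_aa` and `translate_cds` are hypothetical and written only for this example. It translates the first 60 bases of the gene sequence.

```python
# Codon table in the compact TCAG-ordered layout; the amino acid
# assignments of table 11 are the same as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def codon_to_aa(codon):
    """Look up one codon in the TCAG-ordered table."""
    i, j, k = (BASES.index(base) for base in codon)
    return AMINO_ACIDS[16 * i + 4 * j + k]


def translate_cds(dna):
    """Translate a coding sequence, reading an initial start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        else:
            aa = codon_to_aa(codon)
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in the record.
first_line = "GTGCCCACTG ACACGGTCGG CTCGCGCATC AGGTACTGGC GGATCCGCCG GGGCGGCATG"
print(translate_cds(first_line))  # MPTDTVGSRIRYWRIRRGGM
```

The result matches the first 20 residues of the protein sequence. Without the start-codon rule the first residue would be V rather than M.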