Gene Sare_3690 details


Gene Information

Locus tag: Sare_3690
Symbol:
ID: 5705298
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4248483
End bp: 4249691
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 68%
IMG OID: 641273110
Product: XRE family transcriptional regulator
Protein accession: YP_001538474
Protein GI: 159039221
COG category:
COG ID:
TIGRFAM ID:
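
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 4248483
end_bp = 4249691
gene_length_bp = 1209
protein_length_aa = 402


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 4249691 - 4248483 + 1 = 1209 bp, and 1209 / 3 = 403 codons, one of which is the stop codon, leaving 402 residues.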


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.456142
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCCCG TCATCGAGCC GGGGATGACC CCCGGCCAAC GCATCAAGCT CTACCGCCGC 
CGAGCCGGCC TGACACAAGA AGTATGTGCC CAACTCAAGG GCGTCACCGT TTCCGCCTGG
CGTAAGTGGG AGTCAGGGGA ACGATCCGTT AATACCCTCT CCGACTGGGT CGACATCGCT
CGTATCCTGC GGGTACGCGA CCTGTACCGC ATCACCGGAC TCCCCGTCGG TCATCTCCCG
GATGACCCGG TGGACCACGA GTTAGTGGGA CCGCTCCGCG CCGCCATCCA CGCCTACGGC
CCGCCAGAGG TCGAGCTGAT GCCGCTGCCC GAGCTACGCG CCGCGGTGCG GTTGGCGTGG
ACGACATGGC ACCAGTCGCG GCAGCGTTAC ACCTACACCG GCCCTGCCCT GCCCGGCCTC
GTCCAGGCCG TTACCGCTGC CGTCGCCAGC ACCGACGGCG ACTCACGGCG TGACGCGCTG
CGGATCTCCG CCGACCTGTA TCTCCTGGTG CGCTCCTACG CCAAGCGCAT CGGCGCACAT
GACGTGGCGC TCGTGGCCGC CGATCGGGCG CTGGCCGCTG CACGTGACGC CGACGATCCG
ATCTACCGCG GCGCCGGGGC GTGGAACCTC GGCCAGGCGC TGTCGATGCG CGGCCACGCG
GAGGAGTCCG CTGAGCTATG CCGGTCCGCT ATCGCTGAGC TACGCGCAAT CGATGACCAG
GACCCGGTGC GGCTGTCTGT CCTCGGCGGG CTCAATCTGC TGCTGTCTGT GCAGGCGGCT
CGGCTGCACA ACGACCGGGA CACCACAGTG GTCCTGAGCG AGGCGGAAAA GCTCGCTGCC
GTCGTCGGGG AAACGTCTCA CCACTGGCTG TTTTTCGGCC CGATCAACGT CGGGATTCAC
CGGGCTGCGG TGGCGCTGGA GCTGTCGCGG CATGGTGAGG CGCTCAAGTT GGGCGAGCGG
GTCGACGTGA CTGGATCGCC GAGCATCGAG CGCCGGCACT CGCACCTGCT GCACCTGGCG
CGTGGGTACG CCACGCAACG CGACGACGTG GCCGCCTCGC TCATGTTGAG CCGCGCACAC
CAGGAGTCAC CCGAGGATTC CCGGCTAAGC CTGACTATGC GGGCGTTGCT GCGAGAGCTT
CTGGCTCGGG AGACTCCGAC GACCCGGCCA GAGCTTCGGA GTCTGGCCGA TCAGGTCGGG
GTCGCCTGA
 
Protein sequence
MDPVIEPGMT PGQRIKLYRR RAGLTQEVCA QLKGVTVSAW RKWESGERSV NTLSDWVDIA 
RILRVRDLYR ITGLPVGHLP DDPVDHELVG PLRAAIHAYG PPEVELMPLP ELRAAVRLAW
TTWHQSRQRY TYTGPALPGL VQAVTAAVAS TDGDSRRDAL RISADLYLLV RSYAKRIGAH
DVALVAADRA LAAARDADDP IYRGAGAWNL GQALSMRGHA EESAELCRSA IAELRAIDDQ
DPVRLSVLGG LNLLLSVQAA RLHNDRDTTV VLSEAEKLAA VVGETSHHWL FFGPINVGIH
RAAVALELSR HGEALKLGER VDVTGSPSIE RRHSHLLHLA RGYATQRDDV AASLMLSRAH
QESPEDSRLS LTMRALLREL LARETPTTRP ELRSLADQVG VA
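
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch in Python of both calculations. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares for sense and stop codons (it differs in the permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon assignments in T, C, A, G order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGACCCCG TCATCGAGCC GGGGATGACC"
print(translate(first_blocks))  # MDPVIEPGMT
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow comparison against the listed protein sequence and the GC content field.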