Gene Sare_4747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4747 
Symbol 
ID5705338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5371854 
End bp5372816 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content72% 
IMG OID641274145 
ProductAraC family transcriptional regulator 
Protein accessionYP_001539491 
Protein GI159040238 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000169446 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTCCGCT CCGTCGCCGT CATCGCCCTC GACCGAGTCG CCCCCTTCGA GCTCGGCGTA 
CTGGCCGAGG TCTTCGGCAC CGACCGCACC GCCGACGGCT TCCCGGGCTA CCGCTTCACC
GTGTGCACCG TCGACGGTGG CCCGGTCCGC ACCTCGTCCG GCTTCCACCT CACCCCGCAC
GGCGACCTGA CCGCGGTCGA CGAGGCCGAT CTGGTGGCCG TGCCCGCGCA CCCCCGGGAC
TCACCTGTTC CGCCGGCCGC ACTCGCCGCG CTCCGCCAGG CTGCCGAACG AGACGCGTAC
GTGTTCAGCG TCTGCTCCGG CGCCTTCGTA CTCGGCGCCG CCGGGCTACT CGACGGACGC
GAATGCACCG CCCACTGGGC GCACGTCGAC GAGTTGCGAC AGCGCTACCC CGCGGCGAGG
GTGCGGTGCA ACTCCCTCTA CGTCGCGGAC GGACGGCTGA TCACCAGCGC CGGCACCGCC
GCCGGCATCG ACGCCTGCCT ACACCTGGTC CGGCAGGAAC ACGGGTCGGC GATCGCCACC
CGGCTGGCCC GCCGAATGGT GGTCCCCCCA CACCGGGACG GCGGGCAGTC CCAGTACGTC
GAGACCCCGA TCTCCAGCGA GCCCGAGGCG CAGACCCTGG AGCCGGTACT GCAATGGCTG
ATGGGCCACC TGAACCGGTC GCTGACCGTG GACGACCTGG CCGCCCGCGC CGACATGGCA
CCCCGTACGT TCGCCCGCCG GTTCCGGGCG GAGACCGGCA CCACACCGCA CGACTGGCTC
ACCAACCAGC GGGTGTTGCT CGCCCGACGG CTCCTGGAAG AGACCCGTCT CAGCATCGAG
GAGGTGGCCG GCCGTACCGG CTTCTCCGAC GCCGCTGCCC TGCGCCACCA CTTCACCCGC
CGGGTCGGGA CCACCCCGAA CGGCTACCGC ATCACCTTTC GGGACCGAAC GCCTGCCCGC
TGA
 
Protein sequence
MLRSVAVIAL DRVAPFELGV LAEVFGTDRT ADGFPGYRFT VCTVDGGPVR TSSGFHLTPH 
GDLTAVDEAD LVAVPAHPRD SPVPPAALAA LRQAAERDAY VFSVCSGAFV LGAAGLLDGR
ECTAHWAHVD ELRQRYPAAR VRCNSLYVAD GRLITSAGTA AGIDACLHLV RQEHGSAIAT
RLARRMVVPP HRDGGQSQYV ETPISSEPEA QTLEPVLQWL MGHLNRSLTV DDLAARADMA
PRTFARRFRA ETGTTPHDWL TNQRVLLARR LLEETRLSIE EVAGRTGFSD AAALRHHFTR
RVGTTPNGYR ITFRDRTPAR