Gene Sare_3194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3194 
Symbol 
ID5705629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3684477 
End bp3685469 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content69% 
IMG OID641272625 
ProductAraC family transcriptional regulator 
Protein accessionYP_001537992 
Protein GI159038739 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.857429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00472645 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGATCA ACGAGCACTA CCGACTGGAC CCGAACGCCC GGGTGCTGCT GTCCGATCTG 
GGCCTGTCCG TCGGCAACAT CCTGCGCCGG GCCGGGCTGC CGGGAGACAC GCTGTCCGAC
GGCCCGGCCA CCCTGACACC AGAGCGGTTC TACGCCCTGT GGGAGGCGGT CGCCGCCGAG
GCCGCCGACC CCGGGCTCCC GATCCGGATC GGTCAGGCCA TCTCCGTCGA GGCGTTCCAT
CCGCCGCTCT TCGCGGCGCT GTGCAGCCCG AACCTCGGCG TGGCCGCTGC CCGCATCGCC
ACGTACAAGG CGCTCATCGG ACCGCTACGA CTCGTCATCG CTACCACCGG TGAAGGGCTC
GAGGTGGAAC TGCACTGGCC GCCAGACCAC CGGCCCCCGG AAGTCCTGAC GACGACCGAG
CTCGTGTGGT GGGTCGCCCT GGCCCGACTG GCCACCCGGA CGAGGGTGGT GCCGGTCGCC
GTCACCAGCG CGCAGCCACC GTCCGCCGCC AGTGCACTCG CCGACTACCT CGGAGTTCGC
GTACAGCAGA CCGAGCGTTT CACCGTGACC TTCAGCGCCC GGGACTCGGC CCGCCCGTTC
CTCACCGCCA ACGAGCCGAT GTGGGAGTTC TTCGAGCCGG AACTCCGCAG CCGGCTCGCC
CACCTGGAGC GCGGCGCCAC GGTACGCCAA CGGGTACAGG CCGCCCTGCT CGAACTACTA
CCCAGCGGCC GGGGCACCGT CGACGGCGTG GCCCGTGAAC TGACGATGGG GGCCCGTACG
CTGCAACGTC AGCTGAAGAG CGAAGGCACC AACTTCCAGA CCGTACTCAA CGACACCAGA
CGATCGATCG CCCACCGCTA TCTCAGCGAG GGGAACCTCT CGGTGGCCGA GATCGCATTC
CTCCTCGGCT ACGACGAACC CAGCTCGTTC TACCGTGCGT TTCACGCCTG GACCGGACGC
ACCCCACTCG CCGCCCGAGC AGAACTCGGG TGA
 
Protein sequence
MSINEHYRLD PNARVLLSDL GLSVGNILRR AGLPGDTLSD GPATLTPERF YALWEAVAAE 
AADPGLPIRI GQAISVEAFH PPLFAALCSP NLGVAAARIA TYKALIGPLR LVIATTGEGL
EVELHWPPDH RPPEVLTTTE LVWWVALARL ATRTRVVPVA VTSAQPPSAA SALADYLGVR
VQQTERFTVT FSARDSARPF LTANEPMWEF FEPELRSRLA HLERGATVRQ RVQAALLELL
PSGRGTVDGV ARELTMGART LQRQLKSEGT NFQTVLNDTR RSIAHRYLSE GNLSVAEIAF
LLGYDEPSSF YRAFHAWTGR TPLAARAELG