Gene Sare_2157 details


Gene Information

Locus tag: Sare_2157
Symbol:
ID: 5705613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2480680
End bp: 2481720
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 71%
IMG OID: 641271642
Product: AraC family transcriptional regulator
Protein accession: YP_001537013
Protein GI: 159037760
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.256833
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00609835
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGAGCGC ACGTCATTGC GGTGGCGGTG ACCGACAACC TGCCTATCTT CGAGCTGGCC 
GTGCCGCAGG AGGTGTTCGG CACCGACCGC CGGGACATCG CCGATCCGTG GTACGACATG
CGCCTGTGCG CGGCCGAACC GGGCCCACTG CGCACCACCG GAGGCGGCTT CCTCAACCCG
TCGTACGGCT TGGACGATCT GGTCGAGGCA GACACCGTGC TGGTGCCGGC GTGCGCGCGG
GCAGCCCAGG TCAACCCACC GGCCGACCTG GTCGAGGCAC TCCGGGTGGC GCACGCGCGG
GGCAAGCGGA TTGTCGGCAT CTGCACCGGC GCGTACGTGC TGGCCGCGGC CGATCTGCTC
GACGGCCGCC GGGCAACGAC CCACTGGATG AACGCCCAGG ACTTCGCGGC CCGGTTTCCC
CTGGTCGACC TCGACCCTCG GGTGCTCTAC GTGGATGAGG GCGACATCCT CACCTCCGCC
GGGACGGCCG CCGCGATCGA TCTGTGCCTG CACCTGGTGT GGCGGGACCA CGGCGCGGCG
ATCGCCCACG AGGTCGCCCG CCGGATGGTC GTGCCCCCGC ATCGGGGCGG TGAGCACACC
CAGTACCCGT CCGCACCAGC GCGGAGCGTG CCCCCCGACG ACCTGAGCGC GGTGCTGGAA
TGGGCCCGCG GCCGGCTTGA CCAGCCACTG ACGGTCAACG ACCTGGCGCG TGCGGCGAAC
CTGAGCCCGC GTACGTTCGC CCGGCGGTTC CGCGACACGC TCGGGACCAC TCCGTTGCAG
TGGCTACTGG AGCAGCGGGT CCGGCTGGCT CAGGAACTGC TGGAGACCAC GGACGAGCCG
GTGGAACGGA TAGCTCACCG CACCGGCTTC GGTACGGGCG CCAACCTGCG CCAGCACTTC
GGTCGGGTCA GCGGGGTGAC CCCCCAGTCC TACCGGCACG TGTTCCGCTA CCGCAACGCC
GCGGCGGCAT CGCCGGTTGT CCACGACACC TCGGAGCATC CGGCGTTGGT GATCGCCCGC
TCGGGCGGCG AGGCGAGGTG A
 
Protein sequence
MGAHVIAVAV TDNLPIFELA VPQEVFGTDR RDIADPWYDM RLCAAEPGPL RTTGGGFLNP 
SYGLDDLVEA DTVLVPACAR AAQVNPPADL VEALRVAHAR GKRIVGICTG AYVLAAADLL
DGRRATTHWM NAQDFAARFP LVDLDPRVLY VDEGDILTSA GTAAAIDLCL HLVWRDHGAA
IAHEVARRMV VPPHRGGEHT QYPSAPARSV PPDDLSAVLE WARGRLDQPL TVNDLARAAN
LSPRTFARRF RDTLGTTPLQ WLLEQRVRLA QELLETTDEP VERIAHRTGF GTGANLRQHF
GRVSGVTPQS YRHVFRYRNA AAASPVVHDT SEHPALVIAR SGGEAR
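
The reported lengths in this record can be cross-checked against the coordinates and sequences. The following is a minimal sketch, not part of the original record: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive (which matches the reported 1041 bp), and the protein-length check assumes an unspliced CDS that ends in a single stop codon.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the record above.
    print(span_length(2480680, 2481720))    # 1041
    print(expected_protein_length(1041))    # 346
    # First line of the gene sequence only, to show the cleaning step.
    first_line = "ATGGGAGCGC ACGTCATTGC GGTGGCGGTG ACCGACAACC TGCCTATCTT CGAGCTGGCC"
    print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence through clean_sequence and then gc_percent gives a value that can be compared with the GC content listed above.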