Gene Sare_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2066 
Symbol 
ID5703277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2365863 
End bp2366879 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content67% 
IMG OID641271552 
ProductAraC family transcriptional regulator 
Protein accessionYP_001536923 
Protein GI159037670 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.623235 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.45327 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAAGAC GACGAGTTGT TGTGGTGGTC TACCCCGAAG TCCAGGCACT GGATGTCACC 
GGCCCGGTGG AGGTGTTCGA CACCGTGAAC CGGTTCCTGC CGGACCCGGC GGCGGGGTAC
CGGATCGAGT ATGTCTCGGC TGCGGCCCCC TTGGTGCGTA CCTCGGCGGG TTTGATGATC
CAGGCTGCGC CGTTGGAGAC AGGCGAAGGG CGAATGGACA ACCTGCTGGT ACCGGGCGGC
TGGGGTCTGG GCCAGGCGCT CGCGGACCAC GATCTGCTGT GCTGGATCCA ACGGGCCGCG
AAACGGGCGC AGCGGGTCAC GTCAGTGTGC GGCGGGTCCT TTCTGCTGGC CGAGGCCGGG
CTGCTCGACG GCCGTCGGGC GACGACACAC TGGGCGTACT GCCAGGACAT GGCCCGGCGA
TATCCGGCGG TGACTGTCGA CCCCGAACCC ATATACGTGT GGGACGGACC CTACGTGACA
TCGGCGGGAG TGACTGCGGG AATCGACATG GCCCTGGCGC TGGTGGAGGC CGACCATGGT
GCCGAGTTCG CCCTGGAGAT CGCCCGCTAC CTGGTGCTCT TCTTCAAACG CGACGGCGGC
CAGCCGCAGT TCAGTGGCAT GCTGGACGCA CAACTGGCTG ACCGGGTGCC GATCCGAACC
GCCCAAGAGT GGGTGCGGGC CCACGTCGAG CACCCACTTC CGGTGCCGGA ACTCGCCGAG
CGGGTGCACA TGAGCCCCCG GAACTTCTCC CGGGTGTTCC GGCGAGAGGT CGGCATGACA
CCCGGACAGT ATGTCGTCCA GACGCGCGTC GGTCGAGCCC GGGAACTGCT GGAGAGTACC
GACCTGTCCA TCAGCCAGAT CGCCCGCCGA TGCGGCTTCG GTCGGGTGGA GACGTTCCTA
CGGACGTTCG ATCGCGCGGT GGGTCTGACG CCGGGAGCCT ATCGGCAACG GTTCCAGGTC
CTGGCACCAG CCGGTCTGCT GATCGAGCCA CCGGTAGCGG CGGGGAGCCA GGGGTGA
 
Protein sequence
MSRRRVVVVV YPEVQALDVT GPVEVFDTVN RFLPDPAAGY RIEYVSAAAP LVRTSAGLMI 
QAAPLETGEG RMDNLLVPGG WGLGQALADH DLLCWIQRAA KRAQRVTSVC GGSFLLAEAG
LLDGRRATTH WAYCQDMARR YPAVTVDPEP IYVWDGPYVT SAGVTAGIDM ALALVEADHG
AEFALEIARY LVLFFKRDGG QPQFSGMLDA QLADRVPIRT AQEWVRAHVE HPLPVPELAE
RVHMSPRNFS RVFRREVGMT PGQYVVQTRV GRARELLEST DLSISQIARR CGFGRVETFL
RTFDRAVGLT PGAYRQRFQV LAPAGLLIEP PVAAGSQG