Gene Sare_3970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3970 
Symbol 
ID5705247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4508499 
End bp4509545 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content72% 
IMG OID641273395 
ProductLacI family transcription regulator 
Protein accessionYP_001538751 
Protein GI159039498 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.149546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACACGGA TCGACGACGT TGCCCGGCTG GCCGGAGTCT CGACCGCCAC TGTCTCCCGG 
GCGCTACGCG GGCTCCCGAC GGTCTCGGCG GCGACGCGGC ACCGGGTTCT AGCCGCCGCC
GAACAACTCC AGTACACCGT CTCACCGAAC GCGTCGCGGC TGGCCGGCGG GCGTACCGGC
ACGGTCGCCG TGGTCGTTCC CCGGATCACC CGTTGGTTCT TCGGAGTCGT CGTCGAGACG
GTCGAGGACT TCCTCCACCG AGGCGGCTAC GACCTGCTGC TGCACAATCT CGGCGGGCGG
GAGCGGACCC GACAGCGGGT GCTGCGTACC GCCGACCTAC ACAAGCGGGT CGACGGAATC
ATCCTGGCGG CCACCCCACT GCGGGCACCC GAGCTGGCCT TCCTGTCTGC GCTGGACCTG
CCCGGGGTCA TCGTCAGCTC CGGCACGAAC GTGCCCGGCT GGCCGTGCGT ACGCATCGAC
GACGTCGCCG CCGCGCGGAC CGCCACCCGC CACCTGCTCG ACCTCGGACA CCGGCGGGTC
GCGCACATCT CCGGCGACCC CGACGACGAA CTCGCGTTCA CCGCCCACCT GGACCGGCGG
CGCGGCTACC GGGAGGCGCT GCGCTCGGCG GGCATCCGAC CCGACCCGAG TCTCGACATC
GAATCCCGGT TCGACGTCGA CGGCGGTATC CGAGCCACCG AGGAGTTACT GCGTCGGGGC
GACCCTCCCA CCGCGATCTT CGCCGCCTGC GACGAGATGG CGATGGGGGC ACTGACCGCG
CTGCGGGACG CCGGGCTGCG GGTGCCGGAC GACGTGAGCG TGATCGGCAT CGACAACCAC
TACCTGGCGG GTGTGCTCGG ACTGACCACC GTCGCCCAGT CCCCGACCGA CCAGGGGCTG
ATCGCCGCGA AGACCCTGCT CGGTGCACTG ACCGGTCGGC CTGCCGACCC ACTTCGCGCC
GCGGACGGGC CGGTGGTCCT GCCCACCCGA CTGGTCGTCC GGGAAACGAC CGCGCCGCCA
CGAACGCCGG AGTCCGCCGC CCGATGA
 
Protein sequence
MTRIDDVARL AGVSTATVSR ALRGLPTVSA ATRHRVLAAA EQLQYTVSPN ASRLAGGRTG 
TVAVVVPRIT RWFFGVVVET VEDFLHRGGY DLLLHNLGGR ERTRQRVLRT ADLHKRVDGI
ILAATPLRAP ELAFLSALDL PGVIVSSGTN VPGWPCVRID DVAAARTATR HLLDLGHRRV
AHISGDPDDE LAFTAHLDRR RGYREALRSA GIRPDPSLDI ESRFDVDGGI RATEELLRRG
DPPTAIFAAC DEMAMGALTA LRDAGLRVPD DVSVIGIDNH YLAGVLGLTT VAQSPTDQGL
IAAKTLLGAL TGRPADPLRA ADGPVVLPTR LVVRETTAPP RTPESAAR