Gene Sare_5020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_5020 
Symbol 
ID5705157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5689942 
End bp5691186 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content68% 
IMG OID641274413 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_001539754 
Protein GI159040501 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.417569 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000783217 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCAGCGA CCACCCCTGC CGGTTCCCCG CCACCGCACC TGCCGCACAG CACCGGGCGG 
GCGCGAGTGT CCAGCTCCGG CTCCACCGGA CGCGCCCGCC CTGCCGAGCC AGGCTGGTAC
CCCTCGCCCA CCGGACCGGC CGGGGGTGGC CCGGGTGGCC CAGGCGGTCC GGGTGGCCCA
GGCGGTCCGG GTGGCCCAGG CGGTCCGGGC GGCCGTCCCG GTCCCGGCCG ACCGGAGCGG
CGTGGCCCGC GTCCCCGCTG GGGCCGGATC GCCCTGGTGG CCGGGGTCGC CGTGCTGGTG
TTCGCGCTGA TCGCGGGTGC CGGTGCCTGG GTCTACGCCC GTGGCCTCGA CAATGATCTC
GCTCGGACCA ATCCGTTCCC GGAACTGGCC GATGATCGGC CCGTCAAGGC GGTCGACGGC
GCGCTGAACA TCCTCCTCGT CGGCACGGAC TCGCGGGATC CGGACGCCTC GATGGACGAA
CGCGGCAAGT GGCGCGCGGA CACGATCATC GTGATGCACA TCCCCAGCGA TCACCAGAAG
GCATATCTGG TGTCGATTCC CCGCGACCTG TACGTGCCGA TTCCGGAAAG CGCGAGCGCC
GACTGCGGCT CGGGGCAACG GAAGAAGATC AACGCTGCTT TCGCATTCGG TGGACTGCCG
CTGGCGGTCC GCACCGTGGA ATGCTTCACC GACGTCCGGC TCGACCACGT CATGGCGATC
GACTTCGGCG GGTTTCAGGA GGTCACAGAC GCGCTCGGTG GCGTCGACCT CACGGTGGAA
AGGACGATCA CCTCGATCCA CAAGCCCTAC CGGACGTTCA CCGAGGGCAT CAACCACATG
GACGGCGCCG AGGCGCTGGA CTGGATCCGG CAGCGCAAGC AGTTCCCCCG GGGGGACTTC
GACCGGATGC GGCACCAGCA GGAGTTCCTC CGCGCGCTGA TGAACAAGGC GGCCAGCACC
GGAACGCTTA CCAACCCGAT CAAACTGAAC GACTTCCTCA AGGCCGTGAC CGCCGCCGTC
ACCGTTGACG AGGAATTCTC CTTGATCGAC ATGGCTCGCG AGTTTCGCAA TCTGCGCGGG
GAGAACCTGA CTTTCGTGAC CAGCCCGCAC AACGGCAGCC AGACCATCAA CGGCGAATCG
GTCGTGGTCT CCGACCGAGA ACGGGCGCTC GCCATGTACC AGGCCATTTC CCGGGACACC
ATGGCCACCT GGGTCGAGGC GAACAAGAGC AGCGACGACA ACTGA
 
Protein sequence
MSATTPAGSP PPHLPHSTGR ARVSSSGSTG RARPAEPGWY PSPTGPAGGG PGGPGGPGGP 
GGPGGPGGPG GRPGPGRPER RGPRPRWGRI ALVAGVAVLV FALIAGAGAW VYARGLDNDL
ARTNPFPELA DDRPVKAVDG ALNILLVGTD SRDPDASMDE RGKWRADTII VMHIPSDHQK
AYLVSIPRDL YVPIPESASA DCGSGQRKKI NAAFAFGGLP LAVRTVECFT DVRLDHVMAI
DFGGFQEVTD ALGGVDLTVE RTITSIHKPY RTFTEGINHM DGAEALDWIR QRKQFPRGDF
DRMRHQQEFL RALMNKAAST GTLTNPIKLN DFLKAVTAAV TVDEEFSLID MAREFRNLRG
ENLTFVTSPH NGSQTINGES VVVSDRERAL AMYQAISRDT MATWVEANKS SDDN