Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_5020 |
Symbol | |
ID | 5705157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5689942 |
End bp | 5691186 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641274413 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_001539754 |
Protein GI | 159040501 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.417569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000783217 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAGCGA CCACCCCTGC CGGTTCCCCG CCACCGCACC TGCCGCACAG CACCGGGCGG GCGCGAGTGT CCAGCTCCGG CTCCACCGGA CGCGCCCGCC CTGCCGAGCC AGGCTGGTAC CCCTCGCCCA CCGGACCGGC CGGGGGTGGC CCGGGTGGCC CAGGCGGTCC GGGTGGCCCA GGCGGTCCGG GTGGCCCAGG CGGTCCGGGC GGCCGTCCCG GTCCCGGCCG ACCGGAGCGG CGTGGCCCGC GTCCCCGCTG GGGCCGGATC GCCCTGGTGG CCGGGGTCGC CGTGCTGGTG TTCGCGCTGA TCGCGGGTGC CGGTGCCTGG GTCTACGCCC GTGGCCTCGA CAATGATCTC GCTCGGACCA ATCCGTTCCC GGAACTGGCC GATGATCGGC CCGTCAAGGC GGTCGACGGC GCGCTGAACA TCCTCCTCGT CGGCACGGAC TCGCGGGATC CGGACGCCTC GATGGACGAA CGCGGCAAGT GGCGCGCGGA CACGATCATC GTGATGCACA TCCCCAGCGA TCACCAGAAG GCATATCTGG TGTCGATTCC CCGCGACCTG TACGTGCCGA TTCCGGAAAG CGCGAGCGCC GACTGCGGCT CGGGGCAACG GAAGAAGATC AACGCTGCTT TCGCATTCGG TGGACTGCCG CTGGCGGTCC GCACCGTGGA ATGCTTCACC GACGTCCGGC TCGACCACGT CATGGCGATC GACTTCGGCG GGTTTCAGGA GGTCACAGAC GCGCTCGGTG GCGTCGACCT CACGGTGGAA AGGACGATCA CCTCGATCCA CAAGCCCTAC CGGACGTTCA CCGAGGGCAT CAACCACATG GACGGCGCCG AGGCGCTGGA CTGGATCCGG CAGCGCAAGC AGTTCCCCCG GGGGGACTTC GACCGGATGC GGCACCAGCA GGAGTTCCTC CGCGCGCTGA TGAACAAGGC GGCCAGCACC GGAACGCTTA CCAACCCGAT CAAACTGAAC GACTTCCTCA AGGCCGTGAC CGCCGCCGTC ACCGTTGACG AGGAATTCTC CTTGATCGAC ATGGCTCGCG AGTTTCGCAA TCTGCGCGGG GAGAACCTGA CTTTCGTGAC CAGCCCGCAC AACGGCAGCC AGACCATCAA CGGCGAATCG GTCGTGGTCT CCGACCGAGA ACGGGCGCTC GCCATGTACC AGGCCATTTC CCGGGACACC ATGGCCACCT GGGTCGAGGC GAACAAGAGC AGCGACGACA ACTGA
|
Protein sequence | MSATTPAGSP PPHLPHSTGR ARVSSSGSTG RARPAEPGWY PSPTGPAGGG PGGPGGPGGP GGPGGPGGPG GRPGPGRPER RGPRPRWGRI ALVAGVAVLV FALIAGAGAW VYARGLDNDL ARTNPFPELA DDRPVKAVDG ALNILLVGTD SRDPDASMDE RGKWRADTII VMHIPSDHQK AYLVSIPRDL YVPIPESASA DCGSGQRKKI NAAFAFGGLP LAVRTVECFT DVRLDHVMAI DFGGFQEVTD ALGGVDLTVE RTITSIHKPY RTFTEGINHM DGAEALDWIR QRKQFPRGDF DRMRHQQEFL RALMNKAAST GTLTNPIKLN DFLKAVTAAV TVDEEFSLID MAREFRNLRG ENLTFVTSPH NGSQTINGES VVVSDRERAL AMYQAISRDT MATWVEANKS SDDN
|
| |