Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3343 |
Symbol | |
ID | 8666631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3660225 |
End bp | 3661274 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | LacI family transcription regulator |
Protein accession | YP_003339025 |
Protein GI | 271964829 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.716413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0750883 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTATA GCGGCGGCCC GCCGGGCAAC GGCGCGGCCC GGCTGACCGA CATCGCCGCG CAGGCGGGGG TGAGCGAGGC CACGGTCAGC CGGGTGCTCA ACGGCAAGCC GGGCGTCTCG GCCGTCACCC GGCAGGCCGT CCTGGCCGCG CTGGACGTCA TGGGCTACGA GCGGCCGCAG CGGCTGCGCC AGCGCAGCAA CGGGCTGATC GGCCTGGTCA CGCCGGAGCT GGACAACCCG ATCTTCCCGG CGTTCGCCCA GGCCTTCGAG AAGGCGCTGA CCCAGCACGG CTACACCCCG CTGCTGTGCA CCCAGCTCCC CGGCGGAGCG GTGGAGGACG AGTTCACCGA GCTGCTCGTG GAGCGCGGGG TCAGCGGCAT CATCTTCGTC TCCGGGCTGC ACGCCGACAT CACTGCGCGC TCCGACCGCT ACACCCAGCT CATCGGGCAG GGCGTGCCCA TCGTCCTGCT CAACGGGCAC GCCGGCGACG TCCCGGCGCC GTTCATCTCC CCGGACGACC GGGCCGCCGC GCGGCTGGCC GTACAGCATC TGGTGGATCT CGGGCACGAG CGGATCGGCC TGGCCGTCGG CCCCGGCCGG TTCGTGCCGG TGATCCGCAA GATCGAGGGT TACCGGCAGG CGATGGCGCA GTTGCTGGGG GCGGGCGAGG TGGATGAGCT GATCTCGCAT TCGCTGTTCT CGGTTGAGGG GGGTCAGGCG GCGGCGGCGC AGTTGCTGGA GCGGGGGTGC ACGGGCATCG TGTGCGCCTC GGATCTGATG GCGCTGGGGG CGATCCGGGC GTGCCGGGAT CGGGGGTTGT CGGTTCCGGC GGACGTGTCG GTGGTGGGGT TCGACGACTC GCCGCTGATC GCCTTCACCG ACCCGCCGCT GACCACCGTG CGCCAGCCCG TCCAGTCGAT GGTGACCGCC GCGGTGCACA CCCTGCTGGA GGCCGTCTCC GGCGCGCCCA TGCAGCACTC CGAGCTGATC TTCCAGCCGG AGTTCATCGT GCGGGGCTCG ACGGGCTCGG GCCCGAAGAT CCTCCGCTGA
|
Protein sequence | MPYSGGPPGN GAARLTDIAA QAGVSEATVS RVLNGKPGVS AVTRQAVLAA LDVMGYERPQ RLRQRSNGLI GLVTPELDNP IFPAFAQAFE KALTQHGYTP LLCTQLPGGA VEDEFTELLV ERGVSGIIFV SGLHADITAR SDRYTQLIGQ GVPIVLLNGH AGDVPAPFIS PDDRAAARLA VQHLVDLGHE RIGLAVGPGR FVPVIRKIEG YRQAMAQLLG AGEVDELISH SLFSVEGGQA AAAQLLERGC TGIVCASDLM ALGAIRACRD RGLSVPADVS VVGFDDSPLI AFTDPPLTTV RQPVQSMVTA AVHTLLEAVS GAPMQHSELI FQPEFIVRGS TGSGPKILR
|
| |