Gene Information |
Locus tag | Sros_5783 |
Symbol | |
ID | 8669077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 6333915 |
End bp | 6336980 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003341272 |
Protein GI | 271967076 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
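The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length (the listed gene sequence ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 6333915
END_BP = 6336980
GENE_LENGTH_BP = 3066
PROTEIN_LENGTH_AA = 1021


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the table is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The strand field does not affect these checks, since lengths are the same on either strand.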
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0303431 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.017876 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGTTG ATCAGCCCAC ATTCAGCGTG CTCGGTGCCC TGGACGTCCG TGCCTCGGGA CGGCCCCTCC GCATCGCCGG CACGAAACCA CGAATCCTGC TGGCATCGCT GCTGCTCCAC GCGAACCACG TGGTCGGCGC GGACCTCCTC GTCGAGGTGC TGTGGCCCCG GCACCGGCCG CGCTCCGCCC ACGCCAACCT CCGCACCTAC GTCAGCTCGC TGCGCGGCGT GCTCGACGCG GCCGGCGCCC GGATCCAGGC CCGCCCGCCC GGCTACGTGA TCGAGCTGGC ACCCGGACAG CTCGACGCGC TGCTCTTCGC CGACCTGATC GCGCGGGCGC GCGCGGCGGG CCGTACGGAG GAGGCCTTCG ACGGGCTCAG CCGGGCGCTC GGGCTGTGGC GCGGCACCCC CCTCGCCGAC CTGCCGGGCA GCCCGCTCTG GGACGGGCGG CTGCAGTCGC TGGCGGAGCT GCGGCTCGGC GCGGCCGAGG AGCTGATCGC CCTGAGAATG GCCCGGGGCC GGTACGCCGA CGCGATCGGA GAGCTGCGCG GGCTGCTGAA GGAGCATCCC TTCCGCGAGG ACCTCTGGCA GCGGCTCATA CTCGCCCTGC ACTGGAGCGG CCGGCAGGCC GAGGCCCTGC ACGCCTACGC CACGGTCAGG CGGCAGCTGG TCACCGAGCT CGGCATCGAA CCCGGCACGG ATCTGCGCCG GGCGCACGCC GCCGTCCTGG CGGGGGAGCT CCCGCCCGCC GCGACCCCGC CCGATCTCCC CCCGCCCGCC GCCGAGACCG TCTCCGGCCC GCCGTCCCGC GCCGAGACCG TCGCCGGCCC GCCGCCCCCC GCCGAGACCG TCCCGCACCA CCTGCCGGCC GTCCCCGCCC CCGGCTCCAC TCCTCACCAG CTCCCGTCGG ACATCCCGGA CTTCACCGGC AGGTCTGAGG ACGTCGCCGT CCTCACGCGG GCGCTGTCAC CGGTGGAACG GCCGCCGGAC GGACCGCCGT CGATCGTGGT GGTGGTGGGC CCGCCGGGCG TGGGCAAGTC GGCGCTGGCC GTGCACTGCG CGCACGCTGT ACGGACCGGG TATCCGGGCG GGCAGCTCTA CCTGGACCTC GGCGGGACCG AGTACGCGCC GGCCGACCCG GGCGAGCTGC TGGCCGAGGC GCTGCGGGCG CTGGGGGTAG GCGAGGCCGG CCTGCCGTGC ACCGTGCGCG AACGCTCCGC GCTGTACCGG TCCCTGCTGG CCGAACGCCC GATGCTCGTC CTCCTCGACG ACGCCGCCGG CGCCGCCCAG GTACGGCCGC TGCTGCCCGG CAACGGCTGC GCGGTGCTCG TGACCAGCAG GCGGCGGATC ACCGAACTGC CCGGCGCCCT CCAGCTCGAA CTGGACGTCC TGTCACCGGA GGAGGCCGAG GAGCTCCTGG GCAGGATCGT CGGCTCCGAG CGGCTGGGAC GGGAGAGGGA GGCGGCCTCG GCGATCCTGC GCGCCTGCGG CTACCTGCCG CTCGCCGTCA GGGTCGCCGG AGCGCGGCTC GCGGGACGGC CCAGGTGGTC GCTGGGGGTG CTGCGGCAGC GGCTGGAGGA CGAGGCGGGC AGGCTCGGCG AGCTGCGGGC CGGCGATCTG GAGGTGCGGG GCAGCTTCGA CCGGAGCTAC CGGCTGCTGC CCGACGACGC GGCCCTGGCC TTCCGGGCAC TCGGTCTCCT GGGACCGCAG TCCCTGCCCG GCTGGGTGGT CGACGCCGTG CTGGACCGGC ACCGGGCCGA CGACGTGACG GACGTCCTGG TCGATGTGAA CCTGCTCCAG 
CTGGTCGGGA CCGACCCGAT CGGCCAGCCG CGCTACCGGC TGCACGACCT GGTCCGCTGC AACGCCAGGG AGAAGGCCGG CGGCGCCCCG GAGCGGCACG CCCTCATCAG GGTGCTCGGA GCGTGGATGG CCACCACCGA GAGTGCCACG GCACGGTTGC CGACCACGCT CTTCAGCCTG ACATCGGCCA GGGCGACCCG CTGGAACCTG GCGGAGGACA CCCTCGGACG GCTGACCGCC GACCCGCTGT CGTGGTTCGA CGTCGAGCAC GAGGCGCTCG TGGGCGCGGT GCGTCTGGCC GTCGACGCCG GGCTGGCCGA GCCGGCGTGG GGGCTCGCCG CGGCCCTCGT CCCCTACTTC GACCTGCGCT GCCACTTCGA GGAGTGGCAG TCCACCCACC GGATCGCCCT GGACGCCGCC CGCCTGGCGC AGGACCGCTA CGGTGAGGCG GCCATGCTCC GGGGTCTGGC CCAGGTATGC CTCTACCAGG ACCGCTACGC CGAGGCGACG GAGATGTTCC GGCGATCCCT CACGATCTTC CACGAGCTGG GCGACGTGCG GGGCGAGGCG ACCTCGATCT GCGGGCTGGG GGCGGTCAAC CAGTTCTGCG GCGAGCACCT CAGGGCACTG GCCTACTTCC GGCGGGCCCT GGCCATGTTC CTCGCCATGG GCGACCAGAG CGGTGAGGCC TACGCGCGGC AGGCCATCGG GCGCGTCTGC CTGGCCTCCG GCGATCCCGG CCAGGCCTCG AAGTGGCTGG GCGAGGCGCT GCGGCTGGCC AGGGAGCTCG GCGACTCCCA CCGCGAGGGC TGCGTGTCCA TGCAGTTCGG GCGGCTGCAC GATCTGGCGG CGGAGCCCGA GCGGGCGATG CGGTTCCAGG GACACGCGCT GGACATCTTC GAAGGCCTCG GCGACCTCCA CTGCGGGGCT TACGCCATGC AGAGTCTGGG CGGCCTCCAG GTGGTCCGCG GCGACCAGTC GCACGCCTCC GACCAGCTGG AGCGGTCACT GCTGATCTTC CAGCGGCTCG GCGACCGGAG CGGGGAGGCG GCCACGGTCC AGAAGCTCGG CGAGCTGCAC CGGTCGGCGG GCCGTACCCG CCTGGCGCAG GACTACCTCC ACCACGCCCT CGCGCTCCGG CGCGAGCTGC GGGGAGGCGC CGACCTCGCC GGCGTCGCCG AACCGTCCGC CCCCTCCGGC GACGGGTGGT GGAGCCGGGA CGCCGTGCCA CCACCGCTCG TCCTCGACGG CAACGCCGTG GAGTGA
|
Protein sequence | MVVDQPTFSV LGALDVRASG RPLRIAGTKP RILLASLLLH ANHVVGADLL VEVLWPRHRP RSAHANLRTY VSSLRGVLDA AGARIQARPP GYVIELAPGQ LDALLFADLI ARARAAGRTE EAFDGLSRAL GLWRGTPLAD LPGSPLWDGR LQSLAELRLG AAEELIALRM ARGRYADAIG ELRGLLKEHP FREDLWQRLI LALHWSGRQA EALHAYATVR RQLVTELGIE PGTDLRRAHA AVLAGELPPA ATPPDLPPPA AETVSGPPSR AETVAGPPPP AETVPHHLPA VPAPGSTPHQ LPSDIPDFTG RSEDVAVLTR ALSPVERPPD GPPSIVVVVG PPGVGKSALA VHCAHAVRTG YPGGQLYLDL GGTEYAPADP GELLAEALRA LGVGEAGLPC TVRERSALYR SLLAERPMLV LLDDAAGAAQ VRPLLPGNGC AVLVTSRRRI TELPGALQLE LDVLSPEEAE ELLGRIVGSE RLGREREAAS AILRACGYLP LAVRVAGARL AGRPRWSLGV LRQRLEDEAG RLGELRAGDL EVRGSFDRSY RLLPDDAALA FRALGLLGPQ SLPGWVVDAV LDRHRADDVT DVLVDVNLLQ LVGTDPIGQP RYRLHDLVRC NAREKAGGAP ERHALIRVLG AWMATTESAT ARLPTTLFSL TSARATRWNL AEDTLGRLTA DPLSWFDVEH EALVGAVRLA VDAGLAEPAW GLAAALVPYF DLRCHFEEWQ STHRIALDAA RLAQDRYGEA AMLRGLAQVC LYQDRYAEAT EMFRRSLTIF HELGDVRGEA TSICGLGAVN QFCGEHLRAL AYFRRALAMF LAMGDQSGEA YARQAIGRVC LASGDPGQAS KWLGEALRLA RELGDSHREG CVSMQFGRLH DLAAEPERAM RFQGHALDIF EGLGDLHCGA YAMQSLGGLQ VVRGDQSHAS DQLERSLLIF QRLGDRSGEA ATVQKLGELH RSAGRTRLAQ DYLHHALALR RELRGGADLA GVAEPSAPSG DGWWSRDAVP PPLVLDGNAV E
|
| |
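The gene sequence above is printed in space-separated blocks of 10 bases, so it must be joined before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that step, using hypothetical helper names; it is run here only on the first three blocks of the listed sequence, not on the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence, for illustration only.
first_blocks = "ATGGTGGTTG ATCAGCCCAC ATTCAGCGTG"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 3))
```

Applying the same two functions to the complete Gene sequence field would give values to compare against the 3066 bp and 75% entries in the table.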