Gene Information |
Locus tag | Sros_9294 |
Symbol | |
ID | 8672642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 10246809 |
End bp | 10249820 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003344655 |
Protein GI | 271970459 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
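The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is only a sanity check and assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 10246809
end_bp = 10249820

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends in a
# stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # reported: 3012 bp
print(protein_length, "aa")   # reported: 1003 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows.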
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.705577 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTTC ACGGCAGCGA AGACGGGTTG CGCTTCGCCG TGCTCGGCCC TGTACAGGCA TGGCGTGACG GCACCGAACT CGATCTGGGC ACCCCCCTGC AACGTTCCAT TCTGGCGATG CTGCTCCTGC GCGAGAGCCG GGCGGTCACC CCGGCCGAGA TGATCGACGC GGTCTGGGGG GAGGACGCCC CGCCGCGCGC GCTCGGCGCG CTGCGCACCT ACGTCTCCCG GCTGCGGGCC GTGCTGGAGC CCGGCAGATC CCCTCGCACC CGCCCCGAGC TGCTCATCTC GGTGGGCCGG GGCTACGCGC TGCGGCTCGG CGGCGTCCTG GACCTGACCC TGTTCGATCG GGGCGTCCAG GAGGCCGAGG CCGCCCGACG CGCCGGAGAC CGCGCCGGGG CCGCCGAGAG CCTGCGCGCG AGCCTGGCGC TGTGCACCGG TGAGCCGCTG GCCGGGGCCG TCGGCCCCTA TGCCGAGCAT CAGCGCGACC GGCTCGTCGA GCGCCGGATG AGCGTGCTGG AGACCCTGAT GGACCTGGAC CTGGAGCTGG GCCGGCACGC CGACGTCGTC TCCGAGCTGA TCGCGCTGAC CGCCGACCAC CCGCTCCGCG AGCGGCTCCG CGCCCAGCTG ATGCTGGCCT ACTACCGGTG CGGGCGGCAG GCGGACGCGC TCGCGACCTT CGCCGACACC CGTAACGCGC TGATCGAGGA GCTCGGCATC GAGCCGGGAC CCGACCTGGC CGCCCTGCAC CAGCGCATCC TCACCGGCGA TCCGAGCCTC GCCCCCGCCT CGCCCGCCGC CGCCCCGGTA CGGCTCGCGC CCGCCGACCC CCCGGAGCCC GCCCGGCAGC CGGAGGCCCC CGAGCCCGGC GCTCCCGAGC TGCCGCGCCC GGCGCAGCTC CCCGCGGCGG TGAACGACTT CACGGGCCGC CGCCAGATCA TCGCCCGGCT GTGCACCCTG CTGTCCACCC AGGGCAGCGG CGACGGCGTG CCGGTGGCCG CCATCTCCGG CATCGGGGGC GTCGGCAAGA CGACCCTGGC GGTGCACGTC GCCCACGCCC TGCACGACCT GTTCCCCGAC GGCCAGCTCT ACGCCGACCT GCGCGGATAC GGCGAGGAGC CGGTCGCGCC CGAGTCCGTG CTGGCGGCGT TCCTGCGCGC CCTCGGCCTG CCCGCCGACA TCATCCCGGA CGGCCTGGCC GAGCGGTCGG CACTGTTCCG CTCGCTCCTG ACCGACCGGC GGATGCTGGT GCTGCTCGAC AACGCCAGGG ACGCCGCCCA GGTCAGCCAC CTGCTGCCGG GGTCGACCGG CTGTGCGGCG ATCGTGACCA GCCGCGGCAA GCTCGCCGAC CTGGCCGCGG CCCGGCTGGT CGACCTGGAC GTCATGGAGC CGGAGGAGGC GCTGACCCTG TTCGGCACGG TCGCCGGCGC CGAACGGGTG GCCGCGGAGC GTGCCGCCGC CATGGACGTC GTCGCAGCCT GCGGCTTCCT GCCGCTCGCG GTGCGGATCG TGGCGGCGCG CCTGGCCGCC CGGGCCTCCT GGACGGTCGC CTCGCTGGTG CCCCGGCTGG CCGACGAGCG CCGCCGCCTC GACGAGATGC GCGTGGGCAA CCTCGCGGTG GAGGCGACCT TCGCCCTCGG TTACGGACAG CTCAGCCCGG CGCAGGCGCG GGCGTTCCGG CTGCTCTCCC TGCCGGGCGG CCCGGACATC TCGGCCGGGG CCGCCTCCGC GCTGCTCGCG CTGAGCCCGA TGGACACCGA GGACATCCTG GAGTCCCTGG TGGACGCCAG CCTGCTGGAG 
GCTCCCGCCC CCGGCCGCTA CCGCTTCCAT GACCTGCTCA AGCTCTTCGC CCGCCGTACC GGCGAACGGG CCGAGGGGGG CGCGGGGCGG GGCGGGGAGG GGGCGCCCGC GCTGCGCCGG CTGCTCGACT TCTACCTGGC CTCCGCGCGG TCGGCGCACC GGCTCGCCTA CGAGGGCAGC ACGGTCGCCG ACCAGCTCGC GGCGGCCGGG CCGGGTCACG CGTTCGGCTC CGCCGACGAG GCCGTGGCCT GGCTGTCGGT CGAGGCGGAG GCGCTGTTCG CGTCGATCGC CCAGGCGAGC GGCGCCGAGG AGGCCGGGGC CCTGCTGCCC GGCGCCGACC TGCTGCTCGC CATGGAGCCG CTGCTGGAGT CGGGCAGCCA CGCCCGGGAG TTCGACCAGC GGGCCAAGGA GGTGCTGGCC GCCGCCCGGC GGCTCGGCGG CACCTCCAGC GAGCTGCGCT GCCGCTACGT GCTGGGCAGG GTGCTGTTCA ACGCCAACCG GCTGGCCGAG GCGGAGAGCG AGTTCCGGGC GTCGCTGGAC CTGGCCGCCG GGGGCGACCG GATCGTCACC GGCGAGGCGA TGAACGCGCT GGCCGTCGTG GCCGGGCGTC AGCGCCGCCA CGTGGAGGCG CTGGCCTGGT TCGACTCGGC GCGCAGGGTG TTCAGGGAGG TCGGGGCGCG GGGCGGCGAG GCGCTGACGC TCAGCTACTC CGCCCGTGAC CACCTGTTCC TGGGCCAGCA GGAGGAGGCG ATCGCCGCCG CGGAGCAGGG CCTGGCCGTC TTCATCGAGC TCGGCTTCAG CGCCGGCACC GCCAGGGCCC GCTACCACCT GGGGATGATC CTGTCGCGGG TCGGGCGGCT CAACGAGGCG GTCCACCATC ACGCCGAGTG CCTGGCCTTC TTCCGGGCCA GCAAGCAGCG GGTGTGGGAG CAGCGGGTCT GCTCCCGGCT GGCCGAGACG TTCATCACCG CCGGCCGGTT CTCCGACGCG ACCCGCCACG CGGAGCAGGC CCTGACGGTC AGCCGCGAGA TCGGTCATCC GTACGGCGAG GCGCTGTCCC TGTGGGTGCT AGGCAGGGCG CTCGCCGGGC TCGGCAGCAC GGGCCGCAGC CACGACTGCC TCGAACGCGC CCATGACATC TTCGTCAGGC TCGGCGCCCC CGAAGCCGCG GACCTGCGCG CCCTGCTCGA TCGTGATCAC GTCGGCAACT GA
|
Protein sequence | MAVHGSEDGL RFAVLGPVQA WRDGTELDLG TPLQRSILAM LLLRESRAVT PAEMIDAVWG EDAPPRALGA LRTYVSRLRA VLEPGRSPRT RPELLISVGR GYALRLGGVL DLTLFDRGVQ EAEAARRAGD RAGAAESLRA SLALCTGEPL AGAVGPYAEH QRDRLVERRM SVLETLMDLD LELGRHADVV SELIALTADH PLRERLRAQL MLAYYRCGRQ ADALATFADT RNALIEELGI EPGPDLAALH QRILTGDPSL APASPAAAPV RLAPADPPEP ARQPEAPEPG APELPRPAQL PAAVNDFTGR RQIIARLCTL LSTQGSGDGV PVAAISGIGG VGKTTLAVHV AHALHDLFPD GQLYADLRGY GEEPVAPESV LAAFLRALGL PADIIPDGLA ERSALFRSLL TDRRMLVLLD NARDAAQVSH LLPGSTGCAA IVTSRGKLAD LAAARLVDLD VMEPEEALTL FGTVAGAERV AAERAAAMDV VAACGFLPLA VRIVAARLAA RASWTVASLV PRLADERRRL DEMRVGNLAV EATFALGYGQ LSPAQARAFR LLSLPGGPDI SAGAASALLA LSPMDTEDIL ESLVDASLLE APAPGRYRFH DLLKLFARRT GERAEGGAGR GGEGAPALRR LLDFYLASAR SAHRLAYEGS TVADQLAAAG PGHAFGSADE AVAWLSVEAE ALFASIAQAS GAEEAGALLP GADLLLAMEP LLESGSHARE FDQRAKEVLA AARRLGGTSS ELRCRYVLGR VLFNANRLAE AESEFRASLD LAAGGDRIVT GEAMNALAVV AGRQRRHVEA LAWFDSARRV FREVGARGGE ALTLSYSARD HLFLGQQEEA IAAAEQGLAV FIELGFSAGT ARARYHLGMI LSRVGRLNEA VHHHAECLAF FRASKQRVWE QRVCSRLAET FITAGRFSDA TRHAEQALTV SREIGHPYGE ALSLWVLGRA LAGLGSTGRS HDCLERAHDI FVRLGAPEAA DLRALLDRDH VGN
|
| |
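As a quick consistency check between the two sequences above, the first 30 nucleotides of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the database's own procedure. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments with table 1.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; spaces are ignored."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base groups of the gene sequence, copied from the record.
gene_prefix = "ATGGCTGTTC ACGGCAGCGA AGACGGGTTG"
# First 10-residue group of the protein sequence, copied from the record.
protein_prefix = "MAVHGSEDGL"

print(translate(gene_prefix) == protein_prefix)
```

The same function could be applied to the full gene sequence after joining its lines; the final codon TGA translates to `*`, which marks the stop and is not part of the 1003-residue protein.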