Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4296 |
Symbol | |
ID | 8667590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4796436 |
End bp | 4799426 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003339928 |
Protein GI | 271965732 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.300157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGCTCG GAGGCATGCG CCAGCAACGT GTCATGGCCA CGCTACTGCT GGAATCGAAC AGGATCATTC CCGTCTCCCG CCTGGTCCAG GCGACCTGGG GGGAGGCGGC CCCCGCTACA GCGCACCATC AGATCCGGAA AATGGTCGCC GATCTACGCA ATCGCCTGGG CGGTGGCGGG CAGAGCATCG TGACGGACGG CCCCGGCTAT CGCCTGGTGG TCGAAGAGGA CCAGTTGGAT CTGAGCCTTT TCGATATCCG TACACGGCGA GCCGGCGAGG CCGAGAAGTC CGGACTACCC GTCAATGCGA TGACGGAGCT GCAGGAAGCG CTCGACCTGT GGACGGGGCA GGCGCTGAGC GGAATCGACG GCCCGGTGAT CCGGGCGGCT GCCGAGTTGC TCGACGAACG CCGGGTCGCT GTGACGGAGC GGCTGATGGA TCTCCGTCTT ACCCATGGTC ACGCGCGTGA TCTGATTTCC GACCTGCGTG CCCTGGTGTC CGAGCATCCA TTACGGGAAA CGCTGTGGAC CCAGCTGATA TTGGCCCTGT ACCGGTCGAA CCGTCATGCT GAGGCTCTGC GGACCTACGA AGACGTCCGC TGCCTCCTGG CCGACGAGCT CGGGGTCGAT CCGGGGATCG AGTTGATGAA GTTGCATGAG CTGATCCTCA GAAACGATCC GACGCTGGAT TGCCCGTCCC GCCGGACGAA GAGCGGCATC GGCGTTCCCG TGATGGTCAG ACCTTGCGCC CTGCCGTACG ACGTTCCCGA CTTCACAGGT AGGGAGGCCG AGCTCCAAAG GCTGACGAGT ATGGTGGCCG AGAACGCCGA TCGGACGGCA CTGATCGTCA CCATCGACGG GATGGCGGGT ATCGGCAAGA CCGCATTCGC GGTCCACGCC GCCCACAAGC TCGCTCGGCT CTTCCCCGAC GGCCAGCTCT TCGTGGACTT CAACGGCTTC ACACCGGGTC GGAAACCCAT CCCGGTGGCC GAGGCGATCG CCACCCTGCT CTCCACCCTG GGAGTTCCGG ACGGCGAGAT ACCGCATGAC CTGCAAGGCA GGATCGCGAT GTGGCGCATG AGGACGGCCG GCCGGCGCCT GTTACTCCTG CTCGACAACA CGGCGGACGC GGCTCAGGTG CTTCCGCTCC TGCCCGGCAT GCCGGGGTGC GTGACGCTGA TCACCAGCCG TGCGCCCCTG TCCGGAGTGG ATGGCGCGGT TCTCCAGTCT CTGGAGCTTC TCACCCCGGA CGAGAGCCGT GCGCTCCTGG AGCGCGTAGT CGGGACGATG CGGCTGGCCA CCGAGAGAGA GACCGTAACG ACGCTGATAG AGATGTGCGG GCGCCTGCCC CTTGCCCTGC GTATCGTCGC CGCCCGGCTC AACAACCGTC CACAGTGGAG TGTCGCCCAC ATGGTCGACC GCCTGGGGAA CGAACGGCGG CGGCTGAGCG AGCTGGTTGT GGGGGACCGG AGCGTCCACG CGGCGATCGC GCATTCCTAC GGTTCGCTGA GGCCGGATCA GCAACGGCTC TTCCGCCTGT TGGGGCTGCA CCCCGGGCAT GACTACGACG CCTATGCCGC TGCGGCGCTG GCGGGGATAC CTGTCGACGA GGCGGAGTCG TTACTCGAGG GTCTACTCGA CTCCAGGCTT CTGGTCCAAC GCGAGGTCGG TCGCTACAAC TACCACAACC TGGTCCTGAG TTATGCCCGC GACGCCGCAT CCTCCCACGA AACCGAGCAC ACCGGCACGC GTGCGATCCA TCGCCTCCTC GACTACTACC TGCATACGGC GGAGGCGGCC GCGAACCTCC TGGATCTCGG CCGCCGGCAG GTAGCGCTCT GCCTGAGCTG CCCTCCGTCG CATGTTCCGC CCTTGGCCGA CGGGAAAGCA TCCCTGCGGT GGTTCGACAC CGAGCGCCAC AACCTCCGCA TGGCGGTGGA AGAGGCCCAG CGCCAGGGCC TGCATGGTCA CTGCTACCAC CTGCCTCATA TGATGGCGCA CTACCTCCAG CTCAGGGGGT GTTCCGACGA TCAGCTGACT CTCCTGAAGA CGGCTTTCGA CGCCGCCGCC CAGCTCAATG ATCACCATGC CCAGGGGCGC ACTCTGCTCA ACCTCGCCGT GTCGGACTGG TTCTACGGCA GGTTCAGCGA GGCTCTCGAC AGGGCTGTTC AGGCCCTGGC CGTGGCCGAG AAGCTGGGAG ATGCGGATTT CCAGGGTGCC TGCCTGAGCC GGATCGGTAT CTTCCACGCG GCCTTGGGAC GCTACGAGGA GGCCCTGCAC CACTACAAAT GCGCGCTGGA CATCCACGAA GGTCATGGGA ACTGGAGCGA GGTGAGCATG ACGCTGGTGA GCACGAGCTC GACCAAGGCA ACGCTCGGGC TCTTCGATGA CGCGCATCGT GACGCCTGTC GCGCGGTCTC CATCGATCGG CGTTCCGGCT ACCGGAACGG AGAAGTCATG GGGCTGCTCG CCATGTCAAC CGCCCAAGCG GGTATGAACG ATCTCGAAGG AGCGGCCTCG TCCCTGAACA CCGCGTACGA GCTCGTGCAC ACGGACGAGA TGCCCGGTTA CGAGGCCGCC GTTCTCGTGC AGCAGGGCCA CCTGCATCGA CGCCTCGGGC AACTGGACGA GGCTACGGCG GCCGGCCGGC GCGCCATGGA AATCCTCGGC ACGACCCGGC GCTCCGTCAC CGCCATTCGC GGGCAGAACC TCCTCGGTTC GATCGACTGT GATCGTGGTG ACTACATCCT CGCTTTCGAA CGGCACAGGT ACGCCCAGCG GCTCGCTCTG CGCTCGGGCC ATCGCCTCGA GACCGCGCGT GCGCTCGACG GGATCGCTCG GGCCTTGGCC GGGCTGGGCC AGTGTGAGGC GGCGAGGACG GTGTGGCGAG AAGCCCTCGA TCACTTCGAG GAGATGGGGA CGCACGAGGT GTTCGCGGTG CGCAGGAGGC TGGCCCGACA GGTGAGCAGC GTACCCGCGC CGCATCCGTA G
|
Protein sequence | MPLGGMRQQR VMATLLLESN RIIPVSRLVQ ATWGEAAPAT AHHQIRKMVA DLRNRLGGGG QSIVTDGPGY RLVVEEDQLD LSLFDIRTRR AGEAEKSGLP VNAMTELQEA LDLWTGQALS GIDGPVIRAA AELLDERRVA VTERLMDLRL THGHARDLIS DLRALVSEHP LRETLWTQLI LALYRSNRHA EALRTYEDVR CLLADELGVD PGIELMKLHE LILRNDPTLD CPSRRTKSGI GVPVMVRPCA LPYDVPDFTG REAELQRLTS MVAENADRTA LIVTIDGMAG IGKTAFAVHA AHKLARLFPD GQLFVDFNGF TPGRKPIPVA EAIATLLSTL GVPDGEIPHD LQGRIAMWRM RTAGRRLLLL LDNTADAAQV LPLLPGMPGC VTLITSRAPL SGVDGAVLQS LELLTPDESR ALLERVVGTM RLATERETVT TLIEMCGRLP LALRIVAARL NNRPQWSVAH MVDRLGNERR RLSELVVGDR SVHAAIAHSY GSLRPDQQRL FRLLGLHPGH DYDAYAAAAL AGIPVDEAES LLEGLLDSRL LVQREVGRYN YHNLVLSYAR DAASSHETEH TGTRAIHRLL DYYLHTAEAA ANLLDLGRRQ VALCLSCPPS HVPPLADGKA SLRWFDTERH NLRMAVEEAQ RQGLHGHCYH LPHMMAHYLQ LRGCSDDQLT LLKTAFDAAA QLNDHHAQGR TLLNLAVSDW FYGRFSEALD RAVQALAVAE KLGDADFQGA CLSRIGIFHA ALGRYEEALH HYKCALDIHE GHGNWSEVSM TLVSTSSTKA TLGLFDDAHR DACRAVSIDR RSGYRNGEVM GLLAMSTAQA GMNDLEGAAS SLNTAYELVH TDEMPGYEAA VLVQQGHLHR RLGQLDEATA AGRRAMEILG TTRRSVTAIR GQNLLGSIDC DRGDYILAFE RHRYAQRLAL RSGHRLETAR ALDGIARALA GLGQCEAART VWREALDHFE EMGTHEVFAV RRRLARQVSS VPAPHP
|
| |