Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3432 |
Symbol | |
ID | 8666720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3775618 |
End bp | 3777945 |
Gene Length | 2328 bp |
Protein Length | 775 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003339112 |
Protein GI | 271964916 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00818273 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.368034 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACA TACGGTTCCG CCTGCTTGGT CCGCTCCGCG TGTGGAGGGG AGAGACCGAA GTCAAGATCG GCTCGGACAA GCAACGGGCC GTGCTGGCCC TGCTCCTGCT CCGGGCCGGC TCGCCGGTCA GACGTCAGGA GATCATCGAC ACGCTCTGGG GCGACGACAC CCCCGAGTCG GTGGTCAACC TGGTACAGAC CTACGTGGGA AGGCTGCGGC GCCAGATCGA TCCGGGCAAG GGCGCCTATT CGGCCTCGAC CTGGCTGGCG GGGATGGGCA CCGCCTACGT GGTCCGGCTC GACCGGTGCG ACGTGGACCT GGTCCGCTTC CGCGCGGGGG TGGCGGGGGC CCGCTCGGCC ACCTCCCCCG AGGAGTCCCT GGCGCTGCTG CTCTCCGCGC TGCGGATGTG GAACGGGCCA TGTCTGGCCG ACCTCGACCA CGTGCTGCGC GGTCACCCTT GGGTCCGCGC CATCGAGCAC GAGCGGATCG ACACGCTGCT GGACGCCGCG AAGACCGCGC AGCGGCTCGG CCGGTCGGCC GACGTCATCC CGCAACTGCG CGCGGTCGCC GCCGCCGAAC CGCTGAACGA GGCCGTCCAC ACCGCGCTCG TGCTCGCCCT GGCCGCGTCC GGGATGCAGG CCGAGGCACT GGCCGAGTAC GGGCTCATCC GCCTCAGGCT GGCCGAGGAG CTGGGCGTCG ACCCCGGATC CCAGCTGCGC GAGGCGTACT TCCAGGTACT GCGCCAGGAG ACCCGCTACG AGGGCGCCGG CCCCGCCGAG CCGCCCTGCC CGTCCCTGCT GCCCGCCGAC ATCGCCGACT TCACCGGCAG GGACAAGCTG GTCGAGCAGC TGAGCGGTCT GATCGCCGAC CGCAGGCCTG GACCGATCCC GGTGTCGACC ATCACCGGCA GGGCGGGCGT CGGCAAGTCG ACGCTGGCCG TCCACCTGGC GCACCGCATG ATCGGCGACT TCCCCGGCGG CCAGCTCTAC GCCGACCTTC GCGGCTCCGC CGAGCAGCCG GCCGATCCCT CCCGGGTGCT CACCCGGTTC CTGCGCTCGC TGGGCATCAG CGGCCAGGCG ATCCCCGAGG ACGCGGACGA GCGCGCCGAG CTGTACCGCA CGCAGCTCGC CGGCCGCCGT GTCCTCGTCG TGCTGGACGA CGCCGCCGAC CAGGCCCAGG TACGGCCGCT GCTGCCCGGA TCGCCCTCCT GCTCCGTCAT CGTGACGAGC CGGTCCCGGA TGGCCGGATG GCCCGGCGCG CACGCCGTCG ACCTGGACCT GCTGGAGCCT CACCACGCCG GCGACCTGCT CGCGGTGATC GTCGGCGCGG AGCGTGTCGC GCCCGAGCCC GAGGCCGCCA CCGAGCTCGT CCGGCTCTGC GGCCGGCTGC CGCTGGCCAT CCGGGGCGCC GCCACCCGGC TCGCCGCCCG CCCACACTGG ACGCTCGCCA GGATGGCCGG CCGGATGGCC GACGAACGGC ATGGCCTCGA CGAGCTCTCG GACGTGCGGG CCACCCTCGC GCTCGGCTAC CGCAGGCTCG ACGGGCCGGC CCAGCGGGCC CTGCGCCTGC TCGGGCTGCT GGACCTGCCG ACCTTCGCCC CGTGGCTGGT CGCGGGGGTG CTGGAGGCGT CCACGGAGTC GGCCGAGGAC CTCATCGACG CGCTCGCCGA CGCCTACTTC CTGGACACCG CAGGGGTCGA CGCCGTGGGC CAGCCCCGCT ACCGCTTCCA CGAGCTGGTG CGCCGCTACG CCCGGGAGCT GGCGCTCAGG GAGGAGAGCG AGGCCACCGT CAGCACGGTC GTCATCCGCG CCCTGGCCAT CCTGCTCGCG CTGGCCCAGG ACGCCGACGG CCGCCTGCCG TACACTGTCC GGGCGCCGCT CTACGGGCGG TCGCCGCGCT GGCCGCCGCC GGCGGCCGTC CGCGAGCCGC TGCTCGCCGA CCCGCTGGCC TGGTTCGACA GCGAGCGGTC CTGCCTGGTC GCCGCGGTCC TGCAGGCGTC CGACCTGGGC CACGACGAGC TGGCCTGGGA GCTGGCCGCC GCCACGCTGA ACGCCGCGAT CATCCGGACG CCGTGGGCCG AGATCGGGGC CACCCACCGC TCGGCGCTCC TGGTCTGCCG TGCCACCGGC AACAGGCGTG GCGAGGCCGT CATGCTGCGC GGCCTGGGCG AGCTGGACCA CCACCTGGGG CGGCGGCAGG AGTGCCTGGA CACCCTGAAC CGTGCGCGCG CCCTCTTCGC CGAGATCCGC GACGCCCCGG GCGAGGCCGA CACGGCCGCC CGGCTCGACG CGCTCCGGGC GAGGGCGGCG CCGGCCCCCG CGACCTGA
|
Protein sequence | MTDIRFRLLG PLRVWRGETE VKIGSDKQRA VLALLLLRAG SPVRRQEIID TLWGDDTPES VVNLVQTYVG RLRRQIDPGK GAYSASTWLA GMGTAYVVRL DRCDVDLVRF RAGVAGARSA TSPEESLALL LSALRMWNGP CLADLDHVLR GHPWVRAIEH ERIDTLLDAA KTAQRLGRSA DVIPQLRAVA AAEPLNEAVH TALVLALAAS GMQAEALAEY GLIRLRLAEE LGVDPGSQLR EAYFQVLRQE TRYEGAGPAE PPCPSLLPAD IADFTGRDKL VEQLSGLIAD RRPGPIPVST ITGRAGVGKS TLAVHLAHRM IGDFPGGQLY ADLRGSAEQP ADPSRVLTRF LRSLGISGQA IPEDADERAE LYRTQLAGRR VLVVLDDAAD QAQVRPLLPG SPSCSVIVTS RSRMAGWPGA HAVDLDLLEP HHAGDLLAVI VGAERVAPEP EAATELVRLC GRLPLAIRGA ATRLAARPHW TLARMAGRMA DERHGLDELS DVRATLALGY RRLDGPAQRA LRLLGLLDLP TFAPWLVAGV LEASTESAED LIDALADAYF LDTAGVDAVG QPRYRFHELV RRYARELALR EESEATVSTV VIRALAILLA LAQDADGRLP YTVRAPLYGR SPRWPPPAAV REPLLADPLA WFDSERSCLV AAVLQASDLG HDELAWELAA ATLNAAIIRT PWAEIGATHR SALLVCRATG NRRGEAVMLR GLGELDHHLG RRQECLDTLN RARALFAEIR DAPGEADTAA RLDALRARAA PAPAT
|
| |