Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3391 |
Symbol | |
ID | 8666679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3727790 |
End bp | 3730705 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003339071 |
Protein GI | 271964875 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.436064 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGTTG ATCAGCCCGT ATTCAGCGTT CTCGGTTCTC TGCGGGTCCG GATGCCGGAA GGGACGCTTC CCGTCGCGGG CACGAAGCCG CGGGTCCTGC TGGCCTCGCT GTTGCTCAAC GCGAACCAGG TGGTCGGTTC GGACCTGCTC GTAGAGACGC TGTGGCCCCA GCGCCGTCCG CGGTCCGCCC TTGCGAACCT CCGCACCTAC GTCAGCTTCC TGCGCGGCAC CCTCGGCGCG GCAGGCGCGC AGATCCTGGC CAGGCCGTCC GGCTACGCGG TCGAGCTGCG GATCGACCAG CTCGACGCGC TGCTGTTCGA GGACCTGGTC GCCAGGGCGC GGGCGGCGGG CCGTGACGAG GAGGCCATCG AGTGCCTACG CCGGGCGCTC GCGCTCTGGC ACGGCACGCC GCTGGGCGAC CTGCCGGCCA GCCCGCAGTG GGACGGGCGG CTGCGGTCGC TCACAGAGGC GCGCCTCGGC GCCGCCGAGG ACCTGGCCGC CATGAGAATG GAGAGAGGGG AGTACCCGGC CGCGATCGGT GACCTGCGGG AACTCGTCAA GGTCCATCCC TTCAGGGAGG ACCTCTGGCG GCAGCTGATG CTCGCCCTGC ACGGGAGCGG CCGGCAGGCC GAAGCGCTGC AGGCCTACGC CACCGTCAGG CAGCAGCTGG TCGACGAGCT CGGCATCGAA CCGGGGCCGG ACCTGCGCGC GGTGCACGCG GCCGTCCTCG CGGGAGAGCT CCTGCCCGCC GTGCCGGCCG CATCCGCTCT CCAGCGGCCC CCGGCGCCCT CCGCGCGAAG CGTGATCGCG CCCCAGCAGC TCCCGCCGGA CATTCCGGAC TTCACCGGCA GAACGGGCGC CGTCGCCGAC CTGGCGCGAG CCCTGTCGGC CAAGGGGCGG CCGTCGGACG AGCCTCCGTC GATCGCGGTG GTCGTGGGGC CGCCGGGCGT GGGCAAGTCG GCGCTCGCCG TGCACTGCGC GAACGCCGTA CGGGCCGACT ACCCCGCCGG CCAGCTCTAC CTCGGCCTCG GCGGTACGGC CGCCGCCCCT GCCGACCTCG GCGAGCTCCT GGCCGAGGCG TTGCGGGCGC TGGGGGCCGG TGAGGCGGAC CTGCCGCCCA CGGTGCACGA ACGCTCCGCC CTGTACCGCT CCCTGCTGGC GGAACGTCCC ATGCTCGTCC TGCTCGACGA CGCCGCCGAC GCGGCGCAGG TGCGGGCCCT GCTTCCCGGC AACGGTTGCG CGGTGCTCGT GACGAGCCGG CGGCGGATCA CGGAGCTGCC CAGCTCGCTC CGGCTGGACC TGGGCGTCAT GTCGCCCCCT GAGGCCGAGG AGTTCCTGGG GAAGATCGTG GGTGCCGAAC GGCTGTCAGA GGAGAGGGAG GACGCCTCGG CGATCCTCCG CTCCTGCGGA TACCTGCCGC TCGCTGTCAG GATCGCAGGA GCCAGGCTCG CCGGCCGGCC GGGCTGGCCG CTGAGCGTGC TGCGACAGCG GCTGGACGAC GAGTCGAACC GGCTCGACGA GCTGCGAGCG GGCGACCTGG AAGTACGGGA CTCCTTCGAC CGCAGCTACC GGCAGCTGCC CGACGAGGTG GCCAGGACCT ACCGGACGCT GGGCCTTCTC GGCCCGCAGT CCATGCCGGG CTGGGTGGTC GACGCCGTCC TGGACCGCAC CCGGGCCGAG ACGGTGATGG ACACTCTCGT GGACGTGAAC CTCGTGCAAC CGGCCGGGAC GGACGCGATC GGCCAGCGCC GTTACCGGTT GCACGACCTG GCCCGCTGCA ACGCCAGGGA GAAGGCCGGC GGCGAGCGTC ACACCCTCGT CAGGGTGCTC GGAACATGGA TGACCGCCAT CGAGCAGGCC GCGTCGCGGC TGCCGACCAC GCTCTTCAGC CTGACGTCCG CGGCGGCACC CCGGTGGGAC CCGGCGGAAG AGACCCTCAG GCGCCTGACC GCCGACCCGC TGCCGTGGTT CGACGCCGAG CGGGAGTCAC TGGTGGCGGC GGTGCGGCTG GCCGCCGACG CGGGACTGTC GCAGGCCTCG TGGGGACTCG CGGCGGCGCT CGTCCCCTAC TTCGACCTCA ACTGCCGGTT CGACGAGTGG CGGCACACGC ATCAGGTCGC GCTGGACTCC GCGCGCATGG CCGAGGACCT CAACGGCGAG GCCGCCATGC TCCGCGGCCT GGCTCAGGTC TGCCTCTACC AGGATCGATA CGCCGAGGCG CGAGAGATGC TCCGGCGATC TCGCGCGATC TTCCACGAGC TGGGCGACCT ACGCGGCGAG GCGATCTCGA TCTGCGGGCT GGGAGCGGCC AGCCAGTTCT CCGGTGAACA TCTCACGGCG CTCGGATACT TCCGGCAGGC CCTGGCCATG TTCCTCGCCA TGGACGACAG AAGCGGTGAG GCCTACGCCC GGCAGGCGAT CGGGCGTGTG TACCTGACGC TGCGCGACTT TCGCCGGGCC TCGGGATGGC TCGGGGAGGC GTTGCGGCTG GCCGAGGAGC TCGGCGACGC CCATCGTGAA GGGGGCGTGT CCATGCAGCT CGGGCGGCTG TACGACCTGG TGGCCCAGTC CGACGAGGCG ATGCGCGTCC AGGGGCGCGC GCTCGACATC TTCGAGACGC TCGGCGATCG TCACTGCGGC GCCTACGCCA TGCGGAACCT TGGCGGGCTG CAGGTGAAGA AGGGCGATCG GTCCAGCGGT TCCGACCAGC TGCAGCGCTC GCTGGCGATC TTCCAGCAGC TCGGCGACCG GAGCGGGGAG GCCGCCGCGT TCCAGACGCT CGGCGAGCTG CACCAGTCGG CGGGCCGTAC CGCTCTCGCC CAGTACTACC TGCACCAGGC CCTCACGTTG AGGCGCGAGC TGCGAAGCGG CGCGGGAGGT GGGCAGGGGC CCGCGCTGAT GGCTTCGCAT CCCTGA
|
Protein sequence | MVVDQPVFSV LGSLRVRMPE GTLPVAGTKP RVLLASLLLN ANQVVGSDLL VETLWPQRRP RSALANLRTY VSFLRGTLGA AGAQILARPS GYAVELRIDQ LDALLFEDLV ARARAAGRDE EAIECLRRAL ALWHGTPLGD LPASPQWDGR LRSLTEARLG AAEDLAAMRM ERGEYPAAIG DLRELVKVHP FREDLWRQLM LALHGSGRQA EALQAYATVR QQLVDELGIE PGPDLRAVHA AVLAGELLPA VPAASALQRP PAPSARSVIA PQQLPPDIPD FTGRTGAVAD LARALSAKGR PSDEPPSIAV VVGPPGVGKS ALAVHCANAV RADYPAGQLY LGLGGTAAAP ADLGELLAEA LRALGAGEAD LPPTVHERSA LYRSLLAERP MLVLLDDAAD AAQVRALLPG NGCAVLVTSR RRITELPSSL RLDLGVMSPP EAEEFLGKIV GAERLSEERE DASAILRSCG YLPLAVRIAG ARLAGRPGWP LSVLRQRLDD ESNRLDELRA GDLEVRDSFD RSYRQLPDEV ARTYRTLGLL GPQSMPGWVV DAVLDRTRAE TVMDTLVDVN LVQPAGTDAI GQRRYRLHDL ARCNAREKAG GERHTLVRVL GTWMTAIEQA ASRLPTTLFS LTSAAAPRWD PAEETLRRLT ADPLPWFDAE RESLVAAVRL AADAGLSQAS WGLAAALVPY FDLNCRFDEW RHTHQVALDS ARMAEDLNGE AAMLRGLAQV CLYQDRYAEA REMLRRSRAI FHELGDLRGE AISICGLGAA SQFSGEHLTA LGYFRQALAM FLAMDDRSGE AYARQAIGRV YLTLRDFRRA SGWLGEALRL AEELGDAHRE GGVSMQLGRL YDLVAQSDEA MRVQGRALDI FETLGDRHCG AYAMRNLGGL QVKKGDRSSG SDQLQRSLAI FQQLGDRSGE AAAFQTLGEL HQSAGRTALA QYYLHQALTL RRELRSGAGG GQGPALMASH P
|
| |