Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_1788 |
Symbol | |
ID | 8665066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 1905552 |
End bp | 1908476 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003337521 |
Protein GI | 271963325 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGACGG CGACATCCGA GAGCGGCGGT GCTTCGGCAC GCGACCGCGG CAAGCGGCCT GCCGCCCGGT TCGGCTTGCG GAACTGGCGT GTGCGGTCGC GCCTCACGGC TCTGATCCTG GTGCCGACCG CGGTGGGCGT CGTGCTCGCG GGCACCCGCG TGGTCGCCTC CGTCGACAGC GTGGCGGGCT ACCAGCGGAC CGCCTCGGCG GCCGACTACT CCGGAGACAT CCGCGACCTG GCCCAGGCCC TCGGTTTGGA ACGTGACCGG GGTGCGTGGT CCTCCTTCCA GGCGTCGAAC AAGGGGCTGA AGGACAGCGC CGAGGCGCAG AAGAAGACGG TCGACGCGCT GATCGGCAAG GTCCGGCTCG ACCTGCAGGC GATCGACGAC TCCTACGGCG TCCGGGCGGC CGAGGCCGCC AGGGACGTGG GCTACCAGCT CTCCGGACTG AAGGAGCTCC GCCAGATACC GGGGACGAAC CGTACGGAAC GGTACGGCCT CATGATCGCC CCGCTGCTCC AGCTCCACGA GGAGCTCACG CTGGTCAGCG ACGACCCGGA GATCATCGGC AACACCCGTG CGCTCAGCGC GCTGGCCTAC GCCAAGGAGG AGGTCTCCAA GCAGCGGGCC CGCCTGCTGG CCGGCTACTA CGCGCCCTCC CTGGTCAACG CCCAGGAGAT CGAGAAGTTC ATCGCCTCCC GGTCGCGCAT GGAGGAGTAC AAGGCGGACT TCGGCGTCGA GGCGAGCCCG GAGAACGGCC AGCTGCTCGT CAAGGCCATG ATCGACGAGA AGGTGCACCG CGCCGAGCTG ACCAAGTCCC GGGCGATCGT GTTGGCCAGC GACCCGCGGA ACGAGGCGCG CCTGCTCTCC GACTTCGACG AGGTCAAGCA GTGGTTCTCG GACAACGGCG CGATCCTCGA CCGGATGAGG AAGGTCGAGG TCAAGGTCGC CGGTGACGTG ATCACCCGGG CCCGGGTGCT GGAGGAGACC GAGCAGCGCA ACGCGATCAT CGCGTCCGGG CTCATCCTGC TCCTGCTCCT GCTGGTCCTC GGTCTCACCG TCGTCATCGC CCGCTCGATG GTCCTGCCGC TGCGCCGCCT GCGCGCCGAG GCACTGGACG TCGCCGGATT CACCCTCCCC GAGGTCGTGC GCAGGCTCCG GGTCTCCGGA GACTCCCAGA CCCCCGAGAT CGCCTCCATC TCGGTCGACA CCAAGGACGA GATCGGGGAG GTCGCCAAGG CCTTCGACGA GGTCCACCGG CAGGCCGTAC GGCTGGCCGC CGAGGAGTCC GAGCTCCGCT CCAACATCAG CGCGATGTTC GTCAACCTGT CGCGCCGCAC CCAGACCCTG GTCGAGCGGC AGATCTCGCT GATCGACGGC CTGGAGAAGG GCGAGGAGGA CGGCGGCCGC CTGGCGGACC TGTTCAAGCT CGACCACCTC GCCACCCGCA TGCGCCGCAA CTCCGAGAAC CTGCTGGTCC TCGCCGGTCA CGAGCCCACC CGCAGGCGCA GCCAGCCGGC CAAGCTCGTC GACGTCGTGC GCGCCTCGCT GTCGGAGGTC GAGGACTACG AGCGGGTCCA GGTCAAGGTG CACCGCACGA TCTCGGTCGC CGGGAGCGCC GCCAACGACA TCGTCCACCT GGTCGCCGAG CTGGTGGAGA ACGCCATCCA GTTCTCGCCC CGGGCCAGCC AGGTCGTGGT CTCCAGCAGC ATGATCGAGG GCGGTGGCGC GCTGCTCGCG GTCAGCGACG CCGGCATCGG CATGACCACC GAGGAGCTGA TCGAGACCAA CCGGCGCCTG GCCGACCCGC CCGTCGTCGA CGTCTCGGTG TCACGGCGGA TGGGCCTGTT CGTGGTCGGC CGGCTGGCCC TGCGCCACGG CATCCGCGTC CAGCTCCGCC CGCAGGAGGT CGGCGGCCTC ATCGCGATGG TCCTCTTCCC GCCGGAGCTG ATCGTCGAGG CGATCCAGCC GCCGTCCATC ACCCCGTCGT GGGGCGCCGA GTCGCGGGCC CAGCAGCCTT CCCCGGACCA GAACCCCTTC GGGCAGACGC CGTTCGGCCA GACCTCCTTC GGCCAGACGT CGTTCGGGCA GGCCTCGATC CCCCCGGCCT CCTCGTTCGG GCAGACCTCG TTCGGGCGCA CTCCCGACCC GCAGGCCTTC CCGTCCGGCT CCACGTCGTT CCCCGGCGAC CAGCCGCTCC CGGACCGCCG GCAGCCGCTC CCCGCGAGCC CGCCCGGCGC TCCGGCGTCG GCCGGCGACT TCCCCCTCCC CAAGCGCCCG GTCCCCGGGG CGAGCGGTGG CGGGAGCGTT CCAAGCCCGG GTTTCTCCCC CTGGGGACAG AACTCCCAGG ACGACCCGGC GACCGCGTCG ATGCCGGCCG TACGGGTCTC CCCGCTCGAA TCCGAGCAGG AGGAGTTCCT GCCGATCTTC GCGTCCATCG AGTCGGCCTG GTTCCGCCGG GCCGAGGACA CCGGCGAGCA CGCCGTCGCC GCCGGGGAGG GGGAGAGCGC GCAGCGCGCG GGCTCCCCGC TGACCGACCC GTTGACCGAC CCACTGGCCG GCCCGCCGGG CTCGGAGGCC GAGGAGGCCC CCACGCCCCC GGACGGCCTG CCGGTCCGTG AGCCGGTGGC GGCCCAGGAG GCGGGCGGCT GGCAGACCCC GGCCGACGCC GGGTGGCAGG CGGCCCAGGC GGCGAGCGAT CCCACCCTCG GCGGGATCAC CTCGGCGGGC CTGCCCAAGC GGACTCCCAA AGCCAACCTG GTGCCCGGCG CCGCGGCCAG CGTGCCGTCC ACCCCGATGC CGTCTCTCTC GCCCGAACGG GTACGTAGCC GCTTGTCGAG TTTCCAGCAG GGTGTACGGC GGGGCCGTGC CGAACTGAAC GAGGACACCG CCAGGAGCCT GGCGGATAGG GAGGAGGGCT CGTGA
|
Protein sequence | MTTATSESGG ASARDRGKRP AARFGLRNWR VRSRLTALIL VPTAVGVVLA GTRVVASVDS VAGYQRTASA ADYSGDIRDL AQALGLERDR GAWSSFQASN KGLKDSAEAQ KKTVDALIGK VRLDLQAIDD SYGVRAAEAA RDVGYQLSGL KELRQIPGTN RTERYGLMIA PLLQLHEELT LVSDDPEIIG NTRALSALAY AKEEVSKQRA RLLAGYYAPS LVNAQEIEKF IASRSRMEEY KADFGVEASP ENGQLLVKAM IDEKVHRAEL TKSRAIVLAS DPRNEARLLS DFDEVKQWFS DNGAILDRMR KVEVKVAGDV ITRARVLEET EQRNAIIASG LILLLLLLVL GLTVVIARSM VLPLRRLRAE ALDVAGFTLP EVVRRLRVSG DSQTPEIASI SVDTKDEIGE VAKAFDEVHR QAVRLAAEES ELRSNISAMF VNLSRRTQTL VERQISLIDG LEKGEEDGGR LADLFKLDHL ATRMRRNSEN LLVLAGHEPT RRRSQPAKLV DVVRASLSEV EDYERVQVKV HRTISVAGSA ANDIVHLVAE LVENAIQFSP RASQVVVSSS MIEGGGALLA VSDAGIGMTT EELIETNRRL ADPPVVDVSV SRRMGLFVVG RLALRHGIRV QLRPQEVGGL IAMVLFPPEL IVEAIQPPSI TPSWGAESRA QQPSPDQNPF GQTPFGQTSF GQTSFGQASI PPASSFGQTS FGRTPDPQAF PSGSTSFPGD QPLPDRRQPL PASPPGAPAS AGDFPLPKRP VPGASGGGSV PSPGFSPWGQ NSQDDPATAS MPAVRVSPLE SEQEEFLPIF ASIESAWFRR AEDTGEHAVA AGEGESAQRA GSPLTDPLTD PLAGPPGSEA EEAPTPPDGL PVREPVAAQE AGGWQTPADA GWQAAQAASD PTLGGITSAG LPKRTPKANL VPGAAASVPS TPMPSLSPER VRSRLSSFQQ GVRRGRAELN EDTARSLADR EEGS
|
| |