Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_2745 |
Symbol | |
ID | 8666031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 2976586 |
End bp | 2979783 |
Gene Length | 3198 bp |
Protein Length | 1065 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003338448 |
Protein GI | 271964252 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00032694 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.196454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCGGC GGCGGGACGT CCAGAGTGGG GCGTACATCG TCGTCGGGAC GGGAGCAGCG TTGAGCGAAG GACTGGCGTC CGCGAAATCG ATCCTCGAAA CCGCCCCCCG GGTGGGCACC CGCAGGGCCC TGGTCTTCCT CGACCCGCCT CCGGACCCCC TGCCCGGACT GCCGCCGGTC TCGGCCCTCG CCGCCGCCCC GCCGGACGTG CGCCTGGCGA GCGCGCTGCG CGCCGAGGGC TTCGCCGTCG ACTCCGGGCG CAGCCGGGGG GCGCAGGCCG ACGGCGAGCG GGTGTGGGAG TTCCTGCGCT CGGCCCAGGA GCGCGACCTG CTGGTGGTCC GCCTGCCCGG CGACGACGGC TGGACCTCCG GGCAGCTCGG ATCGATGATC GGCGCGTTCG GCCGGCAGGC CGGCGAGACG CTGGCGGCCG CCGTACTGCT AATCATGGAT TTCGGCTGGA GCGCGCTGGA CGGCTCCGCC GCGCCGGTCC GGCAGCTCGC CGTGGCGGGA GAGCCCCGGG TGGCCGCGGC CGCTCTCACC CTGACCCTGA AACTCGGCGA GACGCTCTCC GGCCGGGCGC CCAGCCTGAC CGGCGTCCTC GCCGAGGGGC TGGCCACCCG GGCGGCCGAC CTGGACGGCG ACGGGATGAT CAGCGTCGGC GACCTGCATG CCTACGCGCT GCGGCACTAC GAGACCGCCT CCGCCCCGCG GACCCCCATC CTGATCGCCT ACGGCGCCGT CACCGGCGTC GCCCTGACCC GGGCCCGGGA CCTGTCCACC CCGCCCCACG CCGCCTTCGC GCGGGTCCTG GCCCGGCTCG GCGACCCCGC GCCCCCGGCC TCGGGCGCCG AGATCGTCCG GCGGATCTAC GAGCTCCACC TGAGCGAGCC GGCCAGGGAG TTGCTGCGCA GGGCCTCCCT GCTCGACGAG GGCGAGTCGA TCGACGAGGT CCTCGGCGAC GCCGCCGAGC TCCGGCACTG GGGGCTGCTC GGCGAGACGA CCATGTTCGT CCATCCGGAC GTGCGGGCCT TCGGCTACTC GCGGCTGTCG GCGGCCGAAC GCGCCCAGAC CTCCGCCGTG CTGCGCCGTC TGCGGGCCGG GCGGCGCGCC GTGCAGCCGC GGGCCAGGCT CACCGCCGAC CGGTGGACCA CCGAGGACCA GCTCGGGCAC CGGGTCTACG CCGAGGCCAT CGCCGCGTTC GTCCGCCATC CCGAGACGCG GCCGCCGCTC ACGATCGGCG TCAAGGGGCC GTGGGGCACC GGCAAGACCT CCCTGATGCG GATGATCCAG GATCTGCTGG ACCCCGGGGC CGCCGGGGAC CGCCCCGCGG AGATCCATCT GCCCGGCTCC CGCCGGGACA GGGCCCTGAC CAACGCGGAG GTGCTGGCCA GATCCCGGCA GCGGCCCCGG GAGAGCACGG GCCAGGCCGA GCCGGGCGCC CTTCCGGTAC GGCGGGCGGA CTGGCGGCCG ACCGTCTGGT TCAACCCGTG GATGTACCAG AACGGCGAGC AGGTGTGGGC GGGGCTCGCC CACGAGATCA TCGGCCAGGT CACCCGGCGG CTGCCCCTCG CGGAGCGGGA ACGGTTCTGG CTGGAGCTCA ACCTTGCCAG GATCGACCGG GAGGCCGTCC GCCGGCGTGC CTACCACCTG GCGATGACCC GGCTGGTCCC GCTGGCGCTG GGCCTGGTCG CCACCCTGGT GCTCACCGGA GCCTTTCTCA CGGCCTCCGC GCTGCTGCCC GCGTTCGGGG CCCTCCTGCG GAACGCGGCC GCCGGGATCG GCTCGGCCGG TTCGGTGGCC GTGGTCGCGG CGGGCGCCGT ACGGCTCGCG AGGTTCTTCC GGGAGTCGGC GGACACGGCC TTCCAGGGGC TGGTCCGCCA GCCCGACCTG CTGTCCCCCG GTGTGGCGGA GGGGCTCGCC GCCGAGGCCG TGACCACCCC CGGCTACGGT TCCAGGACCG GCTTCCTGCA CCTGGTGCAG ACCGACATGC GGCAGGTGCT CGATCTGATC GCCACCGAGG AACGGCCGCT GGTGGTGTTC GTCGACGACC TGGACCGGTG CTCGCCGGGG ACGGTGGCGC AGGTCATCGA GGCGATCAAC CTGTTCCTCG CCGGGGAGTT CCCCAACTGC GTGTTCGTGC TGGCCATGGA GCCGGAGGTG GTGGCCGCGC ACGTGGAGGC CGCCTATCCG GAGCTGGTGG GGACGATGCC GGACGACGGG CGCTCCGGGC TGGGCTGGCG GTTCCTGGAG AAGATCGTGC AGTTGCCGCT GAGCGTGCCG CTGCTGGACG ACGCCGACCG GCTGCCCGGC TTCGTGCGGG CGCTGCTGGG CATGCCCGGG ATCGCCGGGC CCCGCGAGCC CGTGGAGTGC TGCCTGCCCG GCGGGGCCAC GCCCCGGCGG CACGGCCGGC CGCCACGTGC GACGGGGCTC CGGTCAGGTG AGGCCAGCGC GCAGGCGCGC GAGAGCGGGC CGCCGCCGCG CGGGAGCGTG CCGGAACCAC GGGGAGCCGT GCCGGAACCA CGCGGGGCCG AGGACGAGGC CGTGCCGGGG TCGCGCGCGC CGGTCGACGT GCCGCTCGCC GTACCCAGGT CACGGCCGGT GGAACCGCCG GATCCGGCGC TGGTGAGCCG CCTGGAGGAC GCGATCTGGG CACTGCGGCC GACGGCCGCG GACCTGGACG AGGCGGCGAG GCAGGCGCAG GAGATCCTGG GCATCGAGGC CCTGGACGCG ATCGGCGGGC TGGCCTCGGC GACGCGTGAG GCGGCCGACC GGGTCTTCGA CGACCTCTAC AGCGACGAGA ACGCCTACCG GGCGATCGAG TTCGTGCTCC CGGTCATGAC CTTCTTCAAC CCCCGCGAGA TCAAGCGCTA CGTGAACGTC TTCCGCTTCT ACTCCTTCCT CGTCTACCGG CGCACCCTCG CCGGTGCGGC GCCCGCCTCC GACGGCGAGG TGGCCAAGCT CGCGGCGCTG ACCATCCACT GGCCCCACCT GCTGTCGCCG CTGGTCAAGG AGGTCGACGG GGTGAGCGTG CTGCAACGGC TCGAACGCGC CGCGGACGAC GACCGCGGCT GGGAGCGGAC CGTCCGCGAG GCGGGCCTGG CCGGTCCGGA GACCCCGGGC GTCGCCCACC TGGACACCCT GCGCGACCTG CTCTCCTGCC CCCACCCCAT CGCCGGGCTC GCCCGCCACC TCCTCTGA
|
Protein sequence | MRRRRDVQSG AYIVVGTGAA LSEGLASAKS ILETAPRVGT RRALVFLDPP PDPLPGLPPV SALAAAPPDV RLASALRAEG FAVDSGRSRG AQADGERVWE FLRSAQERDL LVVRLPGDDG WTSGQLGSMI GAFGRQAGET LAAAVLLIMD FGWSALDGSA APVRQLAVAG EPRVAAAALT LTLKLGETLS GRAPSLTGVL AEGLATRAAD LDGDGMISVG DLHAYALRHY ETASAPRTPI LIAYGAVTGV ALTRARDLST PPHAAFARVL ARLGDPAPPA SGAEIVRRIY ELHLSEPARE LLRRASLLDE GESIDEVLGD AAELRHWGLL GETTMFVHPD VRAFGYSRLS AAERAQTSAV LRRLRAGRRA VQPRARLTAD RWTTEDQLGH RVYAEAIAAF VRHPETRPPL TIGVKGPWGT GKTSLMRMIQ DLLDPGAAGD RPAEIHLPGS RRDRALTNAE VLARSRQRPR ESTGQAEPGA LPVRRADWRP TVWFNPWMYQ NGEQVWAGLA HEIIGQVTRR LPLAERERFW LELNLARIDR EAVRRRAYHL AMTRLVPLAL GLVATLVLTG AFLTASALLP AFGALLRNAA AGIGSAGSVA VVAAGAVRLA RFFRESADTA FQGLVRQPDL LSPGVAEGLA AEAVTTPGYG SRTGFLHLVQ TDMRQVLDLI ATEERPLVVF VDDLDRCSPG TVAQVIEAIN LFLAGEFPNC VFVLAMEPEV VAAHVEAAYP ELVGTMPDDG RSGLGWRFLE KIVQLPLSVP LLDDADRLPG FVRALLGMPG IAGPREPVEC CLPGGATPRR HGRPPRATGL RSGEASAQAR ESGPPPRGSV PEPRGAVPEP RGAEDEAVPG SRAPVDVPLA VPRSRPVEPP DPALVSRLED AIWALRPTAA DLDEAARQAQ EILGIEALDA IGGLASATRE AADRVFDDLY SDENAYRAIE FVLPVMTFFN PREIKRYVNV FRFYSFLVYR RTLAGAAPAS DGEVAKLAAL TIHWPHLLSP LVKEVDGVSV LQRLERAADD DRGWERTVRE AGLAGPETPG VAHLDTLRDL LSCPHPIAGL ARHLL
|
| |