Gene Information |
Locus tag | Sros_3336 |
Symbol | |
ID | 8666624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 3643944 |
End bp | 3646937 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Glycosidase-like protein |
Protein accession | YP_003339018 |
Protein GI | 271964822 |
COG category | |
COG ID | |
TIGRFAM ID | |
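
The coordinate and length fields in the table above can be cross-checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3643944
end_bp = 3646937

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values agree with the listed Gene Length (2994 bp) and Protein Length (997 aa).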
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.310456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.29304 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTCAT CCCCCCGACG CGCTCTCCGG CTGCTCACCG TCGCGGTCCT CGGCCTGGCC GGCGTCGCCG TCCCGCTCTC GGCGGCGCTG GCGGCCAACA GCGTGACCGT CTACTACGCC ACCGCCTGGA CCGGCGCCAA CATCCACTAC CAGCCCACCG GCGGAAGCTG GACCCCCGTC CCCGGCGTCG CCATGGACGA GACCGCCTGC GCGGGCTGGA AGAAGAAGAC CGTCGACCTC GGCACCGCCA CCGGGCTCAA GGCCGCCTTC AACAACGGCT CCGGCACCTG GGACAACAAC GGCGGCCGCG ACTACACCAT CGGCGCCGGG GCCGTGAAGG TCTCCGGCGG CCAGGTCACC GCCGGCGACC CCTGCCCCGG CGGTGGACCG ACCGAGCCGC CCGGCAAGCA GGCCGTCGTC TACTTCCACA AGAAGACCAG GGGCTGGAGC GCCGCCAACA TCCACTACCA GCCCACCGGC GGGACCTGGA CCCCCGTCCC CGGCGTCGCC ATGGACGAGA CCGCCTGCGC CGACTGGGCC AGGAAGACCG TCGACCTCGG CACCGCCACC GGCCTCAAGG CCGCCTTCAA CAACGGCTCC GGCACCTGGG ACAACAACAA CGGCGCCGAC TACGCCATCG GCACCGGACT GACCACCGTC AAGGACGGGG CGGTGCGCGC GAACGCCACC GAGCCGTGCA CCCCCGAACC CCCGGACACC ACCGCCCCCT CCGTCCCCAC CGGGCTGACC GCCACGGCGG AGGGCACCAC CGTCAAGCTG AGCTGGAACG AGGCCGGCGA CGACATCGCG GTGACCGGCT ACGAGATCAC CCGCAGGAAG GGCTCCGAGG CCCCGGTGAC GCTCACAACG GGCACCAACA CCGTCTACAC CGACGCCCGC CTGGAGGAGC AGACCACCTA CGCCTACACG GTCAGGGCCC GCGACGCGGC GGGCAACCGG TCCGGGGCGA GCCCCGAGGC GGCGGCGAAG ACCGGCGACG CCCCGCCAGG ACCGGCCAAG GGCTCGCCAC TGGGCGGCGA CCCGCGCGAG GACTCCATCT ACTTCGTGAT GACCGCCCGC TTCAACGACG GCGACACCTC CAACAACCGC GGCGGCGGCC AGCACACCGT CTCCGGCAAC GCCAGGAACA ACGACCCGAT GTTCCGCGGC GACTTCAAGG GCCTGATCGA CAAGCTCGAC TACATCAAGG GCCTCGGCTT CTCCGCTCTG TGGATCACCC CGGTCGTGCT CAACCGCTCC GACTACGACT TCCACGGCTA CCACGGCTGG GACTTCCACC GTGTCGACAC GCGGCTGGAG ACCCCCGGCG CGACCTACCA GGACCTGATC GACAAGGCGC ACGCCAAGGG CATCAAGATC TTCCAGGACG TGGTCTACAA CCACAGCTCC CGCTGGGGAG CCAAGGGGTT GTACGTCCCC CCGGTGTGGG GCGCGCGTGA CGAGCAGTGG AAGTGGCTCT ACAGCGAGAA GGTCCCGGGC AAGGAGTACG ACCCGCTGGA GGAGCACCAG GGCGACGACC CGGAGCTGAC GGCCAGGCAG AACGAGATGG CCAAGGGCAG GCCGTACAAC GGCGACCTGT GGTCCACCGC GGCCCCGGCG GGCAACACCT GCAGGGACTA CGGCACGCCC ACCCAGTGGA AGAGCCCCGA GGGCTTCACC ATCCACAACT GCCAGTGGCC CAGCCCTACC TCGGGCATGT TCCCCGCGGA GTCCTATCAC CAGTGCTGGA TCGGCAACTG GGAGGGCTCG GACTCCAAGA GCTGCTGGCT CCACGACGAC 
CTGGCCGACC TCAACACCGA GAACAAGGCG GTGCAGGACT ACCTGATCGA CGCCTACAAC AAGTACATCG ACATGGGCGT CGACGGCTTC CGCGTCGACA CCGCCGTGCA CGTCTCGCGG CTGGTGTGGA ACCGCCGCTT CCTGCCGGCC CTGCAGCAGC ACGCCGTGGC CACGCACGGG GACAAGGGCA AGGACTTCTA CGTCTTCGGC GAGGTCGGGG CGTTCGTCAA CGACAAGTGG AACCGCGGCT CGGTCAACCA CTCCGCGCAG TTCTACACCT GGAAGGAGCG CAGGGAGTTC AACCCCGACG ACGCCAGGGC GGTCGTCGAG CAGTTCGACT ACGAGAACCT GATGGGCACC GGCAACCAGC CCACCACCGA CAACGCCTTC CTGCGCGGCA ACGCCTACCA CGCGCCCGAC CACTCGAAGT TCTCCGGCAT GAACGTCATC GACATGCGCA TGCACATGAA CTTCGGCGAC GCGCCCAACG CCTTCAACAA CGGCAAGGAC TCCGACGACA GCTACAACGA CGCCACCTAC AACACCGTCT ACGTCGACTC CCACGACTAC GGCCCCAACA AGTCCAACGA GCGCTACACC GGCGGGACCG ACGCGTGGGC GGAGAACATG TCGCTGATGT GGACCTTCCG CGGCATCCCG ACGCTCTACT ACGGCTCGGA GATCGAGTTC CAGAAGGGTC AGAGGATCGA CTGCGGGCCC ACCTGCCCGC TGGCGACCAC CGGCCGCGCC TACTACGGCG ACCACCTCGC GGGCAGCGTC ACCGCCTCCG GCTTCGGCGT CGTCTCCTCG GCCACCGGCG CGGTGGCGCA GACGCTGGAC AAGCCGCTGG TCAAGCACGT GCAGCGGCTC AACCAGATCC GCCGGGCCAT CCCGGCACTG CAGAAGGGGC AGTACTCGAC CGAGGGCGTC ACGGGCCAGA TGGCCTACAA GCGCCGCTTC ACCCAGGGGG CGGTGGACAG CTTCGTGCTG GTCTCGGTCT CCGGGGGCGC CACCTTCACC GGCATCCCGA ACGGCACCTA CGTCGACGCG GTCACCGGTG ACAGCAAGGC CGTGACCGGC GGGACGCTCA CGATCGCCGG CGGCGGCAAA GGCAACCTGC GGGCCTACGT CCTGTCACTG CCCGGCAACC CGGCCCCCGG GAAGATCGGT ACGGCGGGGC CGTACCTGAG GTAG
|
Protein sequence | MRSSPRRALR LLTVAVLGLA GVAVPLSAAL AANSVTVYYA TAWTGANIHY QPTGGSWTPV PGVAMDETAC AGWKKKTVDL GTATGLKAAF NNGSGTWDNN GGRDYTIGAG AVKVSGGQVT AGDPCPGGGP TEPPGKQAVV YFHKKTRGWS AANIHYQPTG GTWTPVPGVA MDETACADWA RKTVDLGTAT GLKAAFNNGS GTWDNNNGAD YAIGTGLTTV KDGAVRANAT EPCTPEPPDT TAPSVPTGLT ATAEGTTVKL SWNEAGDDIA VTGYEITRRK GSEAPVTLTT GTNTVYTDAR LEEQTTYAYT VRARDAAGNR SGASPEAAAK TGDAPPGPAK GSPLGGDPRE DSIYFVMTAR FNDGDTSNNR GGGQHTVSGN ARNNDPMFRG DFKGLIDKLD YIKGLGFSAL WITPVVLNRS DYDFHGYHGW DFHRVDTRLE TPGATYQDLI DKAHAKGIKI FQDVVYNHSS RWGAKGLYVP PVWGARDEQW KWLYSEKVPG KEYDPLEEHQ GDDPELTARQ NEMAKGRPYN GDLWSTAAPA GNTCRDYGTP TQWKSPEGFT IHNCQWPSPT SGMFPAESYH QCWIGNWEGS DSKSCWLHDD LADLNTENKA VQDYLIDAYN KYIDMGVDGF RVDTAVHVSR LVWNRRFLPA LQQHAVATHG DKGKDFYVFG EVGAFVNDKW NRGSVNHSAQ FYTWKERREF NPDDARAVVE QFDYENLMGT GNQPTTDNAF LRGNAYHAPD HSKFSGMNVI DMRMHMNFGD APNAFNNGKD SDDSYNDATY NTVYVDSHDY GPNKSNERYT GGTDAWAENM SLMWTFRGIP TLYYGSEIEF QKGQRIDCGP TCPLATTGRA YYGDHLAGSV TASGFGVVSS ATGAVAQTLD KPLVKHVQRL NQIRRAIPAL QKGQYSTEGV TGQMAYKRRF TQGAVDSFVL VSVSGGATFT GIPNGTYVDA VTGDSKAVTG GTLTIAGGGK GNLRAYVLSL PGNPAPGKIG TAGPYLR
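
The sequences above are printed in 10-character blocks separated by spaces. A minimal sketch of how such a block-formatted string might be normalized and its GC fraction computed is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "ATGCGCTCAT CCCCCCGACG"
seq = clean_sequence(first_blocks)
gc = gc_fraction(seq)
print(len(seq), gc)
```

Applied to the full gene sequence, the result of `gc_fraction` can be compared against the GC content listed in the Gene Information table.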