Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_1603 |
Symbol | |
ID | 8664880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 1708321 |
End bp | 1710030 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | 4-alpha-D-((1->4)-alpha-D- glucano)trehalosetreha lohydrolase |
Protein accession | YP_003337338 |
Protein GI | 271963142 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGAGG TCTGGGCGCC GCAGGCGACG GCGGTCGATG TGGAGATCGG GGAGATCCGC CATCCGATGG CCGCCGGGAC GGGCGGCTGG TGGTCGGCGG AGGTGCCGGG GGCAGGTCAC GGGACCGACT ACAGGTTCCG GGTGGACGGC GGCGAGCCGC TGCCGGACCC CCGGACCCGG TGGCAGCCCG AGGGGATCTT CGGGCCGAGC CGGGTCTACG AGCACGACCG GTACGTCTGG GGTGACGGGC TGTGGCGCGG GCGCGACCTG CCGGGGTCGG TGATCTACGA GCTGCACGTC GGCACCTTCA CCCCCGCCGG GACCTTCGAC GCGGCGATCG GGAAGCTTGA GCACCTCGCC GGGCTCGGCG TCGACTTCGT CGAGGTCATG CCGGTGCCGC CGGTGCCGGG GGAGCGCAAC TGGGGCTACG ACGGGGTGGA CCTGTGGGCC GTCACCGAGA ACTACGACGG CCCCGACGGT CTCAAGCGCT TCGTGGACGC CTGCCACCGG CAGGGCATCG GCGTCATCCT CGACGTCGTC TACAACCATC TCGGCCCGTC GGGGAACTTC CTGGCCCCGT TCGGCCCCTA CTTCCACTCC AGCGCCTCCT CCTTCTGGGG GCAGGCGGTC AACCTGGACG GCCCCGGCTC CGACGAGGTG CGCCGCTACT TCATCGGCAA CGCGCTGCAG TGGCTGCGCG ACTACCACAT CGACGGCCTG CGGCTGGACG CCGTGCACGC GCTGCACGAC AGGCGGGCCG TGCACCTGCT GGAGGAGATG GCCGCGGAGG TGGAGGCGCT GTCGGCGGCC GTGGGCAGGC CGCTGACGCT CATCGCCGAG TCCGACCTCA ACGACCCCCG GCTGGTGACC CCGCGCGAGG CCGGCGGGTA CGGCCTGGCC GCGGCCTGGA ACGACGACGT CCACCACGCC CTGCACGTGG CGGTGACCGG GGAGCGGCAC GGCTACTACG ACGACTTCGC CGGAGCGCTG CCCAAGGTCC TCGCCTCGGC CTACTACCAC GACGGCACCT ACTCGGCCTT CCGGGGGCGC TCCCACGGCC GTCCGGCCCG TCACGTGCCC GGCTACCGGT TCGTCTGCGC CGCGCAGAAC CACGACCAGA TCGGCAACCG TGCTGAGGGC GACCGGATGG CGCCGGAGGC GCTGCGGCTG GCCGCCGGGC TGCTGCTCAC CTCGCCGTTC ACCCCCATGC TGTTCATGGG GGAGGAGTGG GGAGCGCGCA CGCCGTTCCT GTTCTTCACC GACCACGTCG AGCCGCAGCT CCGCGAGGGC GAGGCGGACC GGCGGCGGCG GGAGTTCGTC GGCTTCGGCT ACGACGACTG GGCCGAGAAG GCGCCGGACC CGGGGGAGGA GCTGACGTTC CTGCGCTCCA AGCTCGACTG GAGCGAGCTC GACGACGACG CCCACCGGGT CCATCTCGAC TGGTACCGGG CCCTGATCGC GCTGCGCCGC GCGCATCCGG ACCTGTCGGA CCCGAGGCTC GACCGGGTCC GGGCCGAGCA CGACGGCTCC TGGCTGGTGG TCCACCGGGG GGCGTTCCGG GTGGCCGTCA ACTTCGGCGC CACGCCGGTC TCCCTCGACC TCACGGCCCC GGCCCAGGTC GTGCTCGCCT CCGACCCCGG CGTCCACCTG GACTCCGGCC TCACCCTCCC GGCCCGTTCC CTGGCGGTGC TCCGCCTCGC CGGGACCTGA
|
Protein sequence | MFEVWAPQAT AVDVEIGEIR HPMAAGTGGW WSAEVPGAGH GTDYRFRVDG GEPLPDPRTR WQPEGIFGPS RVYEHDRYVW GDGLWRGRDL PGSVIYELHV GTFTPAGTFD AAIGKLEHLA GLGVDFVEVM PVPPVPGERN WGYDGVDLWA VTENYDGPDG LKRFVDACHR QGIGVILDVV YNHLGPSGNF LAPFGPYFHS SASSFWGQAV NLDGPGSDEV RRYFIGNALQ WLRDYHIDGL RLDAVHALHD RRAVHLLEEM AAEVEALSAA VGRPLTLIAE SDLNDPRLVT PREAGGYGLA AAWNDDVHHA LHVAVTGERH GYYDDFAGAL PKVLASAYYH DGTYSAFRGR SHGRPARHVP GYRFVCAAQN HDQIGNRAEG DRMAPEALRL AAGLLLTSPF TPMLFMGEEW GARTPFLFFT DHVEPQLREG EADRRRREFV GFGYDDWAEK APDPGEELTF LRSKLDWSEL DDDAHRVHLD WYRALIALRR AHPDLSDPRL DRVRAEHDGS WLVVHRGAFR VAVNFGATPV SLDLTAPAQV VLASDPGVHL DSGLTLPARS LAVLRLAGT
|
| |