Gene Information |
Locus tag | Sros_3724 |
Symbol | |
ID | 8667012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 4129270 |
End bp | 4132269 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003339390 |
Protein GI | 271965194 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
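The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both are assumptions that happen to agree with the numbers in this record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp
length_bp = gene_length(4129270, 4132269)
print(length_bp)                  # 3000
print(protein_length(length_bp))  # 999
```

Both results match the Gene Length and Protein Length fields listed in the record.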
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.606148 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.741628 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATCCAT CCCGCAGCCC CGGCGCGGGC CGCCTGAGCC TGCTCATCTC CACTCTGATC GCCCTCATCG CATCCGGCCT GGCCGTCACC GCCCCCGCCG CCGCTGCCGC TGCCGCCGAC ACCCTGCTGT CCCAGGGCAG GGCCGCCACC GCCTCCTCCA CCGAAGGCGC CGCCTACGGC GCCGCCGCCG CCTTCGACGG CGACCCCGCC ACCCGCTGGG CCAGCGCCTT CAGCGACCCG CAATGGATCC AGGTCGACCT CGGCGCCACC GCCACCGTCA GCCAGGTCGT GCTGAACTGG GAGGCCGCCT ACGGCAGGGC CTTCAAGATC CAGATCTCGG CCGACGGGAC CGCCTGGAAC GACGCCTACA CCACCACCGC CGGCACCGGC GGCACCCAGA CCCTCCCCGT CTCCGGCACC GGCCGCTACG TCCGCCTGCA CGGCACCGTC CGGGCCACCG GCTACGGCTA CTCCCTGTGG GAGTTCCAGG TCTACGGCGC CACCGGCCCC GGCCCGGACC CGGGCGCCTG GACCGAGGTG TGGCGGGACG ACTTCGACGG GACCTCCGGG ACCTCCCCGT CGGGGGCGAA CTGGATCCTG CGCACCGGCA CCCGGTATCC CGGGGGTGCC GCGAAGTGGG GCACCGGCGA GGTCGAGACC ATGAGCGCCT CGACCGCCAA CGTCTCCCTC GACGGCACCG GCAAGCTCAA CATCAGGGCC GTCAGGGACG GGGCCGGCGA CTGGACTTCC GGGCGCGTCG AGACCCAGCG CACCGACTTC GCCCCGCAGC CCGGTGAGCG GCTGAGGTTC ACCGCCAGGC TCAAGCAGCC CGACGTCCCC GACGCGACGG GCTACTGGCC CGGATTCAGG GCGACCGGCG CCGCCTACCG GGGCGACTAC AACAACTGGC CGGCTGTCGG CGAGACCGAC ATCATGACCG ACGTCAACGG CCGCAGCCAG CTGTCGAACA CGCTGCACTG CGGCACCGCC CCCGACGGCG CGTGCAACGA GTACGGCGGC CGCACCAGCG GCCTGGCCAC CTGCGAGGGC TGCCAGACCG GCTTCCACGA GTACACCCAG ATCATCGACC GCACCAGGTC GGACGAGGAG ATCCGCTTCT ACCTCGACGG CGTGCAGACA TGGGTGGTCC GCCAGAGCCA GGTCGGCGTC TCCGCCTGGC GGGCCGCCGT CCACCACGGC TTCTACCTCC GGTTCGACCT CGCCATCGGC GGGTCGCTGC CCGACGCGAT CGCCGGCTTC ACCACCCCCA CCGCGGCCAC CACCTCCGGC GGCGTCCTGA GCGTGGACTC GGTCTCGGTC TCCAGGAGCG CCGGCACCGT GCCCGCCCCG ATGACCGACC CGCCCACCCC GGCGGGGCCG AGCACCGTCA GGGTGACCGG CGGCCAGGGC AACTGGCGGC TGACCGTCAA CGGCGCGCCG TACGAGGTCA AGGGGCTCAC CTACGGCCCG CCCCAGGCCG CGGCGGACGG CTACATGCGC GACCTCAGGT CGATGGGGGT CAACACGATC CGCACCTGGG GCGTGGACGA CACCCACACT CCGGCCCTGC TCGACAGGGC CGCCCAGCAG GGCATCAAGG TGATCGTCGG GCACTGGCTG AACCAGGGCG CCGACTACGT CAACGACACC GCCTACAAGA CGGCCACCAA GAACGAGATC GTCGCCCGGG TCAACGCCCT CAAGGGCCAC CAGGGCGTGC TCATGTGGGA CGTGGGCAAC GAGGTCCTGC TCACCATGCA GGACCACGGC CTGCCCGCCC CGGTGGTCGA GGAGCGGCGC 
GTCGCCTACG CCAGGTTCGT CAACGAGCTC GCGCAGGCCA TCCACGCCGC GGACCCGGAC CACCCGGTCA CCTCGACCGA CGCCTGGACC GGCGCGTGGA CGTATTACAA GGACCACTCC CCGGCCCTCG ACCTGCTCGC GGTCAACGCC TACGGCGCCA TCGGCGGGGT CAGGCAGGCC TGGATCGACG GCGGCCACAA CAGGCCGTAC ATCATCACCG AGGCCGGCCC GGACGGCGAG TGGGAGGTGC CCGACGACGT CAACGGGGTC CCCTCCGAGC CCACCGACCT CCAGAAGCGG GCGCAGTACA CCGCGAGCTG GAACGCGGTC AAGGCTCACC CCGGCGTCGC GCTCGGCGCC ACCGAGTTCC ACTACGGCCT GGAGAACGAC TTCGGCGGCG TGTGGCTCAA CACCTTCACC GGCGGCTGGC GCCGGCTGGG ATACCACGCG CTCAGGCAGG CCTACACCGG CCTGCCCTCC GGCAACATGC CCCCGGAGAT CACCGGCATG ACGGTCGGCT CGCAGACCGC CGTCCCGGCG GGCGGCCGGT TCACCGTCGA GGTGACGGCC TCCGATCCCG ACAACGATCA GATCCGCTAC AACCTGATGT ACAGCGACAA GCACGTCAAC GGCAACAGGG GGTTCGGCCA CGTCGGGTTC ACCCAGACCG GTCCGGGCCG GTTCTCGGTG ACCGCGCCCG AGCGGCTGGG CGTCTGGAAG GTCTACGTCT ACGCCTTCGA CGGCCACGGC AACGTCGGCA TCGAGACGAG GTCGTTCCGC GTCGTTCCGC CGCCGGTCGG CGGCACGAAC GTGGCGCTCG GCAGACCGAC CACCGCGTCG TCCCACCAGC CGACGGGCGA CGGCGGGCCG TTCCTGCCCG CCAAGGCCAC CGACGGCAGC TTCACCACCC GCTGGGCGAG CGAGTGGAGC GACGCCCAGT GGCTCCAGGT CGACCTCGGC CAGGTGACCC CGGTCAACCG CGTCCGGCTC GGCTGGGAGG GCGCCTACGG CAAGGCCTAC CAGATCCAGA CCTCGGCCAA CGGCTCGGAC TGGAGCACCG TCCACTCCAC CACGACCGGT GACGGCGGAT TCGACACCAT CGACGTCAGC GCGTCCGCCA GGTACGTGCG GCTGAACCTC ACCCAGCGCG CGACCGCCTG GGGCTACTCC CTGTGGGAGT TCGGCGTCTA CCGGTCCTGA
|
Protein sequence | MHPSRSPGAG RLSLLISTLI ALIASGLAVT APAAAAAAAD TLLSQGRAAT ASSTEGAAYG AAAAFDGDPA TRWASAFSDP QWIQVDLGAT ATVSQVVLNW EAAYGRAFKI QISADGTAWN DAYTTTAGTG GTQTLPVSGT GRYVRLHGTV RATGYGYSLW EFQVYGATGP GPDPGAWTEV WRDDFDGTSG TSPSGANWIL RTGTRYPGGA AKWGTGEVET MSASTANVSL DGTGKLNIRA VRDGAGDWTS GRVETQRTDF APQPGERLRF TARLKQPDVP DATGYWPGFR ATGAAYRGDY NNWPAVGETD IMTDVNGRSQ LSNTLHCGTA PDGACNEYGG RTSGLATCEG CQTGFHEYTQ IIDRTRSDEE IRFYLDGVQT WVVRQSQVGV SAWRAAVHHG FYLRFDLAIG GSLPDAIAGF TTPTAATTSG GVLSVDSVSV SRSAGTVPAP MTDPPTPAGP STVRVTGGQG NWRLTVNGAP YEVKGLTYGP PQAAADGYMR DLRSMGVNTI RTWGVDDTHT PALLDRAAQQ GIKVIVGHWL NQGADYVNDT AYKTATKNEI VARVNALKGH QGVLMWDVGN EVLLTMQDHG LPAPVVEERR VAYARFVNEL AQAIHAADPD HPVTSTDAWT GAWTYYKDHS PALDLLAVNA YGAIGGVRQA WIDGGHNRPY IITEAGPDGE WEVPDDVNGV PSEPTDLQKR AQYTASWNAV KAHPGVALGA TEFHYGLEND FGGVWLNTFT GGWRRLGYHA LRQAYTGLPS GNMPPEITGM TVGSQTAVPA GGRFTVEVTA SDPDNDQIRY NLMYSDKHVN GNRGFGHVGF TQTGPGRFSV TAPERLGVWK VYVYAFDGHG NVGIETRSFR VVPPPVGGTN VALGRPTTAS SHQPTGDGGP FLPAKATDGS FTTRWASEWS DAQWLQVDLG QVTPVNRVRL GWEGAYGKAY QIQTSANGSD WSTVHSTTTG DGGFDTIDVS ASARYVRLNL TQRATAWGYS LWEFGVYRS
|
| |
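The gene sequence starts with GTG while the protein starts with M. That is expected under translation table 11, where GTG is an accepted alternative start codon that is read as methionine at the initiation position. The following is a minimal sketch of such a translation, not the database's own pipeline: the helper name `translate_cds` is hypothetical, the codon table is built by hand from the standard NCBI ordering, and the input is assumed to be already in coding orientation, as the displayed sequence appears to be.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same assignments as the standard code; only the set of
# permitted start codons differs.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding-strand sequence; spaces and newlines are ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        if aa == "*":
            break     # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate_cds("GTGCATCCAT CCCGCAGCCC CGGCGCGGGC"))  # MHPSRSPGAG
```

The printed prefix agrees with the first ten residues of the protein sequence in the record.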