Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_6890 |
Symbol | |
ID | 8670200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | - |
Start bp | 7590220 |
End bp | 7592523 |
Gene Length | 2304 bp |
Protein Length | 767 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | cellobiohydrolase A (1 4-beta-cellobiosidase A)- like protein |
Protein accession | YP_003342336 |
Protein GI | 271968140 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGTTC ATCCGCGCCG TCAAGCCTGG CGCAATCGGG CGGTGACCAC GGCGGCCCTG CTCGTGGCCG CGGCCGGTCT CGCCACCGGG CAGGCCGCCA CGGCACACGC CGCAGTCTCG TGCGACGTCG CCTACAGCAC CAACGAATGG CAGGGCGGTT TCACCGCCAG CGTCACGGTC AAGAACCTCG GTGACGCGCT CGCCGGCTGG ACCCTCGGCT TCGCCTTCCC CGGCACCCAG CAGGTCCAGC AGGGCTGGTC GGCGACCTGG AGCCAGACCG GCAACCAGGT CACCGCGAAG AACCTCGACT GGAACGGCAA CCTCGCCACC GGCGCGTCGA CCAGCATCGG CTTCAACGGG TCGTGGAGCG GGAGCAACCC CAAGCCCACC GCCTTCACGG TCAACGGCGT CACCTGTGGC GGCCAGCCGC CCGTAAACCA GGCTCCGACG GTCAGCCTGA CCAGCCCGGC GAGCGGCGCC TCCATCCCCG CGGGCTCCGC CGTACCGCTC GCGGCGACGG CCGGCGACGA CGGCGCGGTC AGCAAGGTCG AGTTCTACGT CGACGGCGCG CTCGTCAACA CCGACACCTC CGCCCCGTAC GGCTACTCGG CGACGGGCGT GGCGGCGGGC AGCCACACCG CCAGGGCGAA GGCCTACGAC AATGGGACCC CGGTCCTGTC CGCCGAGACC GCCGAGGTCC CCTTCACCGT CGGCTCCGAC GGGGGCACCG CGGCGGTCGT CGCCTCCGCG ACCTCGGTCA GCGTCCCCGA AGGCGGCTCC AAGACGGTCG GCTTCAGGCT CAGCAAGGCG CCCAGCGGCA ACGTCACGGT CAACCTGACC AAGACCGGTG ACGCCGACCT CACCATCGCG CCCTCCACGC TCACCTTCAC CCCCGCCAAC TGGAACACGG CGCAGAACGT GACCGTCTCG GCCGCCCAGG ACGCCGACCA GACCGACGGC ACCGCCACCA TCGCGGCCGC CGCCACCGGG CACACCGGTG TCTCCGTCAC CGCGACCGAG TCCGACGACG ACGTCGTCAC GCAGCCCGGC GAGCACGTCG AGAACCCGTA CGCCGGGGCC ACCGGCTACG TGAACCCCGA CTGGGCGGCG AAGGCCGCGG CGGAGCCGGG CGGCGACGCG GTGGCCGACA TCTCGACCGG CGTGTGGCTC GACCGCATCG CCGCCATCGA GGGCACCGCC TCCGCCAGGG GCCTCCGCGC CCACCTGGAG GAGGCCCTCA GGCAGGACGC GGCCAACGGC AGCAAGCCCC TGACGATCCA GTTCGTCATC TACAACCTGC CCAACCGCGA CTGCTCGGCG CTGGCCTCCA ACGGTGAGCT GCTCATCGCC CAGAACGGGC TTAACCGCTA CAAGACCGAG TACATCGACC CGATCGCGGC GATCATGGCC GAGCCGAGGT ACGCCACGCT CCGGATCTCG ACGGTCATCG AGATCGACTC GCTGCCCAAC CTGATCACCA ACCTCAACGT GCCCAAGTGC CAGGAGGCCA AGTCGAGCGG CGCCTACGTC GACGGCGTCC GCTACGCCCT GAACAAGCTG CACGCGATCA AGAACGTCTA CACCTACATC GACGCCGCGC ACCACGGCTG GCTGGGCTGG GACACCAACT TCGGCCCCTC GGCCGACCTG TTCGCGAGCA CCGTCGCCGG TACCACGGCT GGGTTCGACA GCGTCGACGG CTTCATCACC AACACCGCCA ACTACTCGGC TCTGAAGGAG CCGCACTTCA CCATCAACAC CACCGTCAAC GGCACCACGG TGCGCCAGTC GCGGTGGCTC GACTGGAACT TCTACGTCGA CGAGCTGTCC TACGCCCAGG CCTTCCGGAC CCTGCTCATC CAGAAGGGCT TCAAGCCGGG CCTCGGCATG CTGATCGACA CCTCCCGCAA CGGGTGGGGC GGTACGGCCC GCCCGGCCGC TCCCAGCACC TCCACCGACG TGAACACCTT CGTCAACCAG TCGCGGGTGG ACCGCCGCAT CCACGCCGGC AACTGGTGCA ACCAGAGCGG CGCCGGTCTC GGCGAGCGCC CGCAGGCCAG CCCCGCCGCC GGCCTCGACG CCTACGTCTG GATCAAGCCC CCGGGCGAGT CCGACGGCGC CAGCAAGCTC ATCCCCAACG ACGAGGGCAA GGGCTTCGAC CGGATGTGCG ACCCGACCTA CACCGGCAAC GAGCGCAACG GCAACAACAT GACCGGCTCC CTGGCCGACG CCCCGCTCTC CGGCCAGTGG TTCTCGGCGC AGTTCCGTGA GCTGCTCAAG AACGCCTACC CCGCCCTGCC GTAA
|
Protein sequence | MRVHPRRQAW RNRAVTTAAL LVAAAGLATG QAATAHAAVS CDVAYSTNEW QGGFTASVTV KNLGDALAGW TLGFAFPGTQ QVQQGWSATW SQTGNQVTAK NLDWNGNLAT GASTSIGFNG SWSGSNPKPT AFTVNGVTCG GQPPVNQAPT VSLTSPASGA SIPAGSAVPL AATAGDDGAV SKVEFYVDGA LVNTDTSAPY GYSATGVAAG SHTARAKAYD NGTPVLSAET AEVPFTVGSD GGTAAVVASA TSVSVPEGGS KTVGFRLSKA PSGNVTVNLT KTGDADLTIA PSTLTFTPAN WNTAQNVTVS AAQDADQTDG TATIAAAATG HTGVSVTATE SDDDVVTQPG EHVENPYAGA TGYVNPDWAA KAAAEPGGDA VADISTGVWL DRIAAIEGTA SARGLRAHLE EALRQDAANG SKPLTIQFVI YNLPNRDCSA LASNGELLIA QNGLNRYKTE YIDPIAAIMA EPRYATLRIS TVIEIDSLPN LITNLNVPKC QEAKSSGAYV DGVRYALNKL HAIKNVYTYI DAAHHGWLGW DTNFGPSADL FASTVAGTTA GFDSVDGFIT NTANYSALKE PHFTINTTVN GTTVRQSRWL DWNFYVDELS YAQAFRTLLI QKGFKPGLGM LIDTSRNGWG GTARPAAPST STDVNTFVNQ SRVDRRIHAG NWCNQSGAGL GERPQASPAA GLDAYVWIKP PGESDGASKL IPNDEGKGFD RMCDPTYTGN ERNGNNMTGS LADAPLSGQW FSAQFRELLK NAYPALP
|
| |