Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_5002 |
Symbol | |
ID | 8668296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5527605 |
End bp | 5528885 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | ABC-type sugar transport system periplasmic component-like protein |
Protein accession | YP_003340544 |
Protein GI | 271966348 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.287585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.158884 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAGGT ACTGGACGGG ACTGGCCATG GCGGCCGTCA TGGTCGTCTC GGGGTGCGGG CTCGACGCGG CCGACCCGCA GGACGAGGCC GCGCCGACCG CGGCCGCCAC CGGCGAGGTG AAGGGCAAGG TCACGCTGCA GACGTGGGCG CTGAAGCCCA AGTTCACCTC GTACATGGAG AGCGTGATCA GCGCCTTCGA GAAGAAGCAC CCGGGCACCG ACGTGGCGTG GCTCGACCAG CCCGCCGACG CCTACTCGGA CAAGGTGCTC AGCCAGGCCG CGGGCGGCAC TCTGCCCGAC GTCACCAACC TGCCACCCGA CTTCGCCCGT CCGCTGGTCA AGCAGGGCAT GCTGCTCGAC GTGTCCACGG TGGACTCCAA GCTGGCCGAC GAGTACGTCG CCGGTGGCCT CGACGCATAC AGGTTCAGCG GCCACCAGGG CACCTACGGC TACCCGTGGT ACCTCAACAC GGACGTCAAC TACTGGAACA GCGAGTTGAT GGCGGAGTAC GGGCTGGACC CGAAGCAGCC GCCGGCGTCC TTCGACGAAC TGGTCGAGCA GGCCCGGACC ATGAAGGAGA AGTCGGGCGG CAAGGTCCTG CTGATGAGCC GCCGCCCTGA GATCGGGGAC CTGAGGGAGG CGGGGGTCAA GCTCCTGTCC GACGACGGCA AGAAGTTCGT CTTCAACACG CCCGAGGCCG CCGCACTGGT CGACAAGTAC CGCGCCGCCT TCAAGGAGGG CCTGATGCCG CGCGACGTGC TGACCGACAC CTACGCCGGC AACGCCAAGC TGTTCAACGA AGGCGCGGTG GCCTGGACCA CCGGCGGCGG CAACCACATC ACGAGCGTGA CCAACGAGAA CCCCTCGCTC GCGCCCAAGA TCGTGGCCTC CCCCGACTTC GGTACCCCGC CGCTGTACGT TCAGGGCCTG TCCATCTCGA AGAAGACCAC GAACCTGCCG ACGGCGATCG CGCTGGCCCG CTGGGTGACC AGCCCGGAGA ACCAGGCCGC GTTCGCCCAC CTGGTCAGCA TCTTCCCGAG CACGAAGTCC TCGGCGAACG ACCCGTTCTT CAGCAAGAGC GACGGCACCA ACGCCGGCGA CGCCAAGGTC ATCGCCTTCA ACTCGCTGGC CAAGGCGGAG GTTCTCCAGC CCTACGAGGT CACCGACGCG ATGAGCAAGT TCATCCGGCA GCAGATCTCG GCCGCCCTCA ACGGTCAGAT CTCCTCGAAA GAGGCCCTGG ACGCCGCGGT CGCCAAGTGC AACGAGCTGC TCAACCAGTG A
|
Protein sequence | MRRYWTGLAM AAVMVVSGCG LDAADPQDEA APTAAATGEV KGKVTLQTWA LKPKFTSYME SVISAFEKKH PGTDVAWLDQ PADAYSDKVL SQAAGGTLPD VTNLPPDFAR PLVKQGMLLD VSTVDSKLAD EYVAGGLDAY RFSGHQGTYG YPWYLNTDVN YWNSELMAEY GLDPKQPPAS FDELVEQART MKEKSGGKVL LMSRRPEIGD LREAGVKLLS DDGKKFVFNT PEAAALVDKY RAAFKEGLMP RDVLTDTYAG NAKLFNEGAV AWTTGGGNHI TSVTNENPSL APKIVASPDF GTPPLYVQGL SISKKTTNLP TAIALARWVT SPENQAAFAH LVSIFPSTKS SANDPFFSKS DGTNAGDAKV IAFNSLAKAE VLQPYEVTDA MSKFIRQQIS AALNGQISSK EALDAAVAKC NELLNQ
|
| |