Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_4476 |
Symbol | |
ID | 8667770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4992083 |
End bp | 4994278 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Alpha-L-fucosidase |
Protein accession | YP_003340086 |
Protein GI | 271965890 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.766125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.418244 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCGAA AACTGCTCTT AGTTCCCGCT GTCGGCCTCC TTGTCATCGG ATCTATCTTC GCCACGCAGC CTGTCGCGAG TGCCGACACG CCCCCTGTGT ACGAGCCCAC GGTGGAGTCG CTGAACAGCC ATCCGGTGCC TCAGTGGTTC AACGACGACA AGTTCGGCAT CTTCATCCAC TGGGGCGCCT ACTCGGTGCC CGCGTGGGGA CCGCGCGGCA GCTACGCCGA GTGGTACGGG AACTACATGA ACGCGGGCGG GAGTGCGACG AACGCCCACC ACAAGGCCAC CTACGGCCAG GACTTCAACT ACGATGCCTT CCTGCAGCAG TGGAAGGCGG AGAAGTTCGA CCCGGCCGAC TGGGTCAAAC TGTTCAAGGA CGCCGGCGCC AAGTATTTCG TGCTGACCTC CAAGCACCAC GAGGGCATGG CCCTGTGGGA CAGCAAGAGC AGCGGGCGTG ACAGCGTCGA CCTCGGACCC GGCCGTGACC TGGCCAAGGA GCTGTTCGAC GCGGCCCGCA AGGACGAGGC CAAACTGAAG GCCGGGTTCT ACTACTCGCT CTATGAGTGG TACAACCCCG CCTACACCGG ACGGCCGGCG ACCAATCCCT ACACGGGAGC GGAAATTCCG TACACCGGAG CACCGGGCGG CGGTGACTAC GTCAAGGACT ACATGCTCCC GCAGATGCGC GAGCTCATCG ACGGCTACGA CCCCGACATC ATCTGGTGCG ACGGCCAGTG GGAGAAGCCG GCCTCCTACT GGAACACCGC CGGGGTGATC GCCGACTACT ACAACAAGGC GCTCAACAGC GGCAAGGAGG TCGCGGTCGC CAACCGGTGC AAGATCCAGA GCGGCAACCT CGACAGCCCC GAACTGGACT TCCAGACCCC TGAGTACACC GTCAAGCCGG ACATCGACCC GGTCAAATGG GAGTCCAGCC GCGGCATCGC GCACTCCTAC GGCTACAACC AGAACGAGCC GGAGGAGGAC CACCTGACCT CCGACCAGCT CGTGGACTCG CTCGTGGACA TCGTCAGCAA GAACGGCAAC CTGCTGCTCG ACATCGGCCC GAAGGCCGAC GGCACGATCC CCGAGATCCA GCGGCAGCGA CTGCTGGACA TCGGTGCCTG GCTCAAGGCC AACGGCGAGG CGATCTACGG CACCACGTAC TGGAATCGGG CCGAGGAGAA GGGCGGCCCC GACGACGTCC GCTACACGGC GAAGGACGCC ACCCTGTACG CCACGGCGCT GAAGTGGCCG GGCGAGCAGC TTACGCTCGG CGCCGACCTG CCGGTGGACC ACAGCACCCG CATCACCATG CTCGGCTCCG GCGCGCGCCT GTCGTGGAGC CGTGACGAGC AGGGCCGCGT CGTGGTGAAC ACCCCGCAGG AGGCGGGGAA GCACGCCTAC GTCTTCAAGA TCGAGACGCC GGGCGTGCGC AGCCTGCTGC GCACCTCCTC CAGCCTGGCA AAGGAGATCG CCCCCGGTCG GACGATCAGC GGCGAGCTCA CGGTCACCAA CCCCGGCAAG AGGCACACCC CGGCCACCAA GCTCTCCCTC ACCGTGCCCC AGGGCTGGAC GGCCACGCTC GGGGCGCCTC GCGTGCGCCC GCTCGGTCCC GGTGAAAGCG TGAAGGTGCC GATCAGCGTG AGCGCCCCCG AGGGTGTGGC GCCGGCCCCC TACACGCTGG GCCTGTACCA GCGCACCGGG CGCATGGGGA CGACCACCGC CCTGCCCCTG ACGGTCAACC GCCCCAACCT CTCGCTCGGC AAGCCGGCCA CGCAGAAGAG CACCGGCTAC GACGCGCCGG CCTCCCGGGC CGTCGACGGC AACACCGGCG GCGACTGGAG CGCCGGATCG ACGACGCACA CCGCCGAGCC GGAGAAGCAG GCCTGGTGGC AGGTCGATCT CGGCGCCTCG GCCAGGCTGG ACAGCGTGGA CGTCTGGAAC CGGCTCGACT GCTGCGCCGA TCGGCTGAAG GACTTCTGGG TGATGGCCTC CGACCAGCCG TTCACCACCG ACGATCTGGA CCAGGCCCGC ACCGCGCCCG GTGTGACCGC CGTGCACGTC GGCGAGCAGG CCGGCTCGCC GAGCAAGGTC AAGCTGCCCG AGGGCACCAG GGGCAGGTAC GTCAGGATCC AGCTCGCCTC GCCGTCCAAC CCGCTGTCCC TGGCCGAGGT CCAGGTCAGA GGCTAG
|
Protein sequence | MRRKLLLVPA VGLLVIGSIF ATQPVASADT PPVYEPTVES LNSHPVPQWF NDDKFGIFIH WGAYSVPAWG PRGSYAEWYG NYMNAGGSAT NAHHKATYGQ DFNYDAFLQQ WKAEKFDPAD WVKLFKDAGA KYFVLTSKHH EGMALWDSKS SGRDSVDLGP GRDLAKELFD AARKDEAKLK AGFYYSLYEW YNPAYTGRPA TNPYTGAEIP YTGAPGGGDY VKDYMLPQMR ELIDGYDPDI IWCDGQWEKP ASYWNTAGVI ADYYNKALNS GKEVAVANRC KIQSGNLDSP ELDFQTPEYT VKPDIDPVKW ESSRGIAHSY GYNQNEPEED HLTSDQLVDS LVDIVSKNGN LLLDIGPKAD GTIPEIQRQR LLDIGAWLKA NGEAIYGTTY WNRAEEKGGP DDVRYTAKDA TLYATALKWP GEQLTLGADL PVDHSTRITM LGSGARLSWS RDEQGRVVVN TPQEAGKHAY VFKIETPGVR SLLRTSSSLA KEIAPGRTIS GELTVTNPGK RHTPATKLSL TVPQGWTATL GAPRVRPLGP GESVKVPISV SAPEGVAPAP YTLGLYQRTG RMGTTTALPL TVNRPNLSLG KPATQKSTGY DAPASRAVDG NTGGDWSAGS TTHTAEPEKQ AWWQVDLGAS ARLDSVDVWN RLDCCADRLK DFWVMASDQP FTTDDLDQAR TAPGVTAVHV GEQAGSPSKV KLPEGTRGRY VRIQLASPSN PLSLAEVQVR G
|
| |