Gene Information |
Locus tag | Sros_4920 |
Symbol | |
ID | 8668214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5450261 |
End bp | 5451895 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | Spore coat assembly protein-like protein |
Protein accession | YP_003340472 |
Protein GI | 271966276 |
COG category | |
COG ID | |
TIGRFAM ID | |
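
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS with one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Sros_4920.
start_bp, end_bp = 5450261, 5451895

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1635 544
```

Under these assumptions the computed values agree with the reported Gene Length (1635 bp) and Protein Length (544 aa).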
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAGC TCAAGCATCG TATTCCGGTC AAACTCCGGC ACAACTGGAA GCTGGTCGCG CTGTGCGCGG CGTTCCTGGC CCTGTGCTGG GGTGTGCTCG GCAGCGGGAT GATCCGCCCC TACGTCACCA TCTCCCAGGC CGCCACGGCC GAGACCGTCA TCACGAACCT GACCGGCACC AAGGACCTGT TCGACACCTC TGTGGCGCAC GAGGTGAAGC TCACCTTCAC CGACGCGGCC TATGAGGACA TGCTGAGCGA GTACTTCAAG GACGGCGAGA AGAAGTACCT CGAAGCCGAC CTGACCATCG ACGGCGTCCG CATCCCCAGC GTGGGCGTCC GGCTCAAGGG CAACTCCACC CTGTCCGGGC TGACCTGGAA GGGACAGTCC AAGCAGCGGA GCATGCCCGG CGGCGGACAG CGGCGCGGAG GTATGCCCGA GGGCTTCCAA CCACCCGAAG GTTTCCAACC ACCTGAGGGA TTCCAGCCGC CCGAGGGCGG CGAGCCATCC GAAGGCTCCC GGTCACCGGG CAATGGGCCG CCGCCAGTCG GGCAGCAGGG CAATGGCGAG CAGCCCAGAG GTGGCGGGTT CGGCGCGTCT TTGAAGGGTG AGGAGCCGGA GAACCTGCCG TGGCTGATCA GCTTCGACGA GTTCGTGGAG GGCCGCCGCT ACCAGGGGCA CAGCCAGGTG GCAGTACGGC CGGCCGCGAT GGGCTCGACG ACGATGCTGA ACGAGGCACT GGGCATCGCA CTGGTCGGCG CCTCCGGTGA GCCCACCCAG CGTTCGGCCC ACAGCGCCTT CACTGTCAAC GGCCGCACCT CGACACCGCG TCTGCTCGTG GAATACCTCG ACGAGGGCTA TGCCGAAGGC CTCGGCGAAG GCGTGCTGTA CAAGTCGCTG GCCGGCAGCT CCTTCAGCTA CAAGGGCGAG GACCAGACCG GGTACACCAA CGACTTCAAG CAGATCAACA AGGTCGGCGG CCAGGATCTA CAGCCGGTCA TCGATCTGGT CAAATGGGTG AACCAAGCCT CCGACCCCGA ATTCGCCGCA GGCCTGGGCG AGCGTTTGGA CGTGGAGTCG TTCGCCCGTT ATCTGGTCCT GCAGAACCTC ATGGTCAACT TCGACGACAT GGCCGGGCCC GGACGCAACT ACTACCTGTG GTACGACCTG GACACCAAGA AGTTCAAGGT CATCACCTGG GACCTCAACC TCGCCTTCAG CGGTAACGCC AAGAGCGGTG TGAACGACAC GGTCACGATG GGCTTCGGCC GGGGACGCCC CCAGCAGGAC CAGCAGGACC AGCAGGACCA GCCGCCTCAA GGTTTCACCC CGCCCCAGGA CGGCCCTCAG CAGCCTCCAG AAGGAGGCAT GATGCGCATC GGCCATCCTC TGAAGGAGAG GTTCCTCAAG AACGCCATCT TCAAGAAGGT CTACCAGGAG CAGTATCGTG CTCTGTATGC CAAGCTGCTC GGCAACGGCA CCGCATCCGG CCTGCTCAAC GATCTCGCCA CGTCCTACAA GCTGAACGAG GACGCCGACA CCGCCAAGGC CGACACCGAG GCGCAGAACC TGCGCACGTT CCTGCAGACC CGCACTCAGA CACTGCGGTC CGACAAGGCG ATCAGCGGCG GGTAG
|
Protein sequence | MAQLKHRIPV KLRHNWKLVA LCAAFLALCW GVLGSGMIRP YVTISQAATA ETVITNLTGT KDLFDTSVAH EVKLTFTDAA YEDMLSEYFK DGEKKYLEAD LTIDGVRIPS VGVRLKGNST LSGLTWKGQS KQRSMPGGGQ RRGGMPEGFQ PPEGFQPPEG FQPPEGGEPS EGSRSPGNGP PPVGQQGNGE QPRGGGFGAS LKGEEPENLP WLISFDEFVE GRRYQGHSQV AVRPAAMGST TMLNEALGIA LVGASGEPTQ RSAHSAFTVN GRTSTPRLLV EYLDEGYAEG LGEGVLYKSL AGSSFSYKGE DQTGYTNDFK QINKVGGQDL QPVIDLVKWV NQASDPEFAA GLGERLDVES FARYLVLQNL MVNFDDMAGP GRNYYLWYDL DTKKFKVITW DLNLAFSGNA KSGVNDTVTM GFGRGRPQQD QQDQQDQPPQ GFTPPQDGPQ QPPEGGMMRI GHPLKERFLK NAIFKKVYQE QYRALYAKLL GNGTASGLLN DLATSYKLNE DADTAKADTE AQNLRTFLQT RTQTLRSDKA ISGG