Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3476 |
Symbol | |
ID | 8666764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3846415 |
End bp | 3848043 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | peptide arylation protein |
Protein accession | YP_003339155 |
Protein GI | 271964959 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00809458 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00477913 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCCTGC ACCTGGACGA CCTGGTCCCC GCCACGTTGA GACGGGGGTG GGCCGCCGAC GGCACCTGTC CCGACCTCGA CCTGTACGCG CTGTTCCGCA CCCACCGCCT CGCGGCCCCC GGCCGGGTCG CGGTGATCGA CGCGGCGGGC GAGCTCACCT ACGCCGAACT GGACCACCTC GCCCGCGACG CGGCGGCCGG ACTGCGGAGA CTCGGCGTCT GCGAGGGCGA CGTCGTCGGC GTCCAGCTGC CCAACGGCCG GGACGCGGTC GTCGCCGACC TGGCCCTGGC CGCGCTCGGC GCCGTGGCGC TGCCCTTCCC CGTCGGGCGC GGCATCACCG AGGCGGTCTC CCTGCTGGGG CGCTCCGGCG CGCGGGCCGT CATCGCGGCC ACCGAGCACC GGGGGACGGC GCACGCGAGC GAGCTGGTGG CCGCCGCCGA GCGACTGCCC GGCCTGCGGG CCGTCGTGGC CGCCGGTCCG CAGGAGCCTC CGGCGGGCAG TGCGGCATGG AGCGAGGTCC TCTCCGCCGA CGGCCGCGCC TTCGTCCCGG CGCGGCCCGA CCCGGACGGC GCGGCCCGCA TCCTGGTCTC CTCCGGTTCG GAGTCCGAGC CCAAGATGGT CGCCTACTCC CACAACGCGC TGGCGGGCGG ACGGGGCAAC TTCATGGCCA CGCTGATCGC CGGCGCGGAA CCGCCCCGCT GCCTGTTCCT GGTCCCGCTG GCCTCGGCCT TCGGCAGCAA CGGCACCGCC GTCACCCTGG CCAGGCACGG AGGCTCGCTG GTCCTGCTCG ACCACTTCTC GCCCCGGGGC GCGCTCGCCG CGATCGGCGA GCACCGGCCC ACCCACGTGC TGGCCGTACC GACCATGATC CGCATGATGC TCGACCAGCC GAGGCCCGGA CCGATGCCGC CGATGACCGC GCTGGTGCTG GGCGGCGCGG AGCTGGACGC GGCCACGGCC GCCGAGGCGG GCGGGGTGTT CGGCTGCCCG GTCGTCAACC TGTACGGCTC GGCCGACGGG GTGAACTGCC ACAGCGGGTT CCGTCCGCCC CCGGTGGGCG ATCGCGGTCC CGGGGTCGTG GTGGGCCTGC CGGACCCCCG GGTGGCGGAG ATCCGCATCG CCCCCGCCCC GGACGGGAAT GAGTTCGGCG AGATCATCGC ACGCGGCCCG ATGACCCCGA TGTGCTACGT CGGCGCGCCG GAACTGAACC GGCGCTACCG CACCGCGGAC GGCTGGGTCC GCACCGGCGA CCTGGGGGTG ATCGACGCCG ACGGGCGGCT GCGCCTGGTC GGCAGGCTCA AGCGGGTCGT CATCCGCGGC GGCGCCAACA TCAGCCTGGC CGAGGTGGAG CACGCGCTGG CGACCCACCC CGGGGTGCGC GAGGCGGTGT GCCTGGGCGT GCCCGACCGG GTGATGGGAG AGCGGCTGGC GGCCTGCGTG GTGCCGCGCC CCGGCCACGC CCCCGATCTC GCCGTCCTCA CCGCCCACCT GCTCCGGCAG GGGCTGGAGC GGAGCAAGCA CCCCGAGCAC CTGCTGCTGG TGGAGGAGCT GCCGCTGACC CCGGCGGGCA AGCCGGACCG GGACGCGCTC CGCGACCTGC TGCTCGGCGG GCGGCGCGGA TCGGCGTGA
|
Protein sequence | MTLHLDDLVP ATLRRGWAAD GTCPDLDLYA LFRTHRLAAP GRVAVIDAAG ELTYAELDHL ARDAAAGLRR LGVCEGDVVG VQLPNGRDAV VADLALAALG AVALPFPVGR GITEAVSLLG RSGARAVIAA TEHRGTAHAS ELVAAAERLP GLRAVVAAGP QEPPAGSAAW SEVLSADGRA FVPARPDPDG AARILVSSGS ESEPKMVAYS HNALAGGRGN FMATLIAGAE PPRCLFLVPL ASAFGSNGTA VTLARHGGSL VLLDHFSPRG ALAAIGEHRP THVLAVPTMI RMMLDQPRPG PMPPMTALVL GGAELDAATA AEAGGVFGCP VVNLYGSADG VNCHSGFRPP PVGDRGPGVV VGLPDPRVAE IRIAPAPDGN EFGEIIARGP MTPMCYVGAP ELNRRYRTAD GWVRTGDLGV IDADGRLRLV GRLKRVVIRG GANISLAEVE HALATHPGVR EAVCLGVPDR VMGERLAACV VPRPGHAPDL AVLTAHLLRQ GLERSKHPEH LLLVEELPLT PAGKPDRDAL RDLLLGGRRG SA
|
| |