Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_2107 |
Symbol | |
ID | 8665389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 2261965 |
End bp | 2263707 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Proline--tRNA ligase |
Protein accession | YP_003337835 |
Protein GI | 271963639 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.205273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.90716 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGCTGC GCATGTCGTC GTTGTTTCTT CGAACACTGA GGGACGACCC GGCAGACGCG GAGGTCCCGA GCCACAAGTT GCTCGTCCGC GCCGGGTATG TCCGTCGTGT CGCACCCGGC ATCTACTCCT GGCTGCCGCT CGGCAAGATG GTTCTGGAGA ACGTCACGCG CATCGTGCGC GAGGAGATGA ACCGCATGGG CGGCCAGGAG GTGCTCTTCC CCGCCCTCCT GCCCCGGGAG TACTACGAGG CCACCGGCCG CTGGACGGAG TACGGCGACA CGCTCTTCCG CCTCCAGGAC CGCAAGGGGG CCGACTACCT CCTCGGCCCC ACCCACGAGG AGATGTTCAC CGACATGGTC AAGGGGGAGT ACTCCTCCTA CAAGGACTAC CCGGTGACGC TCTACCAGAT CCAGACGAAG TACCGCGACG AGGCCCGCCC CCGGGCCGGC ATCCTGCGCG GCCGCGAGTT CGTCATGAAG GACTCCTACT CCTTCGACCT GGACGACGAC GGTCTCAAGC GCTCCTACGA GCAGCACCGC GAGACCTACA TCAGGACCTT CGACCGCCTC GGCATCAGCT ACAAGATCTG CTTCGCCACC TCCGGCGCGA TGGGCGGCTC CGCCTCCGAG GAGTTCCTCG CGCCCGCCGC GACGGGCGAG GACACCTTCG TCGCCTGCCA CCAGTGCGGC TACGCGGCCA ACGCCGAAGC GGTCACCACC CCCGCCCCCG CCGCGATCAC CGGCGAGCGG CCGGCCCTGC GGGTCCTCGA CACCCCTGAC ACCCCGACCA TCGAGTCGCT GGTCGACCAC GTCAACGAGC ACCACGGCCT GGGCATCACC GCCGCCGAGA CGCTCAAGAA CATCGTGGTC AAGGTCACCA CTCCCGGCTC CGGCAAGTCC GAGACGCTGA TCGTCGGCGT GCCGGGCGAC CGCGAGGTCG ACTTCAAGCG GCTGGAGGCC TCCCTGGCCC CCGGCGAGCC CGCCATCTTC GAGGCCGAGG ACTTCGCCAG GCACCCCGGC CTGGTCCGCG GCTACATCGG CCCGCAGGTG CTCAAGGACC TCGGCATCCG CTACCTCGTC GACCCCCGCG TGGTCGACGG CAGCGCCTGG GTGACGGGTG CCAACGAGCC CGGCAAGCAC GCCGCCGGCG TGGTCGCCGG CCGCGACTTC ACCGCCGACG GCACGATCGA GGCCGCCGAG GTCCGCGCGG GCGACGCCTG CCCGGTCTGC GGCTCCGGGC TGTCCATCGA CCGGGGCATC GAGATCGGCC ACATCTTCCA GCTCGGCCGC AAGTACGCCG ACGCCGCGAA GCTGGACGCC CTCGGCCCCG ACGGCAAGCC GATCCGCGTC ACCATGGGCT CCTACGGCGT CGGTGTCTCC CGCGCCGTGG CCGTGCTGGC CGAGCAGAGG CACGACGAGC TCGGCCTGGT CTGGCCCCGC GAGGTCGCCC CGGTGGACGT CCACGTCGTG GGCACCGGCA AGGACGGCCA GATCGAGGCC GCGAACCGGC TGGCCGAAGA CCTTCAGGCC CGCGGCCTGC GCGTGCTGGT GGACGACCGC GCCGGGGTCT CCCCGGGGGT GAAGTTCAAG GACGCCGAGC TGCTCGGCAT GCCCACGATC CTGATCATCG GCCGGGGGCT CGCCCAGGGC GTCGCCGAGC TGCGCGACCG CGTCACCGGC GTCAAGGAGG AGATCCCGAT CGACGAGGCC GCCGACCGGG TCGTGGCCGC CTGCCGCGCC TGA
|
Protein sequence | MLLRMSSLFL RTLRDDPADA EVPSHKLLVR AGYVRRVAPG IYSWLPLGKM VLENVTRIVR EEMNRMGGQE VLFPALLPRE YYEATGRWTE YGDTLFRLQD RKGADYLLGP THEEMFTDMV KGEYSSYKDY PVTLYQIQTK YRDEARPRAG ILRGREFVMK DSYSFDLDDD GLKRSYEQHR ETYIRTFDRL GISYKICFAT SGAMGGSASE EFLAPAATGE DTFVACHQCG YAANAEAVTT PAPAAITGER PALRVLDTPD TPTIESLVDH VNEHHGLGIT AAETLKNIVV KVTTPGSGKS ETLIVGVPGD REVDFKRLEA SLAPGEPAIF EAEDFARHPG LVRGYIGPQV LKDLGIRYLV DPRVVDGSAW VTGANEPGKH AAGVVAGRDF TADGTIEAAE VRAGDACPVC GSGLSIDRGI EIGHIFQLGR KYADAAKLDA LGPDGKPIRV TMGSYGVGVS RAVAVLAEQR HDELGLVWPR EVAPVDVHVV GTGKDGQIEA ANRLAEDLQA RGLRVLVDDR AGVSPGVKFK DAELLGMPTI LIIGRGLAQG VAELRDRVTG VKEEIPIDEA ADRVVAACRA
|
| |