Gene Information |
Locus tag | Sros_2892 |
Symbol | |
ID | 8666178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3147454 |
End bp | 3150975 |
Gene Length | 3522 bp |
Protein Length | 1173 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | DNA polymerase III alpha subunit |
Protein accession | YP_003338591 |
Protein GI | 271964395 |
COG category | |
COG ID | |
TIGRFAM ID | |
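
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record: Start bp 3147454, End bp 3150975
length = gene_length_bp(3147454, 3150975)
print(length, protein_length_aa(length))  # 3522 1173
```

Both results agree with the Gene Length (3522 bp) and Protein Length (1173 aa) fields listed in the record.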
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0503022 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.978852 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCGACT CGTTTGTGCA TCTTCACGTT CACACGGAGT ACTCCATGCT CGATGGAGCC GCTCGTCTGA AGCAGATGTT CAAACAGGTC GGTGACCTGG GCATGCCCGC CATCGCGATC ACCGACCACG GCAACATGCA CGGCGCCTAC GACTTCTACA AGCAGGCCAC CGGCGCCGGG ATCAAGCCGG TCATCGGCAT CGAGGCCTAC GTGGCGCCGG CCTCCCGCCA CCAGAAGAAG CCGGTGCTGT GGGGCGAGCC CCACCAGAAG CGCGACGACG TGTCGGCCGG CGGCTACTAC ACCCACATGA CGATCTGGGC GAAGAACGCC AAGGGTCTGA CCAACCTGAT GAAGCTCTCC TCGCGTGCCT ACACCGAGGG CTTCGTGCGC AAGTGGGCCC GGATGGACGC CGAGACCCTC GCCGAGCACT CCGAGGGGCT GATGGCCACC ACCGGCTGCC CGTCGGGTGA GGTCCAGACC CGCCTGCGCC TGGGGCAGTA CGACGAGGCG CTCGCGGCGG CGGCGAAGTT CCAGGAGCTG TTCGGCAGGG ACAACTACTA CCTGGAGATC ATGGACCACG GGCTCGACAT CGAGCGCCGG GTCCGCGACG GCCTGACCCG CATCTCCCGG GACCTGAACA TCCCGCCCCT GGTCACCAAC GACTCCCACT ACACCTACGA GTCGGACGCG ACCTCCCACG ACGCGCTGCT GTGCATCCAG ACCGGCAAGC AGCTCGCCGA CCCCGACCGG TTCCGCTTCG ACGGCAGCGG CTACTACATC AAGACCGCCG ACGAGATGCG CGCGGTCGAC TCCTCCGACC TGTGGGCCGA GGGCTGCCGC AACACGCTCC TGGTCGCCGA GAAGGTCGAC CCGACCGGCA TGTTCGGCTT CAAGAACCTG ATGCCGACCT TCCCCATCCC CGAGGGGGAC AGCGAGGAGA GCTGGTTCCG CAAGGAGATC TGGAAGGGCA TGGAGCGGCG CTTCCCCGAG GGCGTCGACG AGGAGCACCG CGCCCAGATC GAGTTCGAGA TGAACGTCAT CCTGCAGATG GGGTTCCCCT CCTACTTCCT CGTGGTCGCC GACTTCATCA TGTGGGCGAA GAACAACGGC ATCCGGGTCG GGCCCGGCCG TGGCTCGGCG GCGGGCTCGC TGGCGGCCTA CGCGCTGGGC ATCACCGACC TCGACCCGCT GCCGCACGGC CTGATCTTCG AGCGGTTCCT CAACCCCGAC CGCGTCTCCA TGCCCGACGT CGACATCGAC TTCGACGAGC GCCGGCGCGG CGACGTGATC CGCTACGTGA CCGAGAAGTA CGGCGCCGAC AAGGTCGCCA TGATCGCCAC CTTCGGCACC ATCAAGGCGA AGGCGGCCAT CAAGGACGCC GCCCGCGTCC TCGGCCATCC GTACGCGCTG GGCGACAAGG TCTCCAAGGC GTTCCCGCCC GCGGTGATGG GCAAGGACAT CCCGCTGTCG GGCATCTTCG ACAAGGACCA CCCGCGCTAC AACGAGGCCG GTGAGCTGCG CAAGCTGTAC GACGAGGACG TCGACGTCAA GTCGGCGATG GACCTCGGCC GGGGCCTGGA GGGCCTGATC CGGCAGACCG GCGTGCACGC CGCCGGCGTG ATCATGTCCT CGGAGGTGCT GACCGACTAC ATCCCGATCA TGCGCCGTGA CTCCGACGGT GTGATCATCA CGCAGTTCGA CTACCCGACC TGCGAGACGC TCGGCCTGCT CAAGATGGAC TTCCTGGGCC TGCGCAACCT CACGATCATC GACGACTGCC TGAAGATGAT CGAGGCCAAC 
ACCGGCACCA AGATCGACCT GCTGAAGCTG CCGCTGGACG ACCGCAAGAC CTACGAGCTG CTGGGCCGCG GCGACACCCT GGGCGTGTTC CAGCTGGACG GCGGCGGCAT GCGGTCGCTG CTGCGGCTGA TGAAGCCCGA CAACTTCGAG GACATCTCCG CCGTCGGCGC GCTGTACCGG CCGGGGCCCA TGGGCGCCGA CTCCCACACC AACTACGCGC TGCGCAAGAA CGGCCTGCAG GACATCACCC CGATCCACCC CGAGTTCGAG GAGTCGCTGC AGGAGATCCT CGGCACGACC CACGGCCTGA TCGTCTACCA GGAGCAGGTC ATGGCCATCG CGCAGAAGGT CGCCGGGTTC TCCCTCGGCA AGGCCGACCT GCTGCGCCGC GCGATGGGCA AGAAGAAGAA GTCCGAGCTG GACAAGCAGT TCGAGTCCTT TGAGCAGGGC ATGAAGGACA ACGGCTACTC GGCCGCCGCG ATCAAGACCC TCTGGGACAT CCTGCTCCCC TTCTCCGACT ACGCCTTCAA CAAGGCGCAC AGCGCCGCCT ACGGCCTGGT CTCCTACTGG ACCGCCTACC TCAAGGCCAA CTACCCCTCC GAATACATGG CCGGCCTGCT GACCTCCGTC AAGGACGACA AGGACAAGTC GGCCCTCTAC CTGAACGAGT GCCGGCGCAT GGGCATCAAG GTGCTGCCGC CGGACGTCAA CGACTCCGAC TTCGACTTCA CCCCGCGCGG GACCGACGTC CGGTTCGGGC TGTCGGCCAT CCGCAACGTC GGCGGCAACG TGGTCGACGG GATCATCGCC GCGCGCAGGG AGAAGACCCG CTTCGCCGAC TTCAAGGACT TCCTGCGCAA GGTTCCCATG GTCGTCTGCA ACAAGCGGGT CATCGAGTCG CTGATCAAGG CGGGCGCCTT CGACTCGTTC GCGCACGAGC GCAAGGGCCT GGTGATGGTC CACGAGCAGG CCGTCGACAG CATCATCGGG ATCAAGAAGA ACGAGGCGCA GGGGCAGGAC TCCCTGTTCG GGGCGGTCGA GGGCGCCGAG GACCAGACCT TCGACGTGCA GATCCCGCCC GGGGAGTGGG ACAAGACCAC CCTGCTCCAG TTCGAGCGGG AGATGCTCGG CCTCTACGTC TCCGACCACC CGCTGTTCGG CGTGGAGCAC ATCCTCGCCT CCGGCGCCGA CTGCTCGATC GCCGCGCTCC AGGACGAGAA CCGCTCCGAC GGCCAGGTCG TCACGGTGGG CGGCATCCTG AGCGGCGTCC AGCGCAAGGT CACCAAGAAG GGCGACACCT GGGTCCTCAC CATGCTGGAG GACCTGGAGG GCGCCATCGA GGTGATGATC TTCCCCTCGG CGTACCAGCT GTGCGCGACG GTGCTCGCCG AGGACGCCAT CGTCTTCGTC AAGGGCCGCC TGGACAAGCG CGAGGACGTC GGAAAGATCA TCGCGATGGA GGTGACCGCT CCCGACCTGA CCCGCGAGAG CGGCGGCCCC CTGGCGGTCA GCCTCCCCCT GACCCGCTGC ACCCCTCCGG TGGTCGGCCG CCTCAAGGAG GTCCTGACCG CCCATCCCGG CACCACCGAG GTCCACCTCC AGGTCCACAA CGGCCCGAAG ACCACCATCG TGCGCCTGGA CGACCGCCTG CGCGTGGCGC CCTCCCCGGC CCTGATGGGA GATCTGAAGC AGCTCCTGGG TCCGGCCTGT CTCGGAGCTT GA
Protein sequence | MSDSFVHLHV HTEYSMLDGA ARLKQMFKQV GDLGMPAIAI TDHGNMHGAY DFYKQATGAG IKPVIGIEAY VAPASRHQKK PVLWGEPHQK RDDVSAGGYY THMTIWAKNA KGLTNLMKLS SRAYTEGFVR KWARMDAETL AEHSEGLMAT TGCPSGEVQT RLRLGQYDEA LAAAAKFQEL FGRDNYYLEI MDHGLDIERR VRDGLTRISR DLNIPPLVTN DSHYTYESDA TSHDALLCIQ TGKQLADPDR FRFDGSGYYI KTADEMRAVD SSDLWAEGCR NTLLVAEKVD PTGMFGFKNL MPTFPIPEGD SEESWFRKEI WKGMERRFPE GVDEEHRAQI EFEMNVILQM GFPSYFLVVA DFIMWAKNNG IRVGPGRGSA AGSLAAYALG ITDLDPLPHG LIFERFLNPD RVSMPDVDID FDERRRGDVI RYVTEKYGAD KVAMIATFGT IKAKAAIKDA ARVLGHPYAL GDKVSKAFPP AVMGKDIPLS GIFDKDHPRY NEAGELRKLY DEDVDVKSAM DLGRGLEGLI RQTGVHAAGV IMSSEVLTDY IPIMRRDSDG VIITQFDYPT CETLGLLKMD FLGLRNLTII DDCLKMIEAN TGTKIDLLKL PLDDRKTYEL LGRGDTLGVF QLDGGGMRSL LRLMKPDNFE DISAVGALYR PGPMGADSHT NYALRKNGLQ DITPIHPEFE ESLQEILGTT HGLIVYQEQV MAIAQKVAGF SLGKADLLRR AMGKKKKSEL DKQFESFEQG MKDNGYSAAA IKTLWDILLP FSDYAFNKAH SAAYGLVSYW TAYLKANYPS EYMAGLLTSV KDDKDKSALY LNECRRMGIK VLPPDVNDSD FDFTPRGTDV RFGLSAIRNV GGNVVDGIIA ARREKTRFAD FKDFLRKVPM VVCNKRVIES LIKAGAFDSF AHERKGLVMV HEQAVDSIIG IKKNEAQGQD SLFGAVEGAE DQTFDVQIPP GEWDKTTLLQ FEREMLGLYV SDHPLFGVEH ILASGADCSI AALQDENRSD GQVVTVGGIL SGVQRKVTKK GDTWVLTMLE DLEGAIEVMI FPSAYQLCAT VLAEDAIVFV KGRLDKREDV GKIIAMEVTA PDLTRESGGP LAVSLPLTRC TPPVVGRLKE VLTAHPGTTE VHLQVHNGPK TTIVRLDDRL RVAPSPALMG DLKQLLGPAC LGA
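
The sequences above are printed in blocks of 10 characters separated by spaces, so they need cleaning before any computation. The sketch below, using only the Python standard library, shows one way to strip the spacing, compute GC content (the record reports 67% for the full gene), and run a basic sanity check on a coding sequence. The helper names are hypothetical, and the check assumes an ATG start codon plus the three stop codons of translation table 11; that table also permits alternative start codons, which this simple check does not cover.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Rough check: length divisible by 3, ATG start (assumed), and a table 11 stop codon."""
    seq = clean_sequence(raw)
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First 20 bases of the gene sequence in this record
fragment = "ATGTCCGACT CGTTTGTGCA"
print(gc_percent(fragment))  # 50.0
```

Passing the full Gene sequence field to `gc_percent` should give a value comparable to the GC content field, and `looks_like_complete_cds` can flag a copy that was truncated during extraction.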