Gene Sros_2892 details

Gene Information

Locus tag: Sros_2892
Symbol:
ID: 8666178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3147454
End bp: 3150975
Gene Length: 3522 bp
Protein Length: 1173 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: DNA polymerase III alpha subunit
Protein accession: YP_003338591
Protein GI: 271964395
COG category:
COG ID:
TIGRFAM ID:
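
The listed gene and protein lengths follow from the coordinates. The sketch below is a minimal check, assuming the start and end positions are 1-based and inclusive and that the CDS includes its stop codon (consistent with the gene sequence ending in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(3147454, 3150975)
    print(length)                     # 3522
    print(protein_length_aa(length))  # 1173
```

Both results agree with the Gene Length and Protein Length fields above.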


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0503022
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.978852
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGACT CGTTTGTGCA TCTTCACGTT CACACGGAGT ACTCCATGCT CGATGGAGCC 
GCTCGTCTGA AGCAGATGTT CAAACAGGTC GGTGACCTGG GCATGCCCGC CATCGCGATC
ACCGACCACG GCAACATGCA CGGCGCCTAC GACTTCTACA AGCAGGCCAC CGGCGCCGGG
ATCAAGCCGG TCATCGGCAT CGAGGCCTAC GTGGCGCCGG CCTCCCGCCA CCAGAAGAAG
CCGGTGCTGT GGGGCGAGCC CCACCAGAAG CGCGACGACG TGTCGGCCGG CGGCTACTAC
ACCCACATGA CGATCTGGGC GAAGAACGCC AAGGGTCTGA CCAACCTGAT GAAGCTCTCC
TCGCGTGCCT ACACCGAGGG CTTCGTGCGC AAGTGGGCCC GGATGGACGC CGAGACCCTC
GCCGAGCACT CCGAGGGGCT GATGGCCACC ACCGGCTGCC CGTCGGGTGA GGTCCAGACC
CGCCTGCGCC TGGGGCAGTA CGACGAGGCG CTCGCGGCGG CGGCGAAGTT CCAGGAGCTG
TTCGGCAGGG ACAACTACTA CCTGGAGATC ATGGACCACG GGCTCGACAT CGAGCGCCGG
GTCCGCGACG GCCTGACCCG CATCTCCCGG GACCTGAACA TCCCGCCCCT GGTCACCAAC
GACTCCCACT ACACCTACGA GTCGGACGCG ACCTCCCACG ACGCGCTGCT GTGCATCCAG
ACCGGCAAGC AGCTCGCCGA CCCCGACCGG TTCCGCTTCG ACGGCAGCGG CTACTACATC
AAGACCGCCG ACGAGATGCG CGCGGTCGAC TCCTCCGACC TGTGGGCCGA GGGCTGCCGC
AACACGCTCC TGGTCGCCGA GAAGGTCGAC CCGACCGGCA TGTTCGGCTT CAAGAACCTG
ATGCCGACCT TCCCCATCCC CGAGGGGGAC AGCGAGGAGA GCTGGTTCCG CAAGGAGATC
TGGAAGGGCA TGGAGCGGCG CTTCCCCGAG GGCGTCGACG AGGAGCACCG CGCCCAGATC
GAGTTCGAGA TGAACGTCAT CCTGCAGATG GGGTTCCCCT CCTACTTCCT CGTGGTCGCC
GACTTCATCA TGTGGGCGAA GAACAACGGC ATCCGGGTCG GGCCCGGCCG TGGCTCGGCG
GCGGGCTCGC TGGCGGCCTA CGCGCTGGGC ATCACCGACC TCGACCCGCT GCCGCACGGC
CTGATCTTCG AGCGGTTCCT CAACCCCGAC CGCGTCTCCA TGCCCGACGT CGACATCGAC
TTCGACGAGC GCCGGCGCGG CGACGTGATC CGCTACGTGA CCGAGAAGTA CGGCGCCGAC
AAGGTCGCCA TGATCGCCAC CTTCGGCACC ATCAAGGCGA AGGCGGCCAT CAAGGACGCC
GCCCGCGTCC TCGGCCATCC GTACGCGCTG GGCGACAAGG TCTCCAAGGC GTTCCCGCCC
GCGGTGATGG GCAAGGACAT CCCGCTGTCG GGCATCTTCG ACAAGGACCA CCCGCGCTAC
AACGAGGCCG GTGAGCTGCG CAAGCTGTAC GACGAGGACG TCGACGTCAA GTCGGCGATG
GACCTCGGCC GGGGCCTGGA GGGCCTGATC CGGCAGACCG GCGTGCACGC CGCCGGCGTG
ATCATGTCCT CGGAGGTGCT GACCGACTAC ATCCCGATCA TGCGCCGTGA CTCCGACGGT
GTGATCATCA CGCAGTTCGA CTACCCGACC TGCGAGACGC TCGGCCTGCT CAAGATGGAC
TTCCTGGGCC TGCGCAACCT CACGATCATC GACGACTGCC TGAAGATGAT CGAGGCCAAC
ACCGGCACCA AGATCGACCT GCTGAAGCTG CCGCTGGACG ACCGCAAGAC CTACGAGCTG
CTGGGCCGCG GCGACACCCT GGGCGTGTTC CAGCTGGACG GCGGCGGCAT GCGGTCGCTG
CTGCGGCTGA TGAAGCCCGA CAACTTCGAG GACATCTCCG CCGTCGGCGC GCTGTACCGG
CCGGGGCCCA TGGGCGCCGA CTCCCACACC AACTACGCGC TGCGCAAGAA CGGCCTGCAG
GACATCACCC CGATCCACCC CGAGTTCGAG GAGTCGCTGC AGGAGATCCT CGGCACGACC
CACGGCCTGA TCGTCTACCA GGAGCAGGTC ATGGCCATCG CGCAGAAGGT CGCCGGGTTC
TCCCTCGGCA AGGCCGACCT GCTGCGCCGC GCGATGGGCA AGAAGAAGAA GTCCGAGCTG
GACAAGCAGT TCGAGTCCTT TGAGCAGGGC ATGAAGGACA ACGGCTACTC GGCCGCCGCG
ATCAAGACCC TCTGGGACAT CCTGCTCCCC TTCTCCGACT ACGCCTTCAA CAAGGCGCAC
AGCGCCGCCT ACGGCCTGGT CTCCTACTGG ACCGCCTACC TCAAGGCCAA CTACCCCTCC
GAATACATGG CCGGCCTGCT GACCTCCGTC AAGGACGACA AGGACAAGTC GGCCCTCTAC
CTGAACGAGT GCCGGCGCAT GGGCATCAAG GTGCTGCCGC CGGACGTCAA CGACTCCGAC
TTCGACTTCA CCCCGCGCGG GACCGACGTC CGGTTCGGGC TGTCGGCCAT CCGCAACGTC
GGCGGCAACG TGGTCGACGG GATCATCGCC GCGCGCAGGG AGAAGACCCG CTTCGCCGAC
TTCAAGGACT TCCTGCGCAA GGTTCCCATG GTCGTCTGCA ACAAGCGGGT CATCGAGTCG
CTGATCAAGG CGGGCGCCTT CGACTCGTTC GCGCACGAGC GCAAGGGCCT GGTGATGGTC
CACGAGCAGG CCGTCGACAG CATCATCGGG ATCAAGAAGA ACGAGGCGCA GGGGCAGGAC
TCCCTGTTCG GGGCGGTCGA GGGCGCCGAG GACCAGACCT TCGACGTGCA GATCCCGCCC
GGGGAGTGGG ACAAGACCAC CCTGCTCCAG TTCGAGCGGG AGATGCTCGG CCTCTACGTC
TCCGACCACC CGCTGTTCGG CGTGGAGCAC ATCCTCGCCT CCGGCGCCGA CTGCTCGATC
GCCGCGCTCC AGGACGAGAA CCGCTCCGAC GGCCAGGTCG TCACGGTGGG CGGCATCCTG
AGCGGCGTCC AGCGCAAGGT CACCAAGAAG GGCGACACCT GGGTCCTCAC CATGCTGGAG
GACCTGGAGG GCGCCATCGA GGTGATGATC TTCCCCTCGG CGTACCAGCT GTGCGCGACG
GTGCTCGCCG AGGACGCCAT CGTCTTCGTC AAGGGCCGCC TGGACAAGCG CGAGGACGTC
GGAAAGATCA TCGCGATGGA GGTGACCGCT CCCGACCTGA CCCGCGAGAG CGGCGGCCCC
CTGGCGGTCA GCCTCCCCCT GACCCGCTGC ACCCCTCCGG TGGTCGGCCG CCTCAAGGAG
GTCCTGACCG CCCATCCCGG CACCACCGAG GTCCACCTCC AGGTCCACAA CGGCCCGAAG
ACCACCATCG TGCGCCTGGA CGACCGCCTG CGCGTGGCGC CCTCCCCGGC CCTGATGGGA
GATCTGAAGC AGCTCCTGGG TCCGGCCTGT CTCGGAGCTT GA
 
Protein sequence
MSDSFVHLHV HTEYSMLDGA ARLKQMFKQV GDLGMPAIAI TDHGNMHGAY DFYKQATGAG 
IKPVIGIEAY VAPASRHQKK PVLWGEPHQK RDDVSAGGYY THMTIWAKNA KGLTNLMKLS
SRAYTEGFVR KWARMDAETL AEHSEGLMAT TGCPSGEVQT RLRLGQYDEA LAAAAKFQEL
FGRDNYYLEI MDHGLDIERR VRDGLTRISR DLNIPPLVTN DSHYTYESDA TSHDALLCIQ
TGKQLADPDR FRFDGSGYYI KTADEMRAVD SSDLWAEGCR NTLLVAEKVD PTGMFGFKNL
MPTFPIPEGD SEESWFRKEI WKGMERRFPE GVDEEHRAQI EFEMNVILQM GFPSYFLVVA
DFIMWAKNNG IRVGPGRGSA AGSLAAYALG ITDLDPLPHG LIFERFLNPD RVSMPDVDID
FDERRRGDVI RYVTEKYGAD KVAMIATFGT IKAKAAIKDA ARVLGHPYAL GDKVSKAFPP
AVMGKDIPLS GIFDKDHPRY NEAGELRKLY DEDVDVKSAM DLGRGLEGLI RQTGVHAAGV
IMSSEVLTDY IPIMRRDSDG VIITQFDYPT CETLGLLKMD FLGLRNLTII DDCLKMIEAN
TGTKIDLLKL PLDDRKTYEL LGRGDTLGVF QLDGGGMRSL LRLMKPDNFE DISAVGALYR
PGPMGADSHT NYALRKNGLQ DITPIHPEFE ESLQEILGTT HGLIVYQEQV MAIAQKVAGF
SLGKADLLRR AMGKKKKSEL DKQFESFEQG MKDNGYSAAA IKTLWDILLP FSDYAFNKAH
SAAYGLVSYW TAYLKANYPS EYMAGLLTSV KDDKDKSALY LNECRRMGIK VLPPDVNDSD
FDFTPRGTDV RFGLSAIRNV GGNVVDGIIA ARREKTRFAD FKDFLRKVPM VVCNKRVIES
LIKAGAFDSF AHERKGLVMV HEQAVDSIIG IKKNEAQGQD SLFGAVEGAE DQTFDVQIPP
GEWDKTTLLQ FEREMLGLYV SDHPLFGVEH ILASGADCSI AALQDENRSD GQVVTVGGIL
SGVQRKVTKK GDTWVLTMLE DLEGAIEVMI FPSAYQLCAT VLAEDAIVFV KGRLDKREDV
GKIIAMEVTA PDLTRESGGP LAVSLPLTRC TPPVVGRLKE VLTAHPGTTE VHLQVHNGPK
TTIVRLDDRL RVAPSPALMG DLKQLLGPAC LGA
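
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not a tool used by this record: it strips the 10-character block spacing, translates the DNA with the standard codon assignments (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons), and computes a GC fraction. All function names are hypothetical. Because this gene starts with ATG, the sketch does not handle alternative start codons.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    first_line = "ATGTCCGACT CGTTTGTGCA TCTTCACGTT CACACGGAGT ACTCCATGCT CGATGGAGCC"
    print(translate(first_line))  # MSDSFVHLHVHTEYSMLDGA
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the listed protein sequence and the GC content field.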