Gene Sros_2966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2966 
Symbol 
ID8666253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3228761 
End bp3231466 
Gene Length2706 bp 
Protein Length901 aa 
Translation table11 
GC content70% 
IMG OID 
ProductDNA-directed DNA polymerase 
Protein accessionYP_003338664 
Protein GI271964468 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.254596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGAAGA GCGAAGCGAC CCCCACTCGT CCGTGTCTCC TGCTGCTGGA CGGGCATTCA 
TTGGCCTACC GGGCCTTCTA TGCCCTGCCC GAGGAGAATT TCTCCACGAC GACGGGTCAG
ACGACAAACG CGGTGTTCGG GTTCACGTCG ATGCTGGTCA ACGTGCTCCG CGACGAGCAG
CCCACGCATG TGGCGGTCTG CTTCGACCGG TCGGAACCGA CGTTCCGGCA CGAGGAGTAC
GCCGACTACA AGGCGAACCG GAGCGCCAGC CCCGACAGTT TCCGCAGCCA GATGAGCCTG
ATATACGAGA TGCTCGACGC GCTGCGCGTC CCGCACCTGT CGCTGGCGGG CTACGAGGCC
GACGACCTGA TCGCCACGCT CGCCACCCAG GCCGCCGGGC AGGACATGAA CGTGCTGGTC
GTCACCGGCG ACCGTGACGC GCTCCAGCTG GTCGACGACC GCGTCACGGT GCTGATGACC
CGGGTCGGGA TCAGCAACAT GACCAGGTTC ACCCCCGAGG CGGTGCTGGA GAAGTACGAG
CTCACCCCGG CCCAGTACCC CGACTTCGCG GCCATCCGCG GCGACTCCAG CGACAACCTG
AAGAACATCC CCGGCGTGGG GGAGAAGACC GCGGCCAAGT GGATCCGCGA GTTCGGCTCC
CTGGAGGAGC TGGTCAACCG GATCGACGAG GTCAAGGGCA AGGTCGGCGA CAAGCTCCGC
GACCACCTCG ACCAGGTGCT GATGAACCGC CGTCTCACCC AGCTCATGCG CGACGTCCCG
CTGGAGCGCG AGGTCTCCGC CCTGACGATC GGCCAGGGGG ACCGCGAGGA GATCAACAAG
ATCCTCGACA CGCTGGAGTT CCGGGGGGAG CTCCGCGACC GGCTGTTCAA GACGCTCGCC
TCCGCCGAGC CCGAGGTCGA GGAGGGCTTC GAGGTCGAGG CCGTCAGGCT CGGCCCCGGC
GAGGTGGCCG GATGGCTGCG GGCGCTGCCC GAGGGCAGGG CCGGGCTGGC CTTCAAGGGC
GCCTACGGCA GCGGCACCGG CCGGATCGAC AGCATCGCCG TCGCCGCGCC CGACGGCGCG
GGCGGCGGGC ATGGCGCCGC GTTCATCGAC CCCACCACGC TGACCGAGGC CGACGAGACG
GCCCTGCGCG CCTGGCTCGG CGACGAGTCC AGGCCCAAGG CCGTGCACGA CGCCAAGGGG
CCGATGCTGG CGCTGTGGGC GCAGGGGATG GAGCTGCGCG GCCTGACCTG CGACACGGCG
CTGGCGGCCT ACCTGGCGAT GCCGGGGCAG CGGACGTTCC TGCTGGAGGA CCTCGTGCGG
GCCTACCTCC AGCGCGAGCT GCGCGGCGAG GCCGAGACCG GCGGCCAGGC CAGCCTCTTC
GACGACGAGG ACGACGACAC GGCCCGGGAG CTGGGCCTGC GGGCACTGGC GGTCAGGGAG
CTCGCCGACG CGCTGGAGGC GTTCCTGGAG CCGCGTGGCG GCACCCACCT GATGCGCGAG
GTGGAGCTGC CGCTGGTGAC CGTCCTCGCC GAGCTGGAGC GGGCCGGGAT CGCCGCCGAC
GGCGACTACT TCAGCGGCCT GGAGGCCGAG TTCGGCGCCG CGGTGAAGCA GGCCGTCGAG
GCGGCGCACG CGGCCGTCGG CGAGCAGTTC AACCTGGGCT CGCCCAAGCA GCTCCAGGAG
ATCCTGTTCG TCAGGCTCAA CCTGCCCAAG ACGAAGAAGA CCAAGACGGG CTACACCACC
GACGCCGACG CGCTGGCCTG GCTGGCGGCC CAGACCGAGC ACGAGCTGCC GACGATCATG
CTCAGGCACC GCGACCAGGC CAAGCTGAAG GTCACGGTCG AGGGCCTGGT CAAGGAGATC
GCCGACGACG GGCGCATCCA CACCACGTTC AACCAGATCG TGGCGGCCAC CGGGCGGCTC
AGCTCGGAGA AGCCCAACCT GCAGAACATC CCGATCCGCA CGGTCGAGGG CCGCCGGATC
CGGCAGGGCT TCGTGGTCGG GCCGGGCTAC GAGACCCTGC TGACCGCCGA CTACAGCCAG
ATCGAGCTGC GGATCATGGC GCACCTGTCC GGCGACGAGT CGCTGATCGC GGCCTTCGAG
TCCGGGCACG ACTTCCACAA GACCACCGCC GCCCGGGTCT TCGACATCGA GCCCGAGCAG
GTGACGGGGG AGATGCGGGC CAAGATCAAG GCCATGAACT ACGGCCTGGC CTACGGCCTG
TCGGACTTCG GGCTGGCCGG GCAGCTCAAC ATCCCGGTGC AGGAGGCGAA GGCGCTCAAG
GAGGAATACT TCGAGGAGTT CGGCGGCGTC CGCGACTTCC TCAACGCGAT CGTCGCGCAG
GCCAGGCAGG ACGGCTACAC CGAGACCATC ATGGGCCGCC GCCGCTACCT GCCCGACCTC
AACAGCGACA ACCGCCAGCG CCGCGAGATG GCCGAGCGGA TGGCGCTCAA CGCGCCGATC
CAGGGTTCGG CGGCCGACAT CATCAAGGTC GCCATGCTCA ACGTGCAGAG CGCCGTCAAG
GAGGCGGCCC TGGGCTCCCG GATGCTCCTC CAGGTCCACG ACGAGCTCGT GTTCGAGGTG
GCCCCCGGAG AGCTGGAGAC CCTGCGCGAG CTGGTCACCG ACCGGATGAA CGCCGCATAT
TCCCTGCGCG TCCCGCTGGA GGTCTCGGTG GGCGTCGGCC GCACCTGGGA GGACGCCGGG
CACTGA
 
Protein sequence
MPKSEATPTR PCLLLLDGHS LAYRAFYALP EENFSTTTGQ TTNAVFGFTS MLVNVLRDEQ 
PTHVAVCFDR SEPTFRHEEY ADYKANRSAS PDSFRSQMSL IYEMLDALRV PHLSLAGYEA
DDLIATLATQ AAGQDMNVLV VTGDRDALQL VDDRVTVLMT RVGISNMTRF TPEAVLEKYE
LTPAQYPDFA AIRGDSSDNL KNIPGVGEKT AAKWIREFGS LEELVNRIDE VKGKVGDKLR
DHLDQVLMNR RLTQLMRDVP LEREVSALTI GQGDREEINK ILDTLEFRGE LRDRLFKTLA
SAEPEVEEGF EVEAVRLGPG EVAGWLRALP EGRAGLAFKG AYGSGTGRID SIAVAAPDGA
GGGHGAAFID PTTLTEADET ALRAWLGDES RPKAVHDAKG PMLALWAQGM ELRGLTCDTA
LAAYLAMPGQ RTFLLEDLVR AYLQRELRGE AETGGQASLF DDEDDDTARE LGLRALAVRE
LADALEAFLE PRGGTHLMRE VELPLVTVLA ELERAGIAAD GDYFSGLEAE FGAAVKQAVE
AAHAAVGEQF NLGSPKQLQE ILFVRLNLPK TKKTKTGYTT DADALAWLAA QTEHELPTIM
LRHRDQAKLK VTVEGLVKEI ADDGRIHTTF NQIVAATGRL SSEKPNLQNI PIRTVEGRRI
RQGFVVGPGY ETLLTADYSQ IELRIMAHLS GDESLIAAFE SGHDFHKTTA ARVFDIEPEQ
VTGEMRAKIK AMNYGLAYGL SDFGLAGQLN IPVQEAKALK EEYFEEFGGV RDFLNAIVAQ
ARQDGYTETI MGRRRYLPDL NSDNRQRREM AERMALNAPI QGSAADIIKV AMLNVQSAVK
EAALGSRMLL QVHDELVFEV APGELETLRE LVTDRMNAAY SLRVPLEVSV GVGRTWEDAG
H