Gene Strop_1475 details

Gene Information

Locus tag: Strop_1475
Symbol:
ID: 5057928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 1684802
End bp: 1686430
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 69%
IMG OID: 640473743
Product: RNA polymerase sigma factor
Protein accession: YP_001158319
Protein GI: 145594022
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
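
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1684802
END_BP = 1686430
GENE_LENGTH_BP = 1629
PROTEIN_LENGTH_AA = 542


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced, complete CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the listed gene length (1629 bp) follows from the coordinates, and the listed protein length (542 aa) follows from the gene length.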


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.446552
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.895772
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGACAGAAC CCCGCCAGAC CGGCGCCGAC GTTCGCTCGC TCACCGACAC CTTGATCGCA 
CACGCGCAGA GCGCCGGCGG TCAGCTCACG TCGGCACAGC TCGCGCGCAC TGTCGAGTCA
GCCGAGGTGA CTCCGGCCCA GGCCAAGAAG ATCCTCCGGG CGCTCTCGGA GGCGGGGGTG
ACCGTCGTGG TCGACGGTTC AGCCAGCACC GCCCCACGCC GACGGGTCGC CGCCGCCCGG
TCGACCACCC CGGCATCCCG GGCCACCACC GCCAAGACCA CCAAGAAGGC CGCCACGCCT
GCGTCGAAGG CGGCACCGAC CGGGGCCGAG GCACCGGCCC CAGCGCCACG GAAGGCCACC
GCCCGCAAGG CGGCCGGTGC TGCCGCCGAC GGCGCCAAGG CCACTCCGGC GAAGAAGGCC
ACCCGGGCGA CCAAGGCGAC CGTGGCCGCA GCGACCGGGC CGGCGAAGGC CACGACGAAG
ACTGCGAAGG CCACGAAAGC CACGAAGGCC ACCAAGAGCG GGGCTGCTGG CGAGGTCGAC
GCGGAGGAGT TGGCCGCCGA GATCGAGGAT GTGGTGGTCG ACGAGCCGGC GGAGCTGACC
CGGGCCGCCG AGGCTGACGC GGCGAGCTCC GCCAGCGACA ACGACTTCGA GTGGGACGAC
GAGGAGTCCG AGGCCCTCAA GCAGGCACGC CGAGACGCGG AGCTGACCGC GTCCGCCGAC
TCCGTCCGGG CGTACCTGAA GCAGATCGGC AAGGTCCCGC TACTCAACGC CGAGCAGGAG
GTCGAGCTCG CCAAGCGGAT CGAGGCCGGC CTCTACGCCA CCGAGCAGCT GCGCGCGGCG
GAGGAGGGCG ACGAGAAGTT CAACCGCGAC ATGCAACGCG ACATGATGTG GATCTCGCGG
GACGGAGAGC GGGCAAAGAA CCATCTCCTG GAAGCGAACC TCCGCCTGGT GGTGTCGCTC
GCCAAGCGTT ACACCGGCCG TGGGATGGCC TTCCTCGACC TGATCCAGGA GGGCAACCTC
GGCCTGATCC GCGCGGTCGA GAAGTTCGAC TACACCAAGG GCTACAAGTT CTCCACCTAC
GCCACCTGGT GGATCCGCCA GGCCATCACC CGCGCCATGG CCGACCAGGC CCGCACCATC
CGCATCCCGG TACACATGGT CGAGGTGATC AACAAGCTCG GCCGGATCCA GCGCGAGCTA
CTCCAGGACC TGGGCCGCGA GCCCACCCCG GAGGAGCTCG CAAAAGAGAT GGATATCACA
CCGGAGAAGG TGCTGGAGAT CCAGCAGTAC GCCCGGGAGC CGATCTCGCT CGACCAGACC
ATCGGCGACG AGGGCGACAG CCAGCTCGGT GACTTCATTG AGGACTCCGA GGCCGTGGTC
GCGGTCGACG CGGTGTCGTT CTCGCTCCTC CAGGATCAGC TCCAGCAGGT GCTACAGACG
CTGTCCGAGC GCGAGGCCGG CGTGGTCCGC CTCCGGTTCG GCCTGACCGA TGGTCAGCCG
CGCACGCTGG ACGAGATCGG CCAGGTCTAC GGGGTGACCC GGGAGCGCAT CCGACAGATC
GAGTCCAAGA CGATGTCCAA GCTGCGCCAC CCGTCCCGGT CGCAGGTCCT CCGGGACTAC
CTGGACTAA
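
The GC content reported in the Gene Information section can be recomputed from a gene sequence laid out as above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole sequence; how the source database rounds the value is not stated, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block spacing and line breaks used above)
    is removed before counting, and the input is treated case-insensitively.
    """
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


if __name__ == "__main__":
    # Only the first two 10-base blocks of the gene sequence are used here;
    # paste the full sequence to compare with the reported GC content.
    first_blocks = "GTGACAGAAC CCCGCCAGAC"
    print(f"{gc_content(first_blocks):.1f}%")
```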
 
Protein sequence
MTEPRQTGAD VRSLTDTLIA HAQSAGGQLT SAQLARTVES AEVTPAQAKK ILRALSEAGV 
TVVVDGSAST APRRRVAAAR STTPASRATT AKTTKKAATP ASKAAPTGAE APAPAPRKAT
ARKAAGAAAD GAKATPAKKA TRATKATVAA ATGPAKATTK TAKATKATKA TKSGAAGEVD
AEELAAEIED VVVDEPAELT RAAEADAASS ASDNDFEWDD EESEALKQAR RDAELTASAD
SVRAYLKQIG KVPLLNAEQE VELAKRIEAG LYATEQLRAA EEGDEKFNRD MQRDMMWISR
DGERAKNHLL EANLRLVVSL AKRYTGRGMA FLDLIQEGNL GLIRAVEKFD YTKGYKFSTY
ATWWIRQAIT RAMADQARTI RIPVHMVEVI NKLGRIQREL LQDLGREPTP EELAKEMDIT
PEKVLEIQQY AREPISLDQT IGDEGDSQLG DFIEDSEAVV AVDAVSFSLL QDQLQQVLQT
LSEREAGVVR LRFGLTDGQP RTLDEIGQVY GVTRERIRQI ESKTMSKLRH PSRSQVLRDY
LD
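
The protein sequence can be related back to the gene sequence through translation table 11, listed in the Gene Information section. The sketch below assumes the NCBI codon ordering (bases in the order T, C, A, G), uses the table 11 amino-acid assignments, which match the standard code, and reads an alternative start codon such as GTG as methionine when it opens the CDS. The `translate` function and its `is_cds_start` parameter are hypothetical names introduced for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons as listed by NCBI for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(sequence: str, is_cds_start: bool = True) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon.

    If is_cds_start is True, a recognised start codon in the first
    position is translated as M regardless of its usual amino acid.
    """
    cleaned = "".join(sequence.split()).upper()
    protein = []
    # Ignore any incomplete trailing codon.
    for i in range(0, len(cleaned) - len(cleaned) % 3, 3):
        codon = cleaned[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("GTGACAGAAC CCCGCCAGAC CGGCGCCGAC"))
    # Final nine bases of the gene sequence, not at the start of the CDS.
    print(translate("CTGGACTAA", is_cds_start=False))
```

The first call covers the GTG start codon, and the second covers the final two codons and the TAA stop, so the outputs can be compared with the beginning and end of the protein sequence above.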