Gene Sare_1440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1440 
Symbol 
ID5708063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1666405 
End bp1668003 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content70% 
IMG OID641270949 
ProductRNA polymerase sigma factor 
Protein accessionYP_001536330 
Protein GI159037077 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00316128 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACAGAAC CCCGCCAGAC CGGCGCCGAC GTTCGCTCGC TCACCGACAC CTTGATCGCG 
CACGCGCAGA GCGCCGGCGG TCAGCTCACG TCGGCTCAGC TCGCGCGCAC TGTCGAGGCT
GCCGAGGTGA CTCCGGCCCA GGCCAAGAAG ATCCTCCGGG CGCTCTCGGA CGCGGGGGTG
ACCGTCGTGG TTGACGGTTC GGCCACCACC GCCCCGCGCC GCCGGGTGGC CGCCGCCCGG
TCGACCACCC CTGCTTCCCG GGCCACCACC GCCAAGACCA CCAAGAAGAC CACCACGCCC
GCGCCGAAGC AGACCCCTGC CGAGGCGACG GCCCCGGCGC CACGGAAGGC CACCGCCCGC
AAGGCGGCCG GCACCACCAC CGCCGCGGCC GCCAAGGCGG CGCCGGCGAA GAAGGCCACT
CGGGCCACCA AGGCGACGGT GGCCGCGGCG ACGGGCCCGG CCAAGGCCAC GAAGTCCGCT
GCGAAGGGTG AGGCCGGTGG CGAGGTCGAC CCGGAGGAGT TGGCCGCCGA GATCGAGGAC
GTGGTGGTCG ACGAGCCGGC GGAGCTGACC CAGGCTGCCG AGGCCGACGC GGCGAACTCC
GCCACCGACA ACGACTTCGA GTGGGACGAC GAGGAGTCCG AGGCGCTCAA GCAGGCCCGC
CGGGACGCGG AGCTGACCGC TTCCGCCGAC TCCGTCCGGG CGTACCTGAA GCAGATCGGC
AAGGTCCCGC TGCTCAACGC CGAGCAGGAG GTGGAGCTCG CCAAGCGGAT CGAGGCCGGC
CTGTACGCCG CTGAGCGGTT GCGCGCGACC GAGGAGGGCG AGGAGAAGCT CAACCGCGAC
ATGCAGCGCG ACCTGATGTG GATCTCGCGA GACGGGGAGC GGGCGAAGAA CCATCTCCTG
GAGGCGAACC TGCGCCTCGT GGTGTCGCTC GCCAAGCGGT ACACCGGCCG TGGGATGGCC
TTCCTCGACC TGATCCAGGA AGGCAACCTC GGCCTGATCC GCGCGGTCGA GAAGTTCGAC
TACACCAAGG GCTACAAGTT CTCCACCTAC GCCACCTGGT GGATCCGCCA GGCCATCACC
CGCGCCATGG CCGACCAGGC CCGCACCATC CGCATCCCGG TACACATGGT CGAGGTGATC
AACAAACTTG GCCGGATCCA GCGCGAGCTG CTCCAGGACC TGGGCCGCGA GCCCACCCCG
GAGGAGCTGG CAAAGGAGAT GGATATCACA CCGGAGAAGG TGCTGGAGAT CCAGCAGTAC
GCCCGGGAGC CGATCTCACT CGACCAGACC ATCGGCGACG AGGGCGACAG TCAGCTCGGT
GACTTCATCG AGGACTCCGA AGCCGTGGTC GCGGTTGACG CGGTCTCGTT CTCGCTTCTC
CAGGACCAGC TCCAGCAGGT GCTCCAGACG TTGTCCGAAC GTGAGGCCGG CGTGGTCCGC
CTCCGGTTCG GCCTGACCGA CGGTCAGCCG CGCACGCTTG ACGAAATCGG CCAGGTCTAC
GGGGTGACCC GGGAGCGCAT CCGACAGATC GAGTCCAAGA CGATGTCCAA GCTGCGCCAC
CCGTCCCGGT CCCAGGTCCT CCGGGACTAC CTGGACTGA
 
Protein sequence
MTEPRQTGAD VRSLTDTLIA HAQSAGGQLT SAQLARTVEA AEVTPAQAKK ILRALSDAGV 
TVVVDGSATT APRRRVAAAR STTPASRATT AKTTKKTTTP APKQTPAEAT APAPRKATAR
KAAGTTTAAA AKAAPAKKAT RATKATVAAA TGPAKATKSA AKGEAGGEVD PEELAAEIED
VVVDEPAELT QAAEADAANS ATDNDFEWDD EESEALKQAR RDAELTASAD SVRAYLKQIG
KVPLLNAEQE VELAKRIEAG LYAAERLRAT EEGEEKLNRD MQRDLMWISR DGERAKNHLL
EANLRLVVSL AKRYTGRGMA FLDLIQEGNL GLIRAVEKFD YTKGYKFSTY ATWWIRQAIT
RAMADQARTI RIPVHMVEVI NKLGRIQREL LQDLGREPTP EELAKEMDIT PEKVLEIQQY
AREPISLDQT IGDEGDSQLG DFIEDSEAVV AVDAVSFSLL QDQLQQVLQT LSEREAGVVR
LRFGLTDGQP RTLDEIGQVY GVTRERIRQI ESKTMSKLRH PSRSQVLRDY LD