Gene Information |
Locus tag | Sare_1440 |
Symbol | |
ID | 5708063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1666405 |
End bp | 1668003 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641270949 |
Product | RNA polymerase sigma factor |
Protein accession | YP_001536330 |
Protein GI | 159037077 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
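The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = feature_length(1666405, 1668003)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # 1599 532
```

Under these assumptions the computed values agree with the Gene Length (1599 bp) and Protein Length (532 aa) fields.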
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00316128 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACAGAAC CCCGCCAGAC CGGCGCCGAC GTTCGCTCGC TCACCGACAC CTTGATCGCG CACGCGCAGA GCGCCGGCGG TCAGCTCACG TCGGCTCAGC TCGCGCGCAC TGTCGAGGCT GCCGAGGTGA CTCCGGCCCA GGCCAAGAAG ATCCTCCGGG CGCTCTCGGA CGCGGGGGTG ACCGTCGTGG TTGACGGTTC GGCCACCACC GCCCCGCGCC GCCGGGTGGC CGCCGCCCGG TCGACCACCC CTGCTTCCCG GGCCACCACC GCCAAGACCA CCAAGAAGAC CACCACGCCC GCGCCGAAGC AGACCCCTGC CGAGGCGACG GCCCCGGCGC CACGGAAGGC CACCGCCCGC AAGGCGGCCG GCACCACCAC CGCCGCGGCC GCCAAGGCGG CGCCGGCGAA GAAGGCCACT CGGGCCACCA AGGCGACGGT GGCCGCGGCG ACGGGCCCGG CCAAGGCCAC GAAGTCCGCT GCGAAGGGTG AGGCCGGTGG CGAGGTCGAC CCGGAGGAGT TGGCCGCCGA GATCGAGGAC GTGGTGGTCG ACGAGCCGGC GGAGCTGACC CAGGCTGCCG AGGCCGACGC GGCGAACTCC GCCACCGACA ACGACTTCGA GTGGGACGAC GAGGAGTCCG AGGCGCTCAA GCAGGCCCGC CGGGACGCGG AGCTGACCGC TTCCGCCGAC TCCGTCCGGG CGTACCTGAA GCAGATCGGC AAGGTCCCGC TGCTCAACGC CGAGCAGGAG GTGGAGCTCG CCAAGCGGAT CGAGGCCGGC CTGTACGCCG CTGAGCGGTT GCGCGCGACC GAGGAGGGCG AGGAGAAGCT CAACCGCGAC ATGCAGCGCG ACCTGATGTG GATCTCGCGA GACGGGGAGC GGGCGAAGAA CCATCTCCTG GAGGCGAACC TGCGCCTCGT GGTGTCGCTC GCCAAGCGGT ACACCGGCCG TGGGATGGCC TTCCTCGACC TGATCCAGGA AGGCAACCTC GGCCTGATCC GCGCGGTCGA GAAGTTCGAC TACACCAAGG GCTACAAGTT CTCCACCTAC GCCACCTGGT GGATCCGCCA GGCCATCACC CGCGCCATGG CCGACCAGGC CCGCACCATC CGCATCCCGG TACACATGGT CGAGGTGATC AACAAACTTG GCCGGATCCA GCGCGAGCTG CTCCAGGACC TGGGCCGCGA GCCCACCCCG GAGGAGCTGG CAAAGGAGAT GGATATCACA CCGGAGAAGG TGCTGGAGAT CCAGCAGTAC GCCCGGGAGC CGATCTCACT CGACCAGACC ATCGGCGACG AGGGCGACAG TCAGCTCGGT GACTTCATCG AGGACTCCGA AGCCGTGGTC GCGGTTGACG CGGTCTCGTT CTCGCTTCTC CAGGACCAGC TCCAGCAGGT GCTCCAGACG TTGTCCGAAC GTGAGGCCGG CGTGGTCCGC CTCCGGTTCG GCCTGACCGA CGGTCAGCCG CGCACGCTTG ACGAAATCGG CCAGGTCTAC GGGGTGACCC GGGAGCGCAT CCGACAGATC GAGTCCAAGA CGATGTCCAA GCTGCGCCAC CCGTCCCGGT CCCAGGTCCT CCGGGACTAC CTGGACTGA
Protein sequence | MTEPRQTGAD VRSLTDTLIA HAQSAGGQLT SAQLARTVEA AEVTPAQAKK ILRALSDAGV TVVVDGSATT APRRRVAAAR STTPASRATT AKTTKKTTTP APKQTPAEAT APAPRKATAR KAAGTTTAAA AKAAPAKKAT RATKATVAAA TGPAKATKSA AKGEAGGEVD PEELAAEIED VVVDEPAELT QAAEADAANS ATDNDFEWDD EESEALKQAR RDAELTASAD SVRAYLKQIG KVPLLNAEQE VELAKRIEAG LYAAERLRAT EEGEEKLNRD MQRDLMWISR DGERAKNHLL EANLRLVVSL AKRYTGRGMA FLDLIQEGNL GLIRAVEKFD YTKGYKFSTY ATWWIRQAIT RAMADQARTI RIPVHMVEVI NKLGRIQREL LQDLGREPTP EELAKEMDIT PEKVLEIQQY AREPISLDQT IGDEGDSQLG DFIEDSEAVV AVDAVSFSLL QDQLQQVLQT LSEREAGVVR LRFGLTDGQP RTLDEIGQVY GVTRERIRQI ESKTMSKLRH PSRSQVLRDY LD
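The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: it assumes translation table 11 shares the standard codon-to-amino-acid assignments and that an alternative start codon such as GTG is read as methionine in the first position. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11; all are read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the 10-base grouping whitespace and normalise case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
fragment = "GTGACAGAAC CCCGCCAGAC CGGCGCCGAC"
print(translate(fragment))  # MTEPRQTGAD
print(f"{gc_content(fragment):.0%}")
```

Passing the full Gene sequence field to `translate` and `gc_content` gives a way to compare against the Protein sequence and GC content fields of the record.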