Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3959 |
Symbol | |
ID | 5704910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4499170 |
End bp | 4500159 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641273384 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001538740 |
Protein GI | 159039487 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.624868 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.299323 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGGA ACCGGGCAAC CGGCGCGAGC GAGGGGACCG TGGGCAACGT GGACAAGAAT ATCGGCATGC GGACCGACGA GGTCGCCGAG GAGCGCGACC TGGTCGGCGT CTACCTTCAC GAGATCTCCC GGACGCCGCT GCTGGACGCC GCCACGGAGG TCGATCTTTC CAAGGCTATC GAGGCCGGAC TCTACGCCGA GCACCTGCTC GACGAGGACC GCGTCCCGGC CGGTGTCGAG CGAGCGGAAC TCCAGCGGCT GGTCGCTGAG GGTGAGCGGG CCAAGGATCT GTTCATCCGT GCCAACCTGC GACTGGTCGT GTCGATCGCC CGGCGGTACG TCCGTTCCGG GATGCCCATG CTGGATCTGA TCCAGGAGGG CAACACCGGT CTGGTCCGGG CGGTCGAGAA GTTCGACTAC GAGCGCGGCT TCAAGTTCTC CACCTACGCG ACCTGGTGGA TCCGTCAGGC GATCAGCCGG GCGATCGCCC AGCAGGAGCG CACCGTGCGG TTGCCGGTGC ACCTGGTGGA GGATGTCAAC CGGATGCGGA ATGTGGCCCG GCAGCTCACC CGGGAGCTTG GCAGCGACCC CGAGCCGGAG CAGATTGCGA CGGCGCTCGG CGTCAGCGTC GAGCGGGTCA ACGAGTTGGT CCGCTGGTCG CAGGACACCG TGTCGTTGGA CACACCGGTC GGCGACGACG GCGACACCAA CCTGGGGGAC CTGGTCGCCG ACAGCGATGC GCCGTCACCG GAGGAGATTG TCCTCACTGG CCTGGAGCGG CAGCGCATCG AGGGCCTGCT CAGTCACCTC GACGACCGGT CGGCCGGCAT CATGCGGGCC CGGTACGGCC TCGAGGACGG CCGGGAGCAC TCGCTGACCG AGGTGGCCTC ACGCTTCTCG CTTTCCCGGG AACGGATCCG ACAGCTGGAG ATCCAGGCAC TCGGCCGGCT GCGCGAGCTG GCTCGTGCCG AAGGGCTTCA GGCAGCCTGA
|
Protein sequence | MARNRATGAS EGTVGNVDKN IGMRTDEVAE ERDLVGVYLH EISRTPLLDA ATEVDLSKAI EAGLYAEHLL DEDRVPAGVE RAELQRLVAE GERAKDLFIR ANLRLVVSIA RRYVRSGMPM LDLIQEGNTG LVRAVEKFDY ERGFKFSTYA TWWIRQAISR AIAQQERTVR LPVHLVEDVN RMRNVARQLT RELGSDPEPE QIATALGVSV ERVNELVRWS QDTVSLDTPV GDDGDTNLGD LVADSDAPSP EEIVLTGLER QRIEGLLSHL DDRSAGIMRA RYGLEDGREH SLTEVASRFS LSRERIRQLE IQALGRLREL ARAEGLQAA
|
| |