Gene Information |
Locus tag | Sare_3126 |
Symbol | |
ID | 5706370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3554526 |
End bp | 3555413 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641272558 |
Product | ECF subfamily RNA polymerase sigma-24 factor |
Protein accession | YP_001537925 |
Protein GI | 159038672 |
COG category | [K] Transcription |
COG ID | [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
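
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 3554526
END_BP = 3555413
GENE_LENGTH_BP = 888
PROTEIN_LENGTH_AA = 295


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 888 bp span corresponds to 296 codons, of which the last is the stop codon, which matches the reported protein length of 295 aa.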
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.956754 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000770913 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATCGGG TGGTGGAGAC AGTGGTAGGG GCTGCGCATG AGGCACCCTT CGTGAAGCAC CGTGGCCTGT TGTACGCGGT CGCGTACCGG ACGCTCGGCA GCGCCAAGGA AGCCGAGGAC GTCGTGCAGG AGGCCTGGCT GCGGTGGAGT CGTGTCGATC CGGCCACCGT CGTGGACGCC GAGGCGTTCC TGGTTCGAGT GACCATCCGG CTGGCGATCG ACGAACTGCG CAGTGCCCGG GTGCGGCGTG AGGCGTACCT CGGTCCGTGG CTACCGGAGC CCATCCTCAC CCGGCCGGAC ATCGCCGACG GCGTTCTCCT CGCCGACTCC GTCTCCACCG CGCTGCTGCT CGTCCTGGAG ACGCTGTCGC CCCTGGAACG TGCGGTTTTC GTGTTGCGGG AGGCCTTCGG ATATCCGTAC GGCGAGATTG CGCAGTTCCT GGGACGCGGC GAGCCGGCAG TACGTCAGCT TGCGCACCGG GCCCGGAAGG CGGTGGAGAA CCGCCGCCAC CGATATGACA GCGACCCGGT GACCCGACAG CAGGTGACCG AGCGATTCCT GGCCGCCTGT TCCGGTGGCG ACATCACCGC ACTGGTCGGC GTACTGGCAC CGAACGTGAC GATGGTCAGC GACGGCGGCG GCTTCACGGG TGCGCCGCGC AAGCTCATCC ACGGTTCGGA CCTCGTGGCG CGCGCCATCG TCGTGCTCTC CCGGCAGCAG CCGGTCGGTT CGACAGGCGC GATACTGCAC CTCAACGGTG GCCCCGGTGT CGTGGTGCAC TGCGACGGAA CGACCGTGCT CGCGATGACC CTGCACCTGG TGGAGGGGCT GGTCCAGACC CTCCATGTGG TCAGTAACCC GCAGAAGCTG ACCGGGATAG CACTGTGA
|
Protein sequence | MDRVVETVVG AAHEAPFVKH RGLLYAVAYR TLGSAKEAED VVQEAWLRWS RVDPATVVDA EAFLVRVTIR LAIDELRSAR VRREAYLGPW LPEPILTRPD IADGVLLADS VSTALLLVLE TLSPLERAVF VLREAFGYPY GEIAQFLGRG EPAVRQLAHR ARKAVENRRH RYDSDPVTRQ QVTERFLAAC SGGDITALVG VLAPNVTMVS DGGGFTGAPR KLIHGSDLVA RAIVVLSRQQ PVGSTGAILH LNGGPGVVVH CDGTTVLAMT LHLVEGLVQT LHVVSNPQKL TGIAL
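
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping and compute a GC fraction, using only the Python standard library. The helper names are hypothetical, and the example input is just the first three blocks of the gene sequence, so its result is not expected to match the 68% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for block grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence, copied from the record above.
    fragment = clean_sequence("ATGGATCGGG TGGTGGAGAC AGTGGTAGGG")
    print(len(fragment))
    print(gc_fraction(fragment))
```

Applying `clean_sequence` to the full Gene sequence field and comparing its length with the Gene Length field is a quick way to confirm that the sequence was copied without truncation.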