Gene Information |
Locus tag | Sare_2197 |
Symbol | |
ID | 5708192 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2529732 |
End bp | 2531624 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641271678 |
Product | transcriptional regulator |
Protein accession | YP_001537049 |
Protein GI | 159037796 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
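The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it matches the reported figures) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2529732
end_bp = 2531624

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (1893 bp) and Protein Length (630 aa).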
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.774056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCGGA CCGAGGACGA GCTCCGCATT CAGATCCTCG GTCCTGTCCA GGCGACCATC CGCGGTGACG CGGTCGACCT GGGCCCTCCC AAACAGCGGG CGGTTCTCGC CCTGCTGGCC CTGCGGGCCG GCGGGCACGT GCCGCTCGAC GATCTGATCG CCGCGCTCTG GGCGGGACAG CCACCGGCCC GGGCGGCCAA TCTGGTGCAC ACCTACGTCG CCCGCCTGCG CCAGACGCTG GAGCCGGACA CGCCACGTCG GCGGCGGACC AACGTCATCG CATCGGTGCC GGGCGGGTAC CGGTTGGCTG TCGGCGCGGA GCAGCTCGAC CTGCACGCGT TCCGTGCCGG AGTCCGGGAG GCTGCGGCCC TACGCGAGCG CGGCGAGCCG ACCAGGGCCT TCGCCCGCTT CGGGGAGAGC GTCGGCCGGT GGCGGGATCC ACAGGTCGGC GACCTGGCCG CGCTCCTGGT CGAGCAGGAC GACCTCGGTC CACTGCGGCA GGAGTTCCTG GCCGCGGCGC TCACCTATGT CGGCCTCGGC CTGGATCTGG GCCGACCGGA GGCGATCCTG CACCTCGCCG AACGGCTGGC GCTGACCGAG CCGCTCAACG AAAGGGTGCA GGCCCGGCTG TTGCAGACGC TGGCACGAAT CGGCCAGCGC GCCTGGGCCA TCGAGCGCTA TGCCGAGGTC CGCGAGCGAC TCCGGCTCGA CCTCGGCGTG GACCCGGGGC CCGAGCTGTC CGCGGCGTAC CGCGAGGTGT TGGATGCCGA GCTGATGTCG TCCGATACCG CCCACCGGAC CTCGCCCCAG GTGCCACCGT GGCGGGGAAC GGTGCCCCTG ATCGATGAAC TGACCGGTCG TTCGGCTGAC CTTACCGCGA TCAACGACCT GCTCGACGGG TACCGGCTGG TGAGCCTTAC CGGCCCGGCT GGGGTGGGCA AGTCCGCGCT CGGCCTGGCC GCCGCGGAGC GGCAGCGAAA GCGGCACGCC GACGGCGTGG CGGTGGTCGA CGTGACGAAC GTCCGCACCG GACACGCCCT CACGCAGGCG GTGACCGCGG TCGTCGTCAA CGGTCCACTG CAGGCAACGG GTACACCCGT GTCCCTGGTG CGCCAGTTGC ACGACCGAAG CCTGCTGCTT GTGATCGACA ACGCCGAGTT GGTCACCGAC GAGGCGGCAG AGCTGACAGA CGAGTTGCTC CGCGAATGTC CCGGACTCAC CGTCCTGTTG ACCTCCCGGG AGCTACTCGG GATGCGGTAC GAGGCGGTGT ATCCGGTCCG GCCGTTGCGA ACCGACCCCG CACCGGGGAC GTCCGCGCCT CCGCCCGCCC AACAGTTGTT CGCCCGGCGG GCCACGCAGG TGCAGCCGAG CTTCCGGCTC GACGAGTCCA CCCTGCCCGG GGTCACCGCG GTGTGCCGGG CGCTCGACGG CCTCCCGCTC GCCATCGAGC TGGCCGCGGC GTGCCTGCGC ACCCAACGGC TGGACACCCT GGTTGACGTC GTGGCTGATT CGCTGCGCTG GCTCCAACCA CCGCGACGGG GGGTGCCGAG ACACCATCGG TCGCTACGGG CCGCCGTGCA CCGCAGCATC GAACTGCTCG ACGCGGCGGA GCAGCGGTGT TTCGCCGCGC TCGGGGCGAT GCCGGCCGAG TTCGACCTCG CGGCCGCTGC CAGCGCCAGC GGCGCACTGG TCGGTGAACG CGCTGCGGTG CAGGTACTCC TGGATCGATT GGTCGACAAG TCGGTACTGG AGGTCCGGCA CGGTCCGGCC GGGAGGCAGT ACCGCATGCT CGGGACGGTT 
CGCGCAATGG CCCGACAGCT ACTCCAGGAG CAGGGCTCGA TCGGGGCATC TCCCTCGACC CGGTGCGCGT GCGCCTGCCA CCTGGACCCC TGA
|
Protein sequence | MNRTEDELRI QILGPVQATI RGDAVDLGPP KQRAVLALLA LRAGGHVPLD DLIAALWAGQ PPARAANLVH TYVARLRQTL EPDTPRRRRT NVIASVPGGY RLAVGAEQLD LHAFRAGVRE AAALRERGEP TRAFARFGES VGRWRDPQVG DLAALLVEQD DLGPLRQEFL AAALTYVGLG LDLGRPEAIL HLAERLALTE PLNERVQARL LQTLARIGQR AWAIERYAEV RERLRLDLGV DPGPELSAAY REVLDAELMS SDTAHRTSPQ VPPWRGTVPL IDELTGRSAD LTAINDLLDG YRLVSLTGPA GVGKSALGLA AAERQRKRHA DGVAVVDVTN VRTGHALTQA VTAVVVNGPL QATGTPVSLV RQLHDRSLLL VIDNAELVTD EAAELTDELL RECPGLTVLL TSRELLGMRY EAVYPVRPLR TDPAPGTSAP PPAQQLFARR ATQVQPSFRL DESTLPGVTA VCRALDGLPL AIELAAACLR TQRLDTLVDV VADSLRWLQP PRRGVPRHHR SLRAAVHRSI ELLDAAEQRC FAALGAMPAE FDLAAAASAS GALVGERAAV QVLLDRLVDK SVLEVRHGPA GRQYRMLGTV RAMARQLLQE QGSIGASPST RCACACHLDP
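The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch that translates the first 30 bases and the last 12 bases of the gene sequence. The `translate` helper and the `START_CODONS` set are hypothetical names written for this illustration, and the start-codon set is deliberately partial (only ATG and GTG).

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G); tables 1 and 11 share
# these assignments and differ in which codons may start translation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Partial set of table 11 initiation codons (illustrative, not exhaustive).
START_CODONS = {"ATG", "GTG"}


def translate(dna, is_cds_start=False):
    """Translate a DNA string codon by codon, stopping at a stop codon.

    If is_cds_start is True, a recognised start codon in the first
    position is read as M regardless of its usual meaning.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases, copied from the Gene sequence field.
first_30 = "GTGAACCGGA CCGAGGACGA GCTCCGCATT"
last_12 = "CTGGACCCC TGA"

n_terminus = translate(first_30, is_cds_start=True)
c_terminus = translate(last_12)
print(n_terminus, c_terminus)
```

The two fragments correspond to the first ten residues (MNRTEDELRI) and the last three residues (LDP) of the protein sequence listed above, and the gene ends on the stop codon TGA.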