Gene Information |
Locus tag | Sare_0089 |
Symbol | |
ID | 5707059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 102219 |
End bp | 103157 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641269615 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001535015 |
Protein GI | 159035762 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
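The length fields in the record above can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 102219
END_BP = 103157


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq):
    """GC percentage over A/C/G/T characters; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 939, matching "Gene Length"
    print(protein_length(length))  # 312, matching "Protein Length"
```

Passing the Gene sequence field from the Sequence section to `gc_percent` gives a value to compare against the GC content field. The spaces between the 10-base blocks are skipped automatically.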
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00450491 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTACACAG TCGTCGTCCT CGCGCTGCCG GATGTGATTG CCTTTGATCT GGCCACACCG GTCGAGACGT TCGGCCGTGT CCGCCTGCCG GACGGCCGGC CCGGATACCG GGTCCTCGTC GCAGGGCCCG ACGATGTCGT CGACGCCGGG CCGGTGCGGC TGGCAGTCAG CGAGCAGCTG GATGCGCTCG ATCGCGCCGA CCTGGTCGTG GTGCCTGGCC GCAACAACCC CTTACGGCCC TCCCCACCCT CGGTGCTCGC CGCCCTGCGT GCTGCCGCGA CCAGGGGCAC ACGCGTCGCC TCCATCTGCG TCGGAGCATT CACGCTGGCA GAAGCAGGAC TACTCGACAA CATGAGGGCC ACGACCCACT GGCTCGCCGC CGAACACCTC GCGCACCAAC ACCCGTCGAT CCAGGTGGAC CCCGACGTGC TCTACATCGA CAACGGCAGC ATTCTCACCT CTGCCGGTGC CGCGTCCGGG CTGGACCTGT GCCTGCACGT GATCCACACC GACTACGGTG CGGCGGTGGC CGCGGATGCC GCACGCCTCG CCGTGGCCCC ACTGCACCGA GCCGGTGGGC AGGCGCAGTA CATCCTGCGG AACCGGCCGC CCCTGCGGAC CTCAGTCCTC GAACCCGTCC TCGCCTGGAT CGAGACCAAC GCGCATCGGG CCCTCACGCT CGCCGACCTC GCCGCCGCCG CGAACCTGAG CACACGCACC CTGACCAGGC GATTCGCTGT CGAGACCGGA CAGAGCCCGA TGCAATGGGT CGCCGGCGTC CGGATTCGTC ACGCCCAGGA GCTCCTGGAG ACCACCGACT ACACGATCGA CCGCATCGCA AACCAGACCG GATTCACCAC CACGAGCAAC TTCCGTGCGC AGTTCCAGGA GGTCGTCGGC ACCACACCAG GCGCCTATCG CACCACCTTC CGGCTGTGA
|
Protein sequence | MYTVVVLALP DVIAFDLATP VETFGRVRLP DGRPGYRVLV AGPDDVVDAG PVRLAVSEQL DALDRADLVV VPGRNNPLRP SPPSVLAALR AAATRGTRVA SICVGAFTLA EAGLLDNMRA TTHWLAAEHL AHQHPSIQVD PDVLYIDNGS ILTSAGAASG LDLCLHVIHT DYGAAVAADA ARLAVAPLHR AGGQAQYILR NRPPLRTSVL EPVLAWIETN AHRALTLADL AAAANLSTRT LTRRFAVETG QSPMQWVAGV RIRHAQELLE TTDYTIDRIA NQTGFTTTSN FRAQFQEVVG TTPGAYRTTF RL
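The relationship between the two sequences above can be sketched in a few lines of Python. This is an illustrative sketch rather than the database's own method. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it uses the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard code, differing only in which codons may act as start codons. `translate` and `CODON_TABLE` are names introduced here.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code; its alternative
# start codons do not matter here because this gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators in the record) is removed.
    Codons containing symbols other than A/C/G/T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGTACACAG TCGTCGTCCT CGCGCTGCCG"))  # MYTVVVLALP
```

The output matches the first 10-residue block of the protein sequence. Translating the final codons of the gene, `TTC CGG CTG TGA`, gives `FRL` followed by the stop codon, which agrees with the end of the protein.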