Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4999 |
Symbol | |
ID | 5705739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5667833 |
End bp | 5668927 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641274392 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001539733 |
Protein GI | 159040480 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000160838 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGCCG AGCAGTTGAT CTCCTTCGCC CGTGGCGCTC CCTCGCTGGA CATCGTCGAT ATCGAGGGGC TGAAGGCCGC CGCCGTCCGC GCCTTCGACG CCGACCCCGC CGGTGTGACG GCGTACGGTT CCTCCGCCGG GTACCTTCCG TTGCGCGAGT GGATCGCGAA CAAACACGGG GTCCAGGCCG ACCAGATCCT GGTGACCAAC GGATCGCTAC AGGCCGACGC CTTCCTCTTC GACCACCTGA TCCGACCCGG CGACGCGGTG GTGGTGGAGC GCCCGACCTA CGACCGAACT CTGCTGAATC TGCGGCGGAT GGGTGGTGAG CTGCACGGGA TCACCATCCA GCCGGACGGA CTGGACACCA CCGAGCTGCG TAAGTTGCTG GAGTCCGGGG TGCGCCCACG GGTGGCGCAC GTCATCCCGA ACTACCAGAA CCCGGCCGGC GTGACGCTCA GCCTCGACAA GCGGCGCGAG CTTCTCGAAC TCGCCGCCGA GTTCGAGTTC ACTGTCTTCG AGGACGACCC GTACGCCGAC ATCCGGTTCC GCGGCGAGGC GCTGCCGTCG ATGCTCTCGT TGGACAGCCA CAACCTGGTG GTGCACGCGT CCAGCTTCAC CAAGACGGTC TGCCCGGGGG TGCGGGTCGG CTACCTGGTC GGGCCCTCGG ACCTGATTGC CGACATCGCG AAGAAAGCGA CAAGTCTCTA CATCTCGCCG GGCGTGGTGT CCGAGGCGAT CGTCCACCAG TTCTGCGTCT CCGGGGACAT CGACCGCTCG ATCGCCACGG TCCGTCGGGC CCTCGGCGAG CGGGCCCGGG TGCTGGCCGA GTCGTTGCGG CGGCACATCC CGCAGGCCCA GTTCGTCGAG CCGGACGGCG GCTACTTCCT CTGGGTGGAG TTGCCGGAGG ACGTCCGGGT GGACCGGCTG GCCCCGGCCG CGGCGGAGCG AGGAGTCGCG GTGGTGAAGG GCAGCGACTT CGTCCTCGAC GGTGGGCAGC ATGCGCTGCG GCTGGCGTAC TCGGCGGTGA CCGCGGACCA GATCGATGAG GGTGTGCGCC GGCTCGCGGC GGCGATGGCG GCCGTGCGCG GCTGA
|
Protein sequence | MTAEQLISFA RGAPSLDIVD IEGLKAAAVR AFDADPAGVT AYGSSAGYLP LREWIANKHG VQADQILVTN GSLQADAFLF DHLIRPGDAV VVERPTYDRT LLNLRRMGGE LHGITIQPDG LDTTELRKLL ESGVRPRVAH VIPNYQNPAG VTLSLDKRRE LLELAAEFEF TVFEDDPYAD IRFRGEALPS MLSLDSHNLV VHASSFTKTV CPGVRVGYLV GPSDLIADIA KKATSLYISP GVVSEAIVHQ FCVSGDIDRS IATVRRALGE RARVLAESLR RHIPQAQFVE PDGGYFLWVE LPEDVRVDRL APAAAERGVA VVKGSDFVLD GGQHALRLAY SAVTADQIDE GVRRLAAAMA AVRG
|
| |