Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_2258 |
Symbol | |
ID | 3934713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | + |
Start bp | 2264230 |
End bp | 2266173 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637904615 |
Product | SARP family transcriptional regulator |
Protein accession | YP_510200 |
Protein GI | 89054749 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0203775 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACAGG GTCCGTCGGG ATCATTTGCA GCCCGTTTCT TCGGAGCTTT TGCTGTCAAG GATCCCTTGG GAAACCCCGT CACGCCGCGT GGCCAGAAGC CGGCAGCTCT CCTGGCGTAT CTTCTGTATA ATGCAGGCCG GGACGTGCCC CGCGACCGAC TGATTGATAT GTTCTGGAAT GATCGCGGGT CCAAGCAGGG GCGCGACAGT CTCAGGCAAG CGCTCCATGT GCTGCGCTCC ACGATTGGCG AGGCGGGCGC GGTCCTGATG CATATTGATC GACAAAGCGT GCATGTGGTC AAAGCGCATC TTTCAAGCGA CTTGCAGCAG AAGGAAGACG CGCCGTGGGA ATTGGACGGA GAGTTCCTTG CTGATCTTGC CCCGACCGCA CCCGTCTTCG AGGACTGGTT GACAGAGACA CGGCGCGCGG TGCGACGTCA TCAGGTTGCG GCGGCAGAGC AGGCCCTGGA CCGCCTGGAT GCGGAGGCCA ATCCGGATGC CGTGCTGGCC TGTGCCACCG CGATATTGTC GCTTGATCCC CATCATGAGC CCGCGGCGCG ATGCGCGATG GAACGGTATG CGGTTCAAGG CAAGAGAGGC CACGCGCTCC GCGTTTTCGA GACTTTGTCT GAGGCACTCA AATCAGATGG ATTTGAGGTC TCACCAGACA CGGACAACCT GCTTCGTGCG ATCAGCGACA ACCGGTTCCC AATCGACAAG CCACACCCCG CGCCAACGCC CCGGCCAGGT GCGTCAGGTG ACGCCCGCGG TTTGCCCGTT GTCTGGCTCG ACCTGTCCGA AACCCGCCAC GATCGCGATA GCTGGGACTT TGTCTCGGAT TTCTGTGATC ACCTCATATT GCGGTGTGTC CAGATGCCGG AGTTCAACCT GCTGACGGTT GAGGATGTCA CAGATATTGA GAAATACACT GTGGCCGTCT CATCCGGCAC CACCCAATCA GGTCTGCGCA TCAGCCTGAG GCTCCGGGCG CCGGACGATC ACTTGCTGTG GTCGGGTCGC GCGGATCTGC CTGATCAACC GGACGATGAC CGGGTTCACC TGGCCGTGGA CAAGTTGGTG ATGCAGATGC TGCCCCCGCT GGAGGCCCAT GTGTTCGCAA GTCTGGGCGC GCAGATCGAC ACCGCCTACG GATATTACCT CAAGGCCAAG CGGACGTTCT GGACCGATCC GCAATTCGGC TACATCGACA AAGTGGTTGC CGATCTGCAA AGAGCGATCG AGATCGCCCC GACGTTTCTG CCGCCGTACC CGATGATGAT CATGTACCAT AATACGGGTA TGTTCATGAG CCGGCCAGGC ATCGATCACA CCGATGGGCG CGCCCAGGCC CTGGACCTGT CGCAGCGGTT GTTGTTTTTG AACTCGAACT TTCCAAATGC ACATATCTCG ATGGGGTGGT GCCTGCTCTG GCGGCGAAAT TTCGACGCGG CGGAGCGGTC AATCCGTCGC GCCATTGAAC TCGATCCGTA TGAGCCGCAC AGGCTTAGCG TGATCGGAAC AGCGTTGGTC TATCTGGGAC ATCACGAGGA GGGCCAGCGC TTCTACGACA AGGCGCAGGG GCGGATGCAG CACGATTTCG ATTTCCAGCG CACCGATTAT GGGGAACTTC ACTTCTTCAA GACAGAGTAC GAACGGGCCC TGTCCTGGAT GGAAATCCCC GAGGCCCGAA CCCCCTACAA AACGCTTTTC TTCCGGGCAG CCGCCAATGC GCAATTGTCA CGCCGAACGG ATGCGCAGAA CGATATCGAC GCGTTCGTCG AGGATATCCG CCCACGATGG GCGGGCAGGG ACCCCTTCAC GCCCGAGCGT GGGTTTCAAT GGTACGCGGA CATGTTGCCC CTGCGGCTCG CCTCGGACCG TGCGACGCTT CGCGCAGCGA TGGGAAAGCT CGGGCTCGAC GTCACCGTGC ACGAGCCATC GTGA
|
Protein sequence | MSQGPSGSFA ARFFGAFAVK DPLGNPVTPR GQKPAALLAY LLYNAGRDVP RDRLIDMFWN DRGSKQGRDS LRQALHVLRS TIGEAGAVLM HIDRQSVHVV KAHLSSDLQQ KEDAPWELDG EFLADLAPTA PVFEDWLTET RRAVRRHQVA AAEQALDRLD AEANPDAVLA CATAILSLDP HHEPAARCAM ERYAVQGKRG HALRVFETLS EALKSDGFEV SPDTDNLLRA ISDNRFPIDK PHPAPTPRPG ASGDARGLPV VWLDLSETRH DRDSWDFVSD FCDHLILRCV QMPEFNLLTV EDVTDIEKYT VAVSSGTTQS GLRISLRLRA PDDHLLWSGR ADLPDQPDDD RVHLAVDKLV MQMLPPLEAH VFASLGAQID TAYGYYLKAK RTFWTDPQFG YIDKVVADLQ RAIEIAPTFL PPYPMMIMYH NTGMFMSRPG IDHTDGRAQA LDLSQRLLFL NSNFPNAHIS MGWCLLWRRN FDAAERSIRR AIELDPYEPH RLSVIGTALV YLGHHEEGQR FYDKAQGRMQ HDFDFQRTDY GELHFFKTEY ERALSWMEIP EARTPYKTLF FRAAANAQLS RRTDAQNDID AFVEDIRPRW AGRDPFTPER GFQWYADMLP LRLASDRATL RAAMGKLGLD VTVHEPS
|
| |