Gene Information |
Locus tag | Sare_1947 |
Symbol | |
ID | 5705766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2241682 |
End bp | 2243205 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641271452 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001536823 |
Protein GI | 159037570 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
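The length fields in the table above are consistent with one another, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about the record, not statements taken from it. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive on both ends.
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not counted as a residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Start bp and End bp copied from the Gene Information table.
gene_bp, protein_aa = cds_lengths(2241682, 2243205)
print(gene_bp, protein_aa)  # 1524 507
```

The result matches the Gene Length (1524 bp) and Protein Length (507 aa) fields. The strand does not affect the length calculation.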
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.505699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0597402 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCAGA CTCCACGAGA ACTCGCACCC CACATCTCCG CCAAGCACTA CTTCGGCGCC CAACTGCGAT CATGGCGAGA ACAGCGCGGC TGGTCACAAG CGCGGCTCGG CGAGCACCTG CACGTCAGTT CGGATCTCAT CGCGAAGATC GAGAAGGCCC TGCGCTGGCC GACCCCGGAA TTCGCAACCG CCTGCGACAC CGCGCTCGCC GCAGGAGGCG CCCTCACCAA CCTGCTTCCG CTGGTCGAAC TGGAGCGACA CCAGGAGCGT GCCGCCGTGG CCACCACCGC CCGCGCGGTG CGCGCCGCCG TCTCCCAGCA GCGGCGCGGC ACGCCAACGG CATCCGCGTT CGCCGCCTCA CCGGGTGAGA TCGCCACTGG TGCGGCAGCG GATTGGGCAT CAGCGCTGCC CGGTGTGCAA CTCACGGCGC CCCCACCGGC ATGGCCGATC GACCGACTGC TGACCCTGCC ATACGGCCGA GCCGTACCCG CTACGACCCT GACCCTCACC AGCATTCCCG TCCAAGACCG CTGCCACGCC GCCCGCTATC CACGGACGCT CAACGGTGGC CACGCCGCCG CCTTCGGCCT GCGCGATCTC GTGGCGACGG AACTGCTGGA CGGGGACGAG CCCATCCTCG CCGTGGCCAC AACGCCCACC CTTGGATTGG CGGTACCGGC ATACCGCCTC GACGCCTTCA CCATCGGCAT CCTGTGGGCC CTGTGCGGCA TGGACGACGC TCTCCTGGCC GATGATGCCG CGCTCGCCGA CAGCGTCCCC CAACTCCGCC GCTACGCCCA CCTACCCGGC TCCGCAGTCA GCCGCAACGT CGCCGCCGAC CTCAGCACCT TGAGCCAGAT GTGGCTTGGC TCAGACTTCT GCGCCCGCTA CATCAGCCAC CGCCTCGACG CCGCGACCGA CCAGCCCGTC TTCTGGACCC GCGAGCAGTA CGGCGAAGAA GCCACTACCT GGCTGCTCTT CCGCCACAAG ATCGATTATC TGCACGCCAC AAGCGAACGC TTCACCACAC CCACCGCCCC GGCCGCCCGC GTCTTCTGCA TCCCAGAAAC CGCCGTGCGG GGCAGCCCCC ACGCGGAACG CATCCTGCTG CTGCTCTCGG CAGCGTTCAT GGAATCCCTG CGCATCGCCG TGCACGTGAG TCCAGACCCG GCGTACGCCA CCGTCGAAGG CTTCGTGCTG ACCCCACACA CGCAGGTCAT TCTCGCCAAC TGGGTGCGCG CCGATGGACT CTGGCACGTC GACGCCCTCG ATCGCCGTGT CGCGCTACGC CGCTACGACG ACGTTGCCCG CAGCGGCCAA GCCGGCTCCA TCACCGCATC GGGCCGGTCC ATTCGCCGCC TACGAGTCCT CGCCGAGTAC CTCGGCCTGG AATGGCCCTG GCTGCGTCGA CGCTGCACCG AATTGGCCGC CGTCGGCATC GACGGGATGA TCCGACCGCG CAGCCGGTTA CTGACCACTG ACGGGATCAA CGCAGCCTGC CGCTACCTGG CCAGCCTCCC TTGA
|
Protein sequence | MAQTPRELAP HISAKHYFGA QLRSWREQRG WSQARLGEHL HVSSDLIAKI EKALRWPTPE FATACDTALA AGGALTNLLP LVELERHQER AAVATTARAV RAAVSQQRRG TPTASAFAAS PGEIATGAAA DWASALPGVQ LTAPPPAWPI DRLLTLPYGR AVPATTLTLT SIPVQDRCHA ARYPRTLNGG HAAAFGLRDL VATELLDGDE PILAVATTPT LGLAVPAYRL DAFTIGILWA LCGMDDALLA DDAALADSVP QLRRYAHLPG SAVSRNVAAD LSTLSQMWLG SDFCARYISH RLDAATDQPV FWTREQYGEE ATTWLLFRHK IDYLHATSER FTTPTAPAAR VFCIPETAVR GSPHAERILL LLSAAFMESL RIAVHVSPDP AYATVEGFVL TPHTQVILAN WVRADGLWHV DALDRRVALR RYDDVARSGQ AGSITASGRS IRRLRVLAEY LGLEWPWLRR RCTELAAVGI DGMIRPRSRL LTTDGINAAC RYLASLP
|
| |
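The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence might be cleaned, summarized for GC content, and translated is given below. It assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it uses the standard amino-acid assignments, which translation table 11 shares with table 1. Alternative start codons are not handled. All function and variable names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments, also used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGGCCCAGA CTCCACGAGA ACTCGCACCC"
print(translate(prefix))             # MAQTPRELAP
print(round(gc_percent(prefix), 1))  # 63.3
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same helpers would allow the Protein sequence and GC content fields to be compared against the record.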