Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4979 |
Symbol | |
ID | 5706127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5650028 |
End bp | 5651788 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641274374 |
Product | putative transcriptional regulator |
Protein accession | YP_001539716 |
Protein GI | 159040463 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0011144 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCCCACG CCGTACAACA GGATCTTGAG GCCATCCTCG CCGGCACTGC GGCGCGGAAG CGTGAGAGCA GCATCCTCGA CTTCAAGGTG GCCAAATCCG ACCTTAAGGA CGCATGGGCA GACCTGGCCG AAGCGGCTGT GTGCTTCGCG AACGCATCTG GCGGAACCAT CGTCGTCGGA GTGTCCGACA CCCCGGGCGG CCACGGGGCG TTCATTGGGT GCGATCTAGA CGAGAATATG TTGCGTCAGC GCATCTACCA TCTTACGATG CCGGGCCTTC TTGTCGAGGT AGACGTCGTT CGCTTCGCTA ACAAGCGCCT CCTAGCAATC AGGGTGCCGG AAGGGTTGGA GGTGTACTCC ACTACTCGCG GATACACCTA TCACCGAGTC AATGACGAAT GCCTCCCTAT GCGTCCCGCC GAGGTAAGCC GCCTAGCAGA GGAGCGCAGA GGAGTTGACT GGTCTGCCGC GTCTAGTTCA CGCTCTTTAG AGGACGTAGA TCCTCTTGCC CTTAGGCAGT GCAGGCGACT TCTTTCCAAT TCCGTCGACT CTCGACGCCA GTCGTACGCA CGCCTTTCGG ACCATGACCT ACTCCGGTCA CTGAAAGCAG TAGGCGACGA CGAAAGACTG ACTCGCGCGG GTGAACTGTT ACTATGCACG GCTGCTTCGT CCGGCCCAGA AGATGCAGTA GTTTATCAAC ATAGAAAGAC TCAGGCAGGG GAGCCCGACG CGATCATGCG TCTGGGCACC CCACTCGTTC TCGCATTTGA TGAGCTCCTA CAGGCCATCC GCGCACGCCA GGGAATAACC CCAGTAACGC TGGCCGATGG ACAACAGCTT CAAATTGAAG ACTATCCGAT GGCCGCAGTT CGGGAAGCCG TCGCTAACGC ACTAATCCAC GGCGACTGGC GCGCCCGACT GCCTGTGTCA GTCGAACACT CGTCGCAGTA CTTGAAAGTA ACATCCCCCG GTCCACTGGT AAGCGGCATT ACTGTCGACA ACATTCTGAC CAAGGGATCC AGAGCCCGCC ACCCCGCCTT GGCCTCCGCC TTTCGCCTGC TTGGCCTGGC AGAAGAAGTT GGACAGGGCG TTGATCGCAT GTACCGGGAG ATGATCCGGT CCGGGCGAGA CACGCCCCTC ATCTCCGATA ACAACGACCA GGTAACGGTT CTTTTTCGCG GGCAGTCGCC CAATACCCGC ATTACCAAAT TTCTAGCGAC GCTTCCTCCA GAAGAGCAGG ACGACACGGA CGCCCTCCTG ATTGTCCTCG TCCTTTGCTC GAAGCGAACA ATCACAGCGA AGCAGCTAGC ACCGATCATT CAGCGCTCCG AACTGGAAGG GCAGACTGTC TTGCGACGCC TGTCAAATGA TCCTTCCATG CTATTGGAAC CGACTAGGGG AACCGCGAAT CGGACTCAAC CAACCTACCG ACTAACGGCT GACGCCCTTA CCCGCCTAGG GAACGCCGTC GCTTACCACG GTCGCACAAG CGATGAGGTA GACCGGAAAG TCATCGAACA TATGCGAGAC TACGGCGAGA TCAACAACCG AACCGTTCAA CGTCTGTTCG ATGTCGACGT ATACGCTGCT CGCGACATTC TAAAAGACCT GGTCGAGAGG CAAATCATTA CCCGAACCTC GGAGCAGACA CGCGGTGTCG CTGTCCGATA CGGACCGGGT TCGCTCTTTC CAGCGGCAGG AAAGAAGGGA AAACCCCCCA AGAACAAGAG GGTTACCGAC TTGGAAGATA AGCTGTTCTA G
|
Protein sequence | MSHAVQQDLE AILAGTAARK RESSILDFKV AKSDLKDAWA DLAEAAVCFA NASGGTIVVG VSDTPGGHGA FIGCDLDENM LRQRIYHLTM PGLLVEVDVV RFANKRLLAI RVPEGLEVYS TTRGYTYHRV NDECLPMRPA EVSRLAEERR GVDWSAASSS RSLEDVDPLA LRQCRRLLSN SVDSRRQSYA RLSDHDLLRS LKAVGDDERL TRAGELLLCT AASSGPEDAV VYQHRKTQAG EPDAIMRLGT PLVLAFDELL QAIRARQGIT PVTLADGQQL QIEDYPMAAV REAVANALIH GDWRARLPVS VEHSSQYLKV TSPGPLVSGI TVDNILTKGS RARHPALASA FRLLGLAEEV GQGVDRMYRE MIRSGRDTPL ISDNNDQVTV LFRGQSPNTR ITKFLATLPP EEQDDTDALL IVLVLCSKRT ITAKQLAPII QRSELEGQTV LRRLSNDPSM LLEPTRGTAN RTQPTYRLTA DALTRLGNAV AYHGRTSDEV DRKVIEHMRD YGEINNRTVQ RLFDVDVYAA RDILKDLVER QIITRTSEQT RGVAVRYGPG SLFPAAGKKG KPPKNKRVTD LEDKLF
|
| |