Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2326 |
Symbol | |
ID | 5704250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2672473 |
End bp | 2675418 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641271804 |
Product | ATP-dependent transcription regulator LuxR |
Protein accession | YP_001537175 |
Protein GI | 159037922 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00645479 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGCCAG TATTGGCATC ATCACGCGAT GGACCTGCAC GAGCTGTGGG GAGGAACGTC TTGGGTAAGT CTGTGCCGCC GGTGCGCATT CACGCACACC CGTTGCGGAA GGTGCGTGGC TGGGAGGAGC ACCTGGAGCA GATCAGTACC GATCTGCGTC GCGTCGTGTC CGACCGGCGG AGCCGGGTGC TCATGGTGAA GGCAGCGCCC GGCGGCGGTA AGACCCGATT ACTGGCCGAG GCGGCCACCG TCGCCGACGA CATGGGCTTC ACCATCGTCG GTGGTGTCGT GGCTGGTCCT GATGCGGTGC CTGACATGGC GCACCTGCCG GCCGCTACTC AGGTCCGGGT GGCTGGCAGC AGCACAGACC GATTCTCCAG TCCACGGGCG ACCTCCCTGG TCGAGGCGAC GGGGGCGCGG CTGCGTCGGG CCGCGAATGA CGTTCCCACC GTGGTGACCC TCGATGACCT GCACCTGGCC GACTTTCCCA CCCTGATGGC GTTGTGTGAC CTGATTCTGG CGCTCAAAGG CCGACCGATC CTCTGGCTGC TGACCTTTAC CTCCGCGACG AGCGCCGCGC CCTCCGAACA AGCGTCGATG TGTTTGAGTC GACTTCGGGG CAGAGTGGCC GTCGAACCGA TCCGGCGGTT GAGTCCGCTC TCGAACGAGG CACTGGAGCA ACTGGTCACC GACCATACCG GCTCCGTCCC CGACCCAACG CTGCTCGCCC TGGTGGAAAG CCTCAACGAT ACGCCCTCCG CAGTGATCGA GCTGATACGT GGGCTCATGG AGGACGGTGT CATCTGCACA GTCGAGGGCA CCCTACGGTT GACCCCTGGT ACGCCCGGCA GCTCCCGCGA CGTGAATGCT GTTCATGTGC CCGCACCAGT GCCGAAGCGA CTATCCTCAA TGATCGAGGA GAATCTCCTA CGACTTTCTG ATTCGACTGT CAAGTCCCTT CGATTGGCGG CCGTGCTCGG ATCACCGTTC GCGCCGGAGG AACTGTCGGC CATGCTCGAC GAATCACCGG CTGGCCTACT CACCGCGGTG CATGAGGCGG TGGAACGCGG AGTACTGGTC TGTTGTGGGC AGAATCTGGC ATTCCGCACC GAGTCGATCT GGCGGGTACT ACTTGACTCT GTGCCGCCAC CGGTACGCGC GTTGTTGCGT CGGCAGGCGG CCGAAACGTT GCTGCGACGT CCCGACGGGG CCGAACGTGC CGCCCTTCAA CTGGTGCACG TGGCTCGACC CGGTGACGCC AAGGATCTCG ACGTCATCGC CGAAGGAGCC CGACGCCTCG TCGCCACCGA TCCCACCACC GCAGCATCCC TGGCGATCCG TGGGATGGAA CTGCTGGACC CCGGACAGGA CCAACGCGTG CGTCTTGCCA ATACCGCGGT GAAGGCACTC ACCAAGGTGG GGCGCCTGGA CGAGGCGGTC ACGCTGGCCA GGGACGAGAT CGAGGTGGCC GCGGCCCCCA CCTCGGTGCC ACCCGCCCGT GACATCGCCG CCTTGCAGGC GTCGATGTCG ACCGCGTTGC TGCTGCGAGG CGACGCGTCG GCCGCCCGCC AAGCGACCGA CGACGTACTG GCCAGGCAGG CGGGCGGACT CCCCCGGGCC GAGGTGGTCA TCACCCGTCT CGCCGCTTCC TACCTGATTG GTGACAGTAC GGCGGTCGAG CAGGCACGTA CGATCCTCAG CAGGCCCGAC CGACGCGACC GGGCGGTCAG GGTCAGCGCG ATGACTGTCC ACGCGTTGGA TCGGTGGCGA ACTGGCCACG TGGTGGACGC GGTGGGGATC CTGCGGGACG CTGTCGCGCT CAACCATGTC GGTGAGACCG TTCAGGTCCT CGACCCGTGC TGTTTCCTCG CCTTCGCGCT TGTCCGGATC GACGAGTTCG GTGAGGCGGA GGAGGTCGTT CGAGGGTATG GCAGAACGAC CGCCTCGACG GAGAGTTCTC CGGCTACCGC AGTCCAGGCC GTGCTTCGCG CGCCCCTGCA CCTCGCCCAG GGGCAGCTGA ACGAAGCCGA ACAGGTTGCC CGAGTCGCGT TGCGACAGGA CGGCCCTTAC ATGCCGGTAT TCGCCCCCCA GGCATTTCTG GTCCTGGCCC ATGTGGCCTT ACGTCGTGGA ACTCCGGCCC AGGCCGCGAT TCATCTGAAG GCGTTGGAAG GGGAGGGCCT GCAGCATTCC TCCAGCCCCT GGCAGGCCGA GTATCTCTTG CTGCAGGCAC AGCTGGCCGA GGTGAACGAC GGCCCTGCAG CGGCACTGGA GGTGTTGACC GAGACCGGGG CGCAGCCCAT CACACCTCGT GAGATCGTCT TGGAGGATCC GGCAGTCGCG GCCTGGTGGG TACGTTGCGC ACTCACGGCA GACCAGCCCG AGATCGCGGG AGGGGTGGTC GACACCATCG AGGACCTCAG CAAGCTCAAC CCGGAGGTTC CCGCGCTGTT CGCTGCGGCC ACACACGCAC GGGCGCTCGC CGAGGCGGAC ACCGACGCCC TCGCCGAGGC CGGCCGACTA CACCGAAATC CCTGGGCCCG GGCCGCAACA GCCGAGGACC GTGCCCGGAT CTTTCTTTTC CGTGGTGACC ACGAGTCCGC CATTACCGAG CTGGACTGTG CCATGAACGC CTACAACCAG CTCGGAGCGG AACGAGAGGC CGCTCGCGTG CGCGCCCAGC TACGTGGACT CGGAGTGTGG CGTCGACACT GGAAGCAGGC AAAACGTCCA CTGTCGGGCT GGGAGAGTCT CACCGAGACC GAACGGAAGG TTGCCAAGCT CGTGGCCAAC GGACTCACCA ACCAGCAGGC CGCCGACCAC CTGTTCATCT CACCGCACAC GGTCGGATTT CACCTGCGCC AAATCTACCG CAAACTCGGT ATTCGATCCC GCTCAGCGCT GATCCGATAT GACGCATCAC GCGTATCGCG CCCCGAAGCG TCGTAG
|
Protein sequence | MTPVLASSRD GPARAVGRNV LGKSVPPVRI HAHPLRKVRG WEEHLEQIST DLRRVVSDRR SRVLMVKAAP GGGKTRLLAE AATVADDMGF TIVGGVVAGP DAVPDMAHLP AATQVRVAGS STDRFSSPRA TSLVEATGAR LRRAANDVPT VVTLDDLHLA DFPTLMALCD LILALKGRPI LWLLTFTSAT SAAPSEQASM CLSRLRGRVA VEPIRRLSPL SNEALEQLVT DHTGSVPDPT LLALVESLND TPSAVIELIR GLMEDGVICT VEGTLRLTPG TPGSSRDVNA VHVPAPVPKR LSSMIEENLL RLSDSTVKSL RLAAVLGSPF APEELSAMLD ESPAGLLTAV HEAVERGVLV CCGQNLAFRT ESIWRVLLDS VPPPVRALLR RQAAETLLRR PDGAERAALQ LVHVARPGDA KDLDVIAEGA RRLVATDPTT AASLAIRGME LLDPGQDQRV RLANTAVKAL TKVGRLDEAV TLARDEIEVA AAPTSVPPAR DIAALQASMS TALLLRGDAS AARQATDDVL ARQAGGLPRA EVVITRLAAS YLIGDSTAVE QARTILSRPD RRDRAVRVSA MTVHALDRWR TGHVVDAVGI LRDAVALNHV GETVQVLDPC CFLAFALVRI DEFGEAEEVV RGYGRTTAST ESSPATAVQA VLRAPLHLAQ GQLNEAEQVA RVALRQDGPY MPVFAPQAFL VLAHVALRRG TPAQAAIHLK ALEGEGLQHS SSPWQAEYLL LQAQLAEVND GPAAALEVLT ETGAQPITPR EIVLEDPAVA AWWVRCALTA DQPEIAGGVV DTIEDLSKLN PEVPALFAAA THARALAEAD TDALAEAGRL HRNPWARAAT AEDRARIFLF RGDHESAITE LDCAMNAYNQ LGAEREAARV RAQLRGLGVW RRHWKQAKRP LSGWESLTET ERKVAKLVAN GLTNQQAADH LFISPHTVGF HLRQIYRKLG IRSRSALIRY DASRVSRPEA S
|
| |