Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_4571 |
Symbol | |
ID | 7280510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011881 |
Strand | + |
Start bp | 83171 |
End bp | 86128 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643580685 |
Product | transcriptional activator domain protein |
Protein accession | YP_002478498 |
Protein GI | 219883337 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 190 |
Fosmid unclonability p-value | 0.723489 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCCCC GTCTTAACCA GCTGTTCAGT CAGTTGATAA ACCAGAACCG CGTGGTTTGG GTTGCCGCAA CGGCAGGTTC CGGCAAAACC ACGGCGATTG TTCAGGCTGC TGCCACATGG GGCGGCCCTA TTGCCTGGCT CACTCTGGAC GGGACAGACG CTGCCCCCGG CCGGCTTCTC ATCTATCTGG AGGCGGCAAT AGCAGCACAC GTGCCAGACG CTGCCGGTCT GGCCAGCAGC GCGCTCACGG CGCGTATTCC CCATCCCGAA GTGGCTGGTT TGCTGGCCGA GTCTGTAGGA GATGAAGATC TCCTGCTGGT GCTTGATGGC CTGGAGAATC TAGTTGGCGC CTACGAAGCG CTGGACGTCG TCGGGGCGGT TGTCCGTTAT GCACCGGTGG GACTCAAGGT TGTCCTGCTT ACTCGGGTTG ATCTACCGAT CGATCTAAGT GCTCAGGCTG GGGTTGATCG CGTGGCAACT ATAGGTGAGG AGGACCTGGC CTTCACTCCC GAGGAAGCTG CAGGGGCACT CGTTGAAGCC GGAATAACAG ACATTGACGC CTCCAGTGCC GTCGAGGCAA CAGGCGGCTG GGTAACAGGG GTTCTTTTTG AGGCGTGGCA CTCACGGCTG TACATTTCCG GAACCGGCGG TGAGGCTGAC CCCCTACATG GCTATCTGGC TTCGCAAATC CTGGCTAAGT TGGCTCCTGA AGAACGGGAG TTCCTCATCG TTTCTTCCTT ACTGGACGAA GTGACACCAT CCCGGGCGGC AGCCCTGGGT CAATCCAACG CCGGCGATTT GCTGGTTAGA CTGCGAAGCC ACCACTTGCC TGTTTCCTGG ATTTCCGGGA CCTACCGCAT GCGGTGCCAC CCTCGATTCC GAGAATACTT GGTGACTTGT TTGGAGCGCC GGGGAAAGGC AGAAGTTCAG GCTACACGGC GCGCCTATGG GGACCTTCTA GTCACCGAGG GCCATCTTGA GGAAGCTGTC GAGCAGTTCC TGGCCGCAGA AGAGCTGGAC CGAGCTGTGG ATGCTGCCGA AGTGGTTATA GGCGACGTAC TAGACCGACT CGATTTCGTT GTTGCCGAAA GGTGGCTCGG TTACCTCGCC CCACCTGGCA GTTCCGGGTC CCGGAGGCTG GGCCCGGCCA CACTCATGCT GTCAATAGCC CGCGAGGACT ACCGAAAGGG CGTGGCTATC GCCGATGATC TTCAGGCAAA CGGTGTCAGG GATGACCTGG CCCGTCTTTC CCCGCGGGGT GGTGCAATCA TGGCGTGGTG TTACTGGCAT CTGGACCGGC TAGACGATAT GCGGGCGGTG ATCGACCTGG CCCCGAATAG CCCAGAGATT GATGCTGTCC GTTACCTGCT ATCACTTGCC ACCCGTCGGG AAGCAACCGG CGCTTACCCC GCTCCTACCC TTAGTGGTGG ACCACTGGAT GCTTTGGTGA TGCGCGTGCA CTACGCCCAC GGAAGACTGA GTGAGGTCAG CAAAATGCCG GACTCCCCAT GGGCTGCTGC AGTCTCCGCT CCGTGGCGAG TGGGCGCTCT TAGAGCAACA GGCCGCTTGA CGGAGGCCTT GGAGCTCTAT CGCTCAGCTG ATGCTGGCCA CTGGGCGCCC GCCTGGATGC ACGGCATTGT TGGTCCGGAA CTGATGATCG ATCTGGAGGA CACAGAAGAA GCGCAACGGG TACTTGCCAA GGGCCAGGGA TTGGTCAGGG CAAGCGGCTC CGTTGTGTTC GAGTGGCTGA ACCGCCTCAT CGAAGCCAAA CTTGAGCTGC GTTTGAACCA CGACCCTGTT GCTGCGCTAA ATCTTCTCGA ACAGGTGGAG AACGCCGGAG GGCGCCACTA TGACTTCATT AGTGAGGCTT TGGATACCTG GAAGGGCCTC GCGCTTCTGC TTTCCCGGTC AGACAATGAC GACGCCGTCG TGATGCTCAG ACGAGCTGTT AACAGCATGA CCGAGGCCAA CCGCATTCTT GACCTGCCCG CTGCGGCGAT ATACCTAGCT GAAGCCGAAT GGAGACAGGG TGACTTGACA GAATCTGATG CAGCTGCCGA CCAGGCGATG GTTGCCGCCA GGCTCCAAGA ATCCAACCAC CAGATTCTAC TGGCCCTTGC GGATTTTCAG GCAGTACTAA CTCGTCGCCT GGACTCTGAA GAATTCACCG ACTCACCATG GCATGAACTG GGCCAGGCCC TGATGGCCCG CGGCGTGGGA GCAGGTTGGA ATCAGCATCC CGTCATTCTC TTATCCGAGT TCGGGCGGAT TGCCATCTCA GTCGCTGGGC AAGAGGTAAA GCCGCGCATT GCCAAAAGTC TGAATCTACT AGCCTATCTG GCCGCAGTCC CGAGTCACCA TGCTTCCCGC GAAGATCTGC TTTCGGCGCT CTTCGATGGT CAATCAGACG AGTCCGCGAG AGCCTACTTG CGCCAGGCCG CTCACCGGCT TCGGGAAGCT CTTCCTGCCG GTATCGGACC AATCTTCACC GGCAATACCT TGGCGTTCAC CACTCCCGTC ATTCTGGACA GCGAGTCCAC CAGATTCGAG GCGTTAATCG CAAAGGCTGC ACGACTCCGC GGGCAGGGAA GACTTGAAGC GCTTTTGAAG GCGCTTGCCA TCGTTGACAG CGGGGAGTAC CTGCCGGGGA TGGATTCATC CTGGGCAACG CAGCGGCGGG AGCAATTAGA GGAGCAGGCA GCCCAAGCTC GACTTCAGGC TGCCCAAATG GCATTTACAA CCCAGCAGTA CCGCCAGGCC GAGCAACTTG CGGAACAAGT TGTCGCCCAA GATCCGTATA AAGAGAGCGC TTGGCGAATT CTCATGCGGA TAGCCAGCGC CACAGGAAAT GAGGACGGCG TCGTAGCCTC CTACCGCCGC TGCAAGGCAG CACTCCAAGA GCTGGGCATC ACGCCCTCAG ACTCAACCCA ACAAATGTTT CAACGGCTCA GGCGTTAA
|
Protein sequence | MRPRLNQLFS QLINQNRVVW VAATAGSGKT TAIVQAAATW GGPIAWLTLD GTDAAPGRLL IYLEAAIAAH VPDAAGLASS ALTARIPHPE VAGLLAESVG DEDLLLVLDG LENLVGAYEA LDVVGAVVRY APVGLKVVLL TRVDLPIDLS AQAGVDRVAT IGEEDLAFTP EEAAGALVEA GITDIDASSA VEATGGWVTG VLFEAWHSRL YISGTGGEAD PLHGYLASQI LAKLAPEERE FLIVSSLLDE VTPSRAAALG QSNAGDLLVR LRSHHLPVSW ISGTYRMRCH PRFREYLVTC LERRGKAEVQ ATRRAYGDLL VTEGHLEEAV EQFLAAEELD RAVDAAEVVI GDVLDRLDFV VAERWLGYLA PPGSSGSRRL GPATLMLSIA REDYRKGVAI ADDLQANGVR DDLARLSPRG GAIMAWCYWH LDRLDDMRAV IDLAPNSPEI DAVRYLLSLA TRREATGAYP APTLSGGPLD ALVMRVHYAH GRLSEVSKMP DSPWAAAVSA PWRVGALRAT GRLTEALELY RSADAGHWAP AWMHGIVGPE LMIDLEDTEE AQRVLAKGQG LVRASGSVVF EWLNRLIEAK LELRLNHDPV AALNLLEQVE NAGGRHYDFI SEALDTWKGL ALLLSRSDND DAVVMLRRAV NSMTEANRIL DLPAAAIYLA EAEWRQGDLT ESDAAADQAM VAARLQESNH QILLALADFQ AVLTRRLDSE EFTDSPWHEL GQALMARGVG AGWNQHPVIL LSEFGRIAIS VAGQEVKPRI AKSLNLLAYL AAVPSHHASR EDLLSALFDG QSDESARAYL RQAAHRLREA LPAGIGPIFT GNTLAFTTPV ILDSESTRFE ALIAKAARLR GQGRLEALLK ALAIVDSGEY LPGMDSSWAT QRREQLEEQA AQARLQAAQM AFTTQQYRQA EQLAEQVVAQ DPYKESAWRI LMRIASATGN EDGVVASYRR CKAALQELGI TPSDSTQQMF QRLRR
|
| |