Gene Achl_4571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_4571 
Symbol 
ID7280510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011881 
Strand
Start bp83171 
End bp86128 
Gene Length2958 bp 
Protein Length985 aa 
Translation table11 
GC content59% 
IMG OID643580685 
Producttranscriptional activator domain protein 
Protein accessionYP_002478498 
Protein GI219883337 
COG category[K] Transcription 
COG ID[COG2909] ATP-dependent transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones190 
Fosmid unclonability p-value0.723489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCCCC GTCTTAACCA GCTGTTCAGT CAGTTGATAA ACCAGAACCG CGTGGTTTGG 
GTTGCCGCAA CGGCAGGTTC CGGCAAAACC ACGGCGATTG TTCAGGCTGC TGCCACATGG
GGCGGCCCTA TTGCCTGGCT CACTCTGGAC GGGACAGACG CTGCCCCCGG CCGGCTTCTC
ATCTATCTGG AGGCGGCAAT AGCAGCACAC GTGCCAGACG CTGCCGGTCT GGCCAGCAGC
GCGCTCACGG CGCGTATTCC CCATCCCGAA GTGGCTGGTT TGCTGGCCGA GTCTGTAGGA
GATGAAGATC TCCTGCTGGT GCTTGATGGC CTGGAGAATC TAGTTGGCGC CTACGAAGCG
CTGGACGTCG TCGGGGCGGT TGTCCGTTAT GCACCGGTGG GACTCAAGGT TGTCCTGCTT
ACTCGGGTTG ATCTACCGAT CGATCTAAGT GCTCAGGCTG GGGTTGATCG CGTGGCAACT
ATAGGTGAGG AGGACCTGGC CTTCACTCCC GAGGAAGCTG CAGGGGCACT CGTTGAAGCC
GGAATAACAG ACATTGACGC CTCCAGTGCC GTCGAGGCAA CAGGCGGCTG GGTAACAGGG
GTTCTTTTTG AGGCGTGGCA CTCACGGCTG TACATTTCCG GAACCGGCGG TGAGGCTGAC
CCCCTACATG GCTATCTGGC TTCGCAAATC CTGGCTAAGT TGGCTCCTGA AGAACGGGAG
TTCCTCATCG TTTCTTCCTT ACTGGACGAA GTGACACCAT CCCGGGCGGC AGCCCTGGGT
CAATCCAACG CCGGCGATTT GCTGGTTAGA CTGCGAAGCC ACCACTTGCC TGTTTCCTGG
ATTTCCGGGA CCTACCGCAT GCGGTGCCAC CCTCGATTCC GAGAATACTT GGTGACTTGT
TTGGAGCGCC GGGGAAAGGC AGAAGTTCAG GCTACACGGC GCGCCTATGG GGACCTTCTA
GTCACCGAGG GCCATCTTGA GGAAGCTGTC GAGCAGTTCC TGGCCGCAGA AGAGCTGGAC
CGAGCTGTGG ATGCTGCCGA AGTGGTTATA GGCGACGTAC TAGACCGACT CGATTTCGTT
GTTGCCGAAA GGTGGCTCGG TTACCTCGCC CCACCTGGCA GTTCCGGGTC CCGGAGGCTG
GGCCCGGCCA CACTCATGCT GTCAATAGCC CGCGAGGACT ACCGAAAGGG CGTGGCTATC
GCCGATGATC TTCAGGCAAA CGGTGTCAGG GATGACCTGG CCCGTCTTTC CCCGCGGGGT
GGTGCAATCA TGGCGTGGTG TTACTGGCAT CTGGACCGGC TAGACGATAT GCGGGCGGTG
ATCGACCTGG CCCCGAATAG CCCAGAGATT GATGCTGTCC GTTACCTGCT ATCACTTGCC
ACCCGTCGGG AAGCAACCGG CGCTTACCCC GCTCCTACCC TTAGTGGTGG ACCACTGGAT
GCTTTGGTGA TGCGCGTGCA CTACGCCCAC GGAAGACTGA GTGAGGTCAG CAAAATGCCG
GACTCCCCAT GGGCTGCTGC AGTCTCCGCT CCGTGGCGAG TGGGCGCTCT TAGAGCAACA
GGCCGCTTGA CGGAGGCCTT GGAGCTCTAT CGCTCAGCTG ATGCTGGCCA CTGGGCGCCC
GCCTGGATGC ACGGCATTGT TGGTCCGGAA CTGATGATCG ATCTGGAGGA CACAGAAGAA
GCGCAACGGG TACTTGCCAA GGGCCAGGGA TTGGTCAGGG CAAGCGGCTC CGTTGTGTTC
GAGTGGCTGA ACCGCCTCAT CGAAGCCAAA CTTGAGCTGC GTTTGAACCA CGACCCTGTT
GCTGCGCTAA ATCTTCTCGA ACAGGTGGAG AACGCCGGAG GGCGCCACTA TGACTTCATT
AGTGAGGCTT TGGATACCTG GAAGGGCCTC GCGCTTCTGC TTTCCCGGTC AGACAATGAC
GACGCCGTCG TGATGCTCAG ACGAGCTGTT AACAGCATGA CCGAGGCCAA CCGCATTCTT
GACCTGCCCG CTGCGGCGAT ATACCTAGCT GAAGCCGAAT GGAGACAGGG TGACTTGACA
GAATCTGATG CAGCTGCCGA CCAGGCGATG GTTGCCGCCA GGCTCCAAGA ATCCAACCAC
CAGATTCTAC TGGCCCTTGC GGATTTTCAG GCAGTACTAA CTCGTCGCCT GGACTCTGAA
GAATTCACCG ACTCACCATG GCATGAACTG GGCCAGGCCC TGATGGCCCG CGGCGTGGGA
GCAGGTTGGA ATCAGCATCC CGTCATTCTC TTATCCGAGT TCGGGCGGAT TGCCATCTCA
GTCGCTGGGC AAGAGGTAAA GCCGCGCATT GCCAAAAGTC TGAATCTACT AGCCTATCTG
GCCGCAGTCC CGAGTCACCA TGCTTCCCGC GAAGATCTGC TTTCGGCGCT CTTCGATGGT
CAATCAGACG AGTCCGCGAG AGCCTACTTG CGCCAGGCCG CTCACCGGCT TCGGGAAGCT
CTTCCTGCCG GTATCGGACC AATCTTCACC GGCAATACCT TGGCGTTCAC CACTCCCGTC
ATTCTGGACA GCGAGTCCAC CAGATTCGAG GCGTTAATCG CAAAGGCTGC ACGACTCCGC
GGGCAGGGAA GACTTGAAGC GCTTTTGAAG GCGCTTGCCA TCGTTGACAG CGGGGAGTAC
CTGCCGGGGA TGGATTCATC CTGGGCAACG CAGCGGCGGG AGCAATTAGA GGAGCAGGCA
GCCCAAGCTC GACTTCAGGC TGCCCAAATG GCATTTACAA CCCAGCAGTA CCGCCAGGCC
GAGCAACTTG CGGAACAAGT TGTCGCCCAA GATCCGTATA AAGAGAGCGC TTGGCGAATT
CTCATGCGGA TAGCCAGCGC CACAGGAAAT GAGGACGGCG TCGTAGCCTC CTACCGCCGC
TGCAAGGCAG CACTCCAAGA GCTGGGCATC ACGCCCTCAG ACTCAACCCA ACAAATGTTT
CAACGGCTCA GGCGTTAA
 
Protein sequence
MRPRLNQLFS QLINQNRVVW VAATAGSGKT TAIVQAAATW GGPIAWLTLD GTDAAPGRLL 
IYLEAAIAAH VPDAAGLASS ALTARIPHPE VAGLLAESVG DEDLLLVLDG LENLVGAYEA
LDVVGAVVRY APVGLKVVLL TRVDLPIDLS AQAGVDRVAT IGEEDLAFTP EEAAGALVEA
GITDIDASSA VEATGGWVTG VLFEAWHSRL YISGTGGEAD PLHGYLASQI LAKLAPEERE
FLIVSSLLDE VTPSRAAALG QSNAGDLLVR LRSHHLPVSW ISGTYRMRCH PRFREYLVTC
LERRGKAEVQ ATRRAYGDLL VTEGHLEEAV EQFLAAEELD RAVDAAEVVI GDVLDRLDFV
VAERWLGYLA PPGSSGSRRL GPATLMLSIA REDYRKGVAI ADDLQANGVR DDLARLSPRG
GAIMAWCYWH LDRLDDMRAV IDLAPNSPEI DAVRYLLSLA TRREATGAYP APTLSGGPLD
ALVMRVHYAH GRLSEVSKMP DSPWAAAVSA PWRVGALRAT GRLTEALELY RSADAGHWAP
AWMHGIVGPE LMIDLEDTEE AQRVLAKGQG LVRASGSVVF EWLNRLIEAK LELRLNHDPV
AALNLLEQVE NAGGRHYDFI SEALDTWKGL ALLLSRSDND DAVVMLRRAV NSMTEANRIL
DLPAAAIYLA EAEWRQGDLT ESDAAADQAM VAARLQESNH QILLALADFQ AVLTRRLDSE
EFTDSPWHEL GQALMARGVG AGWNQHPVIL LSEFGRIAIS VAGQEVKPRI AKSLNLLAYL
AAVPSHHASR EDLLSALFDG QSDESARAYL RQAAHRLREA LPAGIGPIFT GNTLAFTTPV
ILDSESTRFE ALIAKAARLR GQGRLEALLK ALAIVDSGEY LPGMDSSWAT QRREQLEEQA
AQARLQAAQM AFTTQQYRQA EQLAEQVVAQ DPYKESAWRI LMRIASATGN EDGVVASYRR
CKAALQELGI TPSDSTQQMF QRLRR