Gene Arth_1222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1222 
Symbol 
ID4446285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1333310 
End bp1336090 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content66% 
IMG OID639689030 
ProductLuxR family transcriptional regulator 
Protein accessionYP_830716 
Protein GI116669783 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2197] Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGGATC CGCTCAAGTC TGACGGGAAG GCTTCGGCGC AGACGCGCGC CGGAACCAAG 
CGTCCGAATG CTGGTGAGCC CTGGATAGAC CGGGGGGACA TGCTGTCCGC AGTCAGGGAC
GCCCTTGCCT CGGACGACTG CTGGGCGGTT TTTGTGGTGG GCGACGCCGG ATTGGGCGCC
TCTTCCTTGC TGGCACAGCT GAACGACGCC CGAAACGGCC GGACGTCCGT GATGACTGTC
CACGGGAGTC CGTCCCTTGC ACCCGTTCCG TACGGAGCGC TGTCTCCGTA CCTGGCTGAT
CTCTCCGTTG ACGACGTCAC GTCCAAGGTG GCCGTCCTGA GGGCATTATG GGCGCACCTC
GAAAGCGGCA GGAAAAATCC GGGACCCGCC CTCCTGTTGG TTGACGACGC CCACGACCTG
GACACGGCCA CGGCCGAGAT GATTTCCGAA CTTGTCCAGG CGGGCTGGGC CAAACTCGTT
GCCACCTGCA TTCCGAGGCC CGGCATACCG CAGCCGCTGC TCAGGCTTTG GCACGACGGT
ACGGCCGAAA GATTCGACAT CGCCCCCTTG ACAATGGAAC AGGGGCATCA ACTCTGCGAG
GCGCTGCTCG ACGGCAAAGT CCTCAACAGC ACGTCGCGTC AATACTGGCA GGAGGCCGGA
GGCAACACAC TGCTGCTCAA GACCCTCGTC CGCGAGGCCC AGCGCTCCGG CGACCTGATC
CGGCGCAACG GAGTGTGGCT TAACACGGGT TCGTCCCACG TCCGGACCCT GGAACTCACC
GCCGTCGTCA AGGTCCAGTT GATGCGCATC TCGGCCGACG GCCGCGAGGC CCTGAACCTG
ATTGCGCTGG CGGAGCCGGT GGACAGGACC CTCGTGGGAG AAATCGTCGG TGAACCTGCC
GTCAAGGAAC TGCTGGACCA GCGGCTCGTG GCCCAGTCGG TGGACAGCGA GCCGACTCTC
CGGCTGGTCA GCCCCGTCTA CGGCGAAGTG CTGCGCCGGA TCGTTCCCGC AGCCCGAAGC
CTCCAGCTGC ACCGGGAACT CGTGTCCCGG ATGGAAGTCT CGGCAGACAA ACCCGAGTCG
CTGCTCCGAA TAGTTTCCTG GTCCCTGGAC TGCGGGGCCG AAGTGCCTGC ACGCCTGCTG
GTCCGGGCTG CCGTGCTCGC TTGCAAACTG TTTGAGAGCG AGACGGCCCT GCGCATCGCC
TGGGCCGTCA AGGACCCGGA ACTGCAACAA ACTGCGCGCG CCGTGATGGC CCGCTCACAC
TACAACCTCG GGCACTACGA CGAAGCTGCC GCCCTTCTGA ACGTCAACGT CGACGCCGGT
AGCGGCCTCA CGCACATGGT GGTCAACAGC CTCCTGAGAT CGGCCACCCG GGCGGCACTC
GGGCACTCCG CCTCCGAAAT CGCGGAAGAT GCCAGCCGGC TGCGCCAATG GGGCGAAAAA
GAGGCCGCGG CCAATCCCGC GGACTCAACC GCCATCCTCA AGACCACGGC ACACCGGGCG
TCCCTTGTGG AACTGATGGC GTACTCGCTC GCGGGCGACT ACAGGCAAAT GGGGCCCGCG
CTCGATGACC TGCTGGCAGC CGGCGGTTCG CCCGCGGCAC CCGATGGCGC GCTCACCCGG
TCCATGGCCC TGGCACTGCA GGCGGAGCGC CTGTGCGCGC TGGGGTATCC GCTGCAAGGG
CGCCAACTCG CCGCGGAGGC CTTCGGCCAG CCCCAGCCTC CCGACAACGA CATCTTCTTC
CTCCCCGAGT TCATCGTGGT CCGACTTGTT GCCTGCGATC TTGCGGCCGG TGAATGGGCC
GAGGCGGAGC AGCTCCTGGC GGGTTACGCG GAAACACCCT TCGGTTCCGC CATGGTTTCG
TTCAGCGGAG CCGCCTACGT CGTAATGGGG TATATAGCCG TCCGGCAGGG AAAGCTCGCT
GACGCCCTGG AGTTACTGAC CACGGGTCTG GAGGCCCTCC GCGACAGTGA TCCCCAGCAG
ATGTTCCGGC TGTGCGCGGC CATGGCCTAT TACGTGGCGG CCGCCCAAGG CCTCAGAGCC
GAGGCCGACA GGCTGAAGTC GGACTATGAC GCCTGGGGTG AGCGGGGCAT GCACCTGATG
ACGGCTTTTG CCCGGGATTT CCTGGCTGCC GGTGACGAAC ACCTGGCCGG GGACGGAACC
GGCATCGCGG CGCTGCACCG GAGCGCAGGG GAAGCGGCGC AACAGGACGC CAAGTTCCTG
GAGCTGAATG CCCTCGCCAT GGCGCTGGTC CTCGGAGACA CGACCAGGCT GGAGCGGCTG
CAGGAACTCG CCGGCGCGGT GGAAGGCGCC TGGGCGGCTG CCCTGGCTGA ATACTGTGCG
GCGCTTTCCC ATGGCAGCGC CGAACTGTTC CTGCGCGCGG GGGATTCCCT CTCCGCTGCC
TCCGTCTTCC AGTTGGCCGC GGACGCTTAT TCAGCCGCGC TGGCTTCACT GGACCGGAGT
CGGGACCGGG AACTGGCAGC CTTGGCCCGC GCAGGCATCG CCCGGTGCAA TGAAGAGCTG
GGCCATGCTG GAAGCGAAGC CCAACCTGAC ACCGTGGCTC CCGTTTTGAC CAAACGAGAG
CGGAACATTG TGGGGCTGGC GGCAACCGGA CTCAGTGACC GCCAGATCGC AGACAAACTG
CAGATTTCTG TCCGCACCGT GGAAGGTCAC CTTTATCGTT GCTACCTGAA ACTTGGTATC
GCCGGACGCG ACGAGTTGGC CGCGGCAGCA GGACTTGGAG AAGACGCAGC GGTGCGGGCT
AAAGCATCCC CCGGGAAATA G
 
Protein sequence
MKDPLKSDGK ASAQTRAGTK RPNAGEPWID RGDMLSAVRD ALASDDCWAV FVVGDAGLGA 
SSLLAQLNDA RNGRTSVMTV HGSPSLAPVP YGALSPYLAD LSVDDVTSKV AVLRALWAHL
ESGRKNPGPA LLLVDDAHDL DTATAEMISE LVQAGWAKLV ATCIPRPGIP QPLLRLWHDG
TAERFDIAPL TMEQGHQLCE ALLDGKVLNS TSRQYWQEAG GNTLLLKTLV REAQRSGDLI
RRNGVWLNTG SSHVRTLELT AVVKVQLMRI SADGREALNL IALAEPVDRT LVGEIVGEPA
VKELLDQRLV AQSVDSEPTL RLVSPVYGEV LRRIVPAARS LQLHRELVSR MEVSADKPES
LLRIVSWSLD CGAEVPARLL VRAAVLACKL FESETALRIA WAVKDPELQQ TARAVMARSH
YNLGHYDEAA ALLNVNVDAG SGLTHMVVNS LLRSATRAAL GHSASEIAED ASRLRQWGEK
EAAANPADST AILKTTAHRA SLVELMAYSL AGDYRQMGPA LDDLLAAGGS PAAPDGALTR
SMALALQAER LCALGYPLQG RQLAAEAFGQ PQPPDNDIFF LPEFIVVRLV ACDLAAGEWA
EAEQLLAGYA ETPFGSAMVS FSGAAYVVMG YIAVRQGKLA DALELLTTGL EALRDSDPQQ
MFRLCAAMAY YVAAAQGLRA EADRLKSDYD AWGERGMHLM TAFARDFLAA GDEHLAGDGT
GIAALHRSAG EAAQQDAKFL ELNALAMALV LGDTTRLERL QELAGAVEGA WAAALAEYCA
ALSHGSAELF LRAGDSLSAA SVFQLAADAY SAALASLDRS RDRELAALAR AGIARCNEEL
GHAGSEAQPD TVAPVLTKRE RNIVGLAATG LSDRQIADKL QISVRTVEGH LYRCYLKLGI
AGRDELAAAA GLGEDAAVRA KASPGK