Gene Information |
Locus tag | Arth_3547 |
Symbol | |
ID | 4443770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3985971 |
End bp | 3987569 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639691371 |
Product | CdaR family transcriptional regulator |
Protein accession | YP_833022 |
Protein GI | 116672089 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3835] Sugar diacid utilization regulator |
TIGRFAM ID | |
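
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (consistent with "Is gene spliced | No"). The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3985971, 3987569)
print(length, protein_length_aa(length))  # 1599 532
```

Under these assumptions the computed values agree with the Gene Length (1599 bp) and Protein Length (532 aa) rows.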
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGCAG CTACAGTGGA CGACATCCTG TCGGATCTCC CTCTGGGTTT CGCCAGCCTC ATCCTCCGGC CGTCCCGCGC GGATTCCCCC ATTGAACGCT TCCTGATCGT TGACTCCGAC GACGAGACAT CCGACGGCGG CGCCGCCTTT GTGCTGCTGA TCGGGGTTCG CGGGCGCTCC GCTTTGCCCG CGCTGCGGCG CCTTCTGAAG GATCCTCCCC TGGTGATCGC GGTCAAGGGC AGTCCGGGTG AACTCGAGGA AGCCGAAGAG CTGCTCCGCG CAGCAGGAAC AGGCCTTCTC CTGGTGGACC CGGCCGCCGA CTGGGACCGG CTCCTGTCCA TCGCCAAGGA CCGCATCACG CCGCGCAGCT ACCAGAGCGA GGTGCTGACG CTGCTGGAAG AAGACCTGTT CGCCATCGCG CAGACCACCG CACGCCTGAC GTCCAGCCAC GTGGTCATCG AGGACGCCGC CAACAAGGTC CTGGCGTACT CCACGGTGAC GAATGACATC GACGAGCTCC GCAAGGCATC GATCCTTGCC CGCCGCGGTC CGCGGAAATA CGAACTCCTC CTCAAGGACC TCGGCGCCTA CCGCGAACTG CACCGGACCC GCCTGCCAGT CAGGGTTCCG GCCCGGCCCC AGGACGGACT TCGGGAGCGC ATTGCCATCA CATTGTTCGC CGGCGACCGC ATTATGGGGT ACATCTGGCT GCAGGAAACC GGCGACGGCT TCGGGGCCGA CGTCGACTAC GTCCTCACCG GATCCGCGGC GCGGGCGTCC GCCGAGCTGA TCCGTTACCG CAACCAGCAG TCCGTGCACA TGCGGCAGGA CCGGATTGCG CGGATCCTAT CCGGGCCGGC CGAAGCCGCG GCGAGCGCCC ACAGCGAGAA GATCCCGGCG GACCGCCCCG CAGCACTCGT TCTGCTAGGG ATGTCAGCAA CTGACGCGCA GGCGGACGAT GCGGCCCTCA AACACGGTGA ACTCGCCAAC CTCGCTTCCA TCCATGCGGC CGCCTATAAA CAGTCAGCCG TCGTGGGGCA GTTCAACGGC GACACCGCGG TGATCATCCC GGCGCTCCAG TCCGCGAATG CCGAAGCCGG ACTCCGGAGC CTCGCCGAAG CCATCGTGCG GGACGCTGGA AAGCATCTGG GTATCAGTCC CTTCGCCGCG GTGGGACCCA TCGCCCCTGA TCTGCTGTCC ATCCATTCGG TGACCGCAAA AACGGAGGCA CTGCTCGGCT GCATGCGGCG GTCCGGGACG GCCGGGGTGG CCACCGTTGA CGATTTTGAA GTGGACATCC TCTTCCAGGA AGCACTGGAG AACTTCACCG CTTCGGCTTT CCGCCATCGC AGCCTCTGGT CCCTCCTCCG CCACGACGGG GAACTGGCTG AGACCCTTCG GGTGTATTTC GAGGCGTCAC TGGATGTCAG CGAATGCGCC AGGCGAATGA AGCTGCACAA GAACACCGTT TACTACAGGA TCAGCAAGGC CAGCCGCGTG ACCGGCCTGA ACTTCACCGA TCCCCGCCAT TCATTGGTCG CCCTGCTCCA CATCCAGGAG TGGGCCGGCA AGCATAAGGA GCACCCGGGC AACCCATGA
|
Protein sequence | MAAATVDDIL SDLPLGFASL ILRPSRADSP IERFLIVDSD DETSDGGAAF VLLIGVRGRS ALPALRRLLK DPPLVIAVKG SPGELEEAEE LLRAAGTGLL LVDPAADWDR LLSIAKDRIT PRSYQSEVLT LLEEDLFAIA QTTARLTSSH VVIEDAANKV LAYSTVTNDI DELRKASILA RRGPRKYELL LKDLGAYREL HRTRLPVRVP ARPQDGLRER IAITLFAGDR IMGYIWLQET GDGFGADVDY VLTGSAARAS AELIRYRNQQ SVHMRQDRIA RILSGPAEAA ASAHSEKIPA DRPAALVLLG MSATDAQADD AALKHGELAN LASIHAAAYK QSAVVGQFNG DTAVIIPALQ SANAEAGLRS LAEAIVRDAG KHLGISPFAA VGPIAPDLLS IHSVTAKTEA LLGCMRRSGT AGVATVDDFE VDILFQEALE NFTASAFRHR SLWSLLRHDG ELAETLRVYF EASLDVSECA RRMKLHKNTV YYRISKASRV TGLNFTDPRH SLVALLHIQE WAGKHKEHPG NP
|
| |
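
The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be cleaned and its GC fraction computed follows; the helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(blocks: str) -> str:
    """Join space-separated display blocks into one uppercase sequence."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two display blocks of the gene sequence shown in this record.
first_blocks = "ATGGCCGCAG CTACAGTGGA"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 2))  # 20 0.6
```

Applied to the full Gene sequence field, the same functions could be used to compare the result against the reported Gene Length and GC content rows.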