Gene Arth_3547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3547 
Symbol 
ID4443770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3985971 
End bp3987569 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content65% 
IMG OID639691371 
ProductCdaR family transcriptional regulator 
Protein accessionYP_833022 
Protein GI116672089 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3835] Sugar diacid utilization regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCAG CTACAGTGGA CGACATCCTG TCGGATCTCC CTCTGGGTTT CGCCAGCCTC 
ATCCTCCGGC CGTCCCGCGC GGATTCCCCC ATTGAACGCT TCCTGATCGT TGACTCCGAC
GACGAGACAT CCGACGGCGG CGCCGCCTTT GTGCTGCTGA TCGGGGTTCG CGGGCGCTCC
GCTTTGCCCG CGCTGCGGCG CCTTCTGAAG GATCCTCCCC TGGTGATCGC GGTCAAGGGC
AGTCCGGGTG AACTCGAGGA AGCCGAAGAG CTGCTCCGCG CAGCAGGAAC AGGCCTTCTC
CTGGTGGACC CGGCCGCCGA CTGGGACCGG CTCCTGTCCA TCGCCAAGGA CCGCATCACG
CCGCGCAGCT ACCAGAGCGA GGTGCTGACG CTGCTGGAAG AAGACCTGTT CGCCATCGCG
CAGACCACCG CACGCCTGAC GTCCAGCCAC GTGGTCATCG AGGACGCCGC CAACAAGGTC
CTGGCGTACT CCACGGTGAC GAATGACATC GACGAGCTCC GCAAGGCATC GATCCTTGCC
CGCCGCGGTC CGCGGAAATA CGAACTCCTC CTCAAGGACC TCGGCGCCTA CCGCGAACTG
CACCGGACCC GCCTGCCAGT CAGGGTTCCG GCCCGGCCCC AGGACGGACT TCGGGAGCGC
ATTGCCATCA CATTGTTCGC CGGCGACCGC ATTATGGGGT ACATCTGGCT GCAGGAAACC
GGCGACGGCT TCGGGGCCGA CGTCGACTAC GTCCTCACCG GATCCGCGGC GCGGGCGTCC
GCCGAGCTGA TCCGTTACCG CAACCAGCAG TCCGTGCACA TGCGGCAGGA CCGGATTGCG
CGGATCCTAT CCGGGCCGGC CGAAGCCGCG GCGAGCGCCC ACAGCGAGAA GATCCCGGCG
GACCGCCCCG CAGCACTCGT TCTGCTAGGG ATGTCAGCAA CTGACGCGCA GGCGGACGAT
GCGGCCCTCA AACACGGTGA ACTCGCCAAC CTCGCTTCCA TCCATGCGGC CGCCTATAAA
CAGTCAGCCG TCGTGGGGCA GTTCAACGGC GACACCGCGG TGATCATCCC GGCGCTCCAG
TCCGCGAATG CCGAAGCCGG ACTCCGGAGC CTCGCCGAAG CCATCGTGCG GGACGCTGGA
AAGCATCTGG GTATCAGTCC CTTCGCCGCG GTGGGACCCA TCGCCCCTGA TCTGCTGTCC
ATCCATTCGG TGACCGCAAA AACGGAGGCA CTGCTCGGCT GCATGCGGCG GTCCGGGACG
GCCGGGGTGG CCACCGTTGA CGATTTTGAA GTGGACATCC TCTTCCAGGA AGCACTGGAG
AACTTCACCG CTTCGGCTTT CCGCCATCGC AGCCTCTGGT CCCTCCTCCG CCACGACGGG
GAACTGGCTG AGACCCTTCG GGTGTATTTC GAGGCGTCAC TGGATGTCAG CGAATGCGCC
AGGCGAATGA AGCTGCACAA GAACACCGTT TACTACAGGA TCAGCAAGGC CAGCCGCGTG
ACCGGCCTGA ACTTCACCGA TCCCCGCCAT TCATTGGTCG CCCTGCTCCA CATCCAGGAG
TGGGCCGGCA AGCATAAGGA GCACCCGGGC AACCCATGA
 
Protein sequence
MAAATVDDIL SDLPLGFASL ILRPSRADSP IERFLIVDSD DETSDGGAAF VLLIGVRGRS 
ALPALRRLLK DPPLVIAVKG SPGELEEAEE LLRAAGTGLL LVDPAADWDR LLSIAKDRIT
PRSYQSEVLT LLEEDLFAIA QTTARLTSSH VVIEDAANKV LAYSTVTNDI DELRKASILA
RRGPRKYELL LKDLGAYREL HRTRLPVRVP ARPQDGLRER IAITLFAGDR IMGYIWLQET
GDGFGADVDY VLTGSAARAS AELIRYRNQQ SVHMRQDRIA RILSGPAEAA ASAHSEKIPA
DRPAALVLLG MSATDAQADD AALKHGELAN LASIHAAAYK QSAVVGQFNG DTAVIIPALQ
SANAEAGLRS LAEAIVRDAG KHLGISPFAA VGPIAPDLLS IHSVTAKTEA LLGCMRRSGT
AGVATVDDFE VDILFQEALE NFTASAFRHR SLWSLLRHDG ELAETLRVYF EASLDVSECA
RRMKLHKNTV YYRISKASRV TGLNFTDPRH SLVALLHIQE WAGKHKEHPG NP