Gene Arth_3698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3698 
Symbol 
ID4443699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4160552 
End bp4162201 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content69% 
IMG OID639691522 
ProductCdaR family transcriptional regulator 
Protein accessionYP_833173 
Protein GI116672240 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3835] Sugar diacid utilization regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAATAA AGCTCAGTGA GATCCTAAAA CATGCCACCC TCACAGCCGC CGATCCCGTG 
ATCCGCGCCG GTGCAGGCGC AGTGGCGGGT ACGCAGCTGC GGTGGGTGCA CTCCAGTGAA
GTCCTGGACA TCGCGCCGTT ACTTAGCGGC GGCGAGCTGC TTCTTACCGG CGGCGACGCG
CTGGTTACTG CCTCGGATGC ACGCCGGGCG GAGTACGTCC GGCAGCTCTC GGAGCGCGGG
GTTGGTGCGC TGGCGGTGGA AACTGGCCAG CGGCTCGCTT CACTGCCGTC GTCGATGATC
CAGGCAGCGG AAACCGCCGG GCTTCCCCTT ATCGAATTCC GCAAGGTGGT GCCCTTCGTG
GGCATCATGC AGGCGATTAA CTCGATGCTC GTCAGTGAAT CGGTGGCCCA TCTGCGCCGC
GCGGATGAGG CCAGCCACGC CATGGCCGTT GAGCTGGCCC ACGGGAGCAG CCTGGATCAG
ATCCTTGCCG TGCTGGCCGG AATCATTGGT GCGACCCTGG AGCTCTCATC CGTGTCCGGT
GTAACGCTGG GCAACGCCGG CCCGGGCGGC GCCGATACGC GTCCGGCGGC CGGCGCCGGT
ACGCATCCGG GGAACGGCAC GGCTTCCCAG CCGGCAGAGG GCCTGGAGTC CGGCACCGGC
TCCCCGGCAG AGCCCGGAAG CGCGGCAGAG CCCGGAAGCG CGGCAGAGAC GGGCATCGGC
GGCACGTTGA TAAGCATTGA TGTGCCGGTC CGTGGCGTGC CCTCCGCGAG GTTGCTCATC
AATGTCCCGG CCGACGGCGA CGTCAACCTG GCGCGCGTGG CCGGCGGCCG CTGCGTGGAC
ATCCTGTCGC TCGCTCTGCT GCAGCGGATG CCGCCCGGGC TGAAAGAAGT GGCCGGCACG
GCGCTGCTGC GGGCCGTCAG TTCCGGCAGC CAGCCCTGGC GGCTTCAGCA GCTCTCCCCC
GCCGCCGGGA TCCTTCCCTC TGCGACGGTG GTCGCCGTCG TCGTCCGGTC GTCCACCTCA
CAGCAGCTGC GGGCGGCAAT GGACACTATC CTCAAGCGGT CGGCGCAGCA GAGCGCCAGC
TATGTGGACA ATGCCGAGCT CCTGGCGCTT GCCGCCCTCC CTTTTGACGG GGCGGCGGCC
GCGCGGGCCG GGCTCGTGGC AGCACTCAGG GAGCTCCCGG TTGAGGCGGG GACCATGACG
GCGGTGGGTC CGCTGGCCGC CGGGATCGAA CACGCTCCCT GGTCGTTGTC GGAGGCGAAA
AGCGCCCTGG ATCTCGCCGT GAACGGCTCA CTTCGGTCAG CGGCCCGGCC GGCGTCGGAC
ACGGAAGGAG TGGTGATCGA CGTCGAAAAT CTGGCGGTGG AGCGCCTGGC GGTGCAGCAT
CTGGACCAGG GCGCGCGGCA GGATTTCGTC CGCCAGCAGG TGGGGCCCCT GCTGGATCAC
GACGCGCGGC GCAACTCCCA GCTGCTCGCA ACCCTGGCAA CCTGGCTGGA CTCTGGCTGC
AACACCGCGC AGGCTGCCCG CGAACTGCAT GTTGAGCGGC AGTCGATGCA TCACCGCATG
CAGCGGATTT TTGAGTTGTG CGGCGGCGAT CCGCGCGGAA CCGGCCGGCT CGCTGCGCTG
CACCTCGCCA CCCGGCTGGC CGCGCTGTAG
 
Protein sequence
MPIKLSEILK HATLTAADPV IRAGAGAVAG TQLRWVHSSE VLDIAPLLSG GELLLTGGDA 
LVTASDARRA EYVRQLSERG VGALAVETGQ RLASLPSSMI QAAETAGLPL IEFRKVVPFV
GIMQAINSML VSESVAHLRR ADEASHAMAV ELAHGSSLDQ ILAVLAGIIG ATLELSSVSG
VTLGNAGPGG ADTRPAAGAG THPGNGTASQ PAEGLESGTG SPAEPGSAAE PGSAAETGIG
GTLISIDVPV RGVPSARLLI NVPADGDVNL ARVAGGRCVD ILSLALLQRM PPGLKEVAGT
ALLRAVSSGS QPWRLQQLSP AAGILPSATV VAVVVRSSTS QQLRAAMDTI LKRSAQQSAS
YVDNAELLAL AALPFDGAAA ARAGLVAALR ELPVEAGTMT AVGPLAAGIE HAPWSLSEAK
SALDLAVNGS LRSAARPASD TEGVVIDVEN LAVERLAVQH LDQGARQDFV RQQVGPLLDH
DARRNSQLLA TLATWLDSGC NTAQAARELH VERQSMHHRM QRIFELCGGD PRGTGRLAAL
HLATRLAAL