Gene Arth_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1107 
Symbol 
ID4446410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1200491 
End bp1202188 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content70% 
IMG OID639688913 
Productpurine catabolism PurC domain-containing protein 
Protein accessionYP_830601 
Protein GI116669668 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism
[T] Signal transduction mechanisms 
COG ID[COG2508] Regulator of polyketide synthase expression 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.166611 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCCCGT TGGCGCCTGA GGCAACCGCA GAGCGGCTCG GCTTCGTCAC GCTGGAGCAG 
TTCCTGCACA AGCTGCCGGC AGAGCTGAAG ATACTCCACG ACGGCGGCAA CGGTGCCGGG
ATGCTCCGCT GGGTGGAACC CAGCGAACTG GAGGACCCCA CGCCCTACCT GCTCGACGGC
GAGTTCATCC TCACCGCCGG GCTGCCCTTC CTCGGCGACG GCGGCAGCGA GGCGAAGGTG
GATGCCTACG TCCGGCGGCT GGTTGGCGCC GGGGTGGGGG CCCTGGGGTT CGGGCTGGAA
CCGTATTTCG CTGCCGTGCC CGAGACCGTG GTGGCGGCCT GCCGGCGCCA CAACCTGACC
CTCGTGGAAT TCCCCAAGAC TGTTCCGTTC GCCGCCATCG GGCTGGAGTT TTCGCAGCTG
CTGGAGTCGG ATAACGCCAG GGTGTTCCGG CAGCTCGCGG ACACCAACCG ACAGCTGATG
CGGGCTGTCC TTTCGGCCAG GCCGGAACAC GAGCTGCTGG CGGCGCTAGT GCAGCGGGTT
CCGGTCTGGG CGCTGCTCGT CGGAGCAGAC GGCCGCATCC GCGCCCGGGC CGCGGGCAGC
AACGGCGCCG GCAGCAACGG CGGGGGCGCC GCCGTCGAAC TTTCCGCCCT GCAGCCGCTG
CTGAAACGGC TGCTGGCCGG CAGCGGCCCC CGCGTGGAAA TGGAGCAGCT GGAGGAGCCG
GGGGCCACCA TGGTGTTCGG CCATCCGCTC CGCAGCACCC GGGACGCCAA CCTCGGCGCC
CTGGTCCTGG GCTCGGAGGG CCCGCTCACC CCCGCGCAGA ACAGTGTGGT GTCCTCCGTC
GTGGGGCTGC TGGAATTGCT CGTGCGGCAG CGGACCAGCG GCTCGCTGGC CCCAAGCCAG
CTTGCGACGG CGCTGCTGCT GCACCCGGAC AGCGCGGCCT CCGGCAGCAC GAAGCACGTT
AACGGCCTCA AAGACCTCCT GGCGCAGAGC ATGTCCTCCA CGCGCTCAGC GCCGCTCCGG
GTTGTCCAGG GCGTGAAGGC GGAGGCCGGC GGCCCCGCCG GCGAAAGCCC TGTGCGGGAA
CTGCTGCAGT GGCGCCGGAT GTTCGACACC AAACTCGTCG AGATCACCGA CTACGGTTTC
GCCGCCATCA CCCGTCTCAA GGTGGACGAC GCATTGCTCG CCGAGACGGA GGCGCTGGGC
TGGCGCCTGG TGATCGGCGA CGCCACGGAG TTCCACGGGC TGTCCTCTGC GTACCGTCGG
GCCACCTCCC TGCGGGCGCG GGTTCAGGCA ACCGGCCGGA GCGCCCGGGT GGACGAGGTG
ACCTGGTCCG TGGCCGGACT GCTCGGCCGC GAAGCCGGGA CCATGCTTGC GGCCCGGCTG
CTGGAACCGG TCCTCAACCA GGAGGATCCC GAGCGGCGGG CCGGACAGCT GACGGTGCTG
CGGACATGGC TGGGCGAGAA CGGGAGCTGG GACGCCACGG CCAAAGCCAT GGGCCTGCAC
CGCAACAGCG TCCGCCGCCA GATCAATGCC CTCGCCGAAC TGTTGGACAC CGACCTCAAC
CAGGCCCAGG TGCGGGCGGA ACTCTGGTTC GCCCTTCAGT ATGTGGACGA TTCTCCTGCC
CCGGCCTCCC CCGGCGCTGC CGGCCTGGCA GACGCAACCA GCCCGGAATC CGGAACACCC
GACTCCCAGT CGCGGTAG
 
Protein sequence
MLPLAPEATA ERLGFVTLEQ FLHKLPAELK ILHDGGNGAG MLRWVEPSEL EDPTPYLLDG 
EFILTAGLPF LGDGGSEAKV DAYVRRLVGA GVGALGFGLE PYFAAVPETV VAACRRHNLT
LVEFPKTVPF AAIGLEFSQL LESDNARVFR QLADTNRQLM RAVLSARPEH ELLAALVQRV
PVWALLVGAD GRIRARAAGS NGAGSNGGGA AVELSALQPL LKRLLAGSGP RVEMEQLEEP
GATMVFGHPL RSTRDANLGA LVLGSEGPLT PAQNSVVSSV VGLLELLVRQ RTSGSLAPSQ
LATALLLHPD SAASGSTKHV NGLKDLLAQS MSSTRSAPLR VVQGVKAEAG GPAGESPVRE
LLQWRRMFDT KLVEITDYGF AAITRLKVDD ALLAETEALG WRLVIGDATE FHGLSSAYRR
ATSLRARVQA TGRSARVDEV TWSVAGLLGR EAGTMLAARL LEPVLNQEDP ERRAGQLTVL
RTWLGENGSW DATAKAMGLH RNSVRRQINA LAELLDTDLN QAQVRAELWF ALQYVDDSPA
PASPGAAGLA DATSPESGTP DSQSR