Gene Arth_3819 details

Gene Information

Locus tag: Arth_3819
Symbol:
ID: 4447658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4305395
End bp: 4306984
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 68%
IMG OID: 639691643
Product: O-succinylbenzoate-CoA ligase
Protein accession: YP_833294
Protein GI: 116672361
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR01923] O-succinylbenzoate-CoA ligase
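
The coordinates and lengths in this record agree with each other under two common conventions, which are assumed here rather than stated in the record: coordinates are 1-based and inclusive, and the protein length excludes the stop codon. A minimal Python sketch of that arithmetic (variable names are illustrative):

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4305395
end_bp = 4306984

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1590
print(protein_length)  # 529
```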


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAAAATT TTGGCATCGG CTCGTGGCTG CAACGGCGCC GCCCGAAGTC GGGCAACAAG 
ACAGCAATTA TCGCCGGAGA CCGGGAGGTC AGTTACGAGC AACTGGCGGA ACGCTCGGCC
CGGCTTGCCA ATGCCCTCCG TGACCGCGGC GTGGCCCGCG GGGACCGGGT GGCCTACCTG
GGGGAGAACG ATCCGTCCTT CCTGGAGACC CTTTTCGCCT GCGGCCTGGC CGGAGCCGTC
TTCGTCCCAC TGAACACGCG GCTGGCGCCC CCGGAAATTC AGTTCCAGCT CAGGGACTGC
GGCGCCGTGC TGCTGGTGCA CGCGGAGAGC CTGTCGGACC TGGCGGTCCG TGGCGCGGGC
GGCACCGGCG CCGTCCGGCG GATCGCCGTC GACGAAGCCG CCCCGACAGG AAAGCACGAC
GGCGGCGCTG CCGCCGTGCA GGGGGACCAG CCGGCGGAAC GCTACGAAGA CGTGGTGGCG
TCCGGCGCGA ACGTGGCCCC CGACGAGCCG GTGGGCCTGG ACGACGGCGC CATGATCCTC
TACACCTCGG GCACCACCGG TCACCCCAAG GGTGCCCTGC TGACCCACGG GAACATCACG
TGGAACTGCA TCAATGTGAT TGTCGATTTC GACTTCGCTT CCACGGACGT TGCCTTGATG
ATCTCCCCGA TGTTCCATGT GGCGTCGCTG GACATGGGCG TCCTGCCCAC ACTGCTGAAG
GGCGGGACCG TGGTGCTGGA GGCCCGGTTC GATCCGCTGC GGACGCTTCA GCTCATCGAA
CGGCACCGGG CCACCACCAT CAGCGGGGTG CCCACCACCT ACCAGATGCT CTGCGAACAT
CCCGCCTGGG AAACCACGGA CCTGAGCTCC CTGAACAAGC TGACCTGCGG AGGGTCGGCG
GTGCCGCTGC GCGTGCTGGA TGCCTACGAG AAGCGGGGGC TGCACTTTTC GAACGGCTAC
GGGATGACCG AGACGGCGCC GGGTGCCACC ACGCTGCCGG CGGCGCGGTC CCGGGACAAG
GCCGGATCGT CCGGGCTGCC GCACTTCTTT ACGGAGGTCC GGATAGCAGA CCTCGCCAGT
CCCGACACGG AGCCGGCGGC ACCGGGCACG GTGGGTGAGA TCCAGATCAA GGGTCCCAAC
GTCATCCACG AATACTGGAA CCGGCCCGAC TCGACGGCCG ATTCCTACAC CGCGGACGGC
TGGTTCAAGT CCGGCGACAT GGGCTACAAG GACGGCGAGG GCTTCGTGTT CATCTCGGAC
CGGCTCAAGG ACATGATCAT CTCCGGCGGC GAGAACATCT ATCCGGCGGA AGTGGAGCAG
GCCATCACCG AGCTGGAGGC CGTGGGCAGC GTGGCGGTGA TCGGCGTGCC GGACGAAAAG
TGGGGCGAAG TGCCGCGGGC CGTGGTGCTG CTGCGGGAGG GGGCCCAGCT GAGTGAGGAG
CAGCTGCGGG CCCACCTGGA CGGGCGCCTG GCCCGCTACA AGATTCCCAA GTCGGTGGTG
TTCGTGGACG AGATGCCGCG GACGGCGAGC GGCAAGATCA GGAAGGCGGA CCTGCGGAAG
CTGACCCCCG CAAACGGCCA GCTGCAGTAG
 
Protein sequence
MENFGIGSWL QRRRPKSGNK TAIIAGDREV SYEQLAERSA RLANALRDRG VARGDRVAYL 
GENDPSFLET LFACGLAGAV FVPLNTRLAP PEIQFQLRDC GAVLLVHAES LSDLAVRGAG
GTGAVRRIAV DEAAPTGKHD GGAAAVQGDQ PAERYEDVVA SGANVAPDEP VGLDDGAMIL
YTSGTTGHPK GALLTHGNIT WNCINVIVDF DFASTDVALM ISPMFHVASL DMGVLPTLLK
GGTVVLEARF DPLRTLQLIE RHRATTISGV PTTYQMLCEH PAWETTDLSS LNKLTCGGSA
VPLRVLDAYE KRGLHFSNGY GMTETAPGAT TLPAARSRDK AGSSGLPHFF TEVRIADLAS
PDTEPAAPGT VGEIQIKGPN VIHEYWNRPD STADSYTADG WFKSGDMGYK DGEGFVFISD
RLKDMIISGG ENIYPAEVEQ AITELEAVGS VAVIGVPDEK WGEVPRAVVL LREGAQLSEE
QLRAHLDGRL ARYKIPKSVV FVDEMPRTAS GKIRKADLRK LTPANGQLQ
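
As a rough check that the protein sequence follows from the gene sequence under translation table 11, the sketch below translates the first 60 bases of the gene sequence. It assumes the NCBI table 11 amino acid assignments (the same as the standard code for elongation) and that the initial GTG start codon is read as methionine. The names `translate_cds`, `CODON_TABLE`, and `gene_prefix` are illustrative and not part of the record.

```python
# Codon table built from the NCBI-style layout: bases in the order T, C, A, G,
# with one amino acid letter per codon ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for position in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[position:position + 3]]
        if amino_acid == "*":
            break
        if position == 0:
            # Assumption: the first codon is a start codon (here GTG),
            # which is translated as methionine.
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence in this record.
gene_prefix = "GTGGAAAATT TTGGCATCGG CTCGTGGCTG CAACGGCGCC GCCCGAAGTC GGGCAACAAG"
protein_prefix = translate_cds(gene_prefix)
print(protein_prefix)  # MENFGIGSWLQRRRPKSGNK
```

The result matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same function would be the natural next step for checking the whole record.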