Gene Arth_1896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1896 
Symbol 
ID4445585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2134434 
End bp2136023 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content62% 
IMG OID639689708 
Productbenzoylformate decarboxylase 
Protein accessionYP_831380 
Protein GI116670447 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0968821 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCG TTCACGCCGC AGCGTACGAG CTTCTTCGAA GCAACAGGCT GACCACAATT 
TTCGGCAACC CGGGATCCAA CGAACTCCCC TTCCTTGACG CAATGCCCGC CGATTTCCGG
TACATCCTGG GCCTGCATGA AGGGGTCGTG GTGGGCATGG CGGACGGCTT CGCCCAGGCA
TCCGGTCAGG CCGCCTTTGT TAACCTGCAC GCCGCTTCAG GGACGGGCAA TGCGATGGGC
GCCCTAACGA ACGCGTGGTA TTCGCATACT CCGCTGGTGA TCACGGCTGG TCAGCAGGTC
CGTCCGATGA TCGGGCTCGA GGCGATGCTG TCCAATGTCG ACGCCGCCTC ACTGCCACGG
CCGCTGGTGA AATGGAGTGC CGAACCGGCA CAGGCCCCGG ATGTCCCCCG GGCACTGAGC
CAAGCAATCC ACACTGCCAC GAGCGATCCC AAGGGACCCG TATACCTCTC GATCCCTTAT
GACGACTGGA ATCAGGACAC TGGGAACCTG AGTGAACATC TGAGCTCACG ATCTGTAAGC
AGGGCAGGCA ACCCCTCCGC AGAACAGCTG GACGACATCT TGTCTGCCCT GAGGGAAGCA
GCGAACCCTG CCCTGGTTTT CGGACCTGAT GTCGATGCGG CCCGGGCAAA TCACCATGCC
GTTCGCTTGG CGGAAAAGCT GGCCGCGCCC GTGTGGATTG CGCCGTCAGC GCCACGCTGC
CCCTTTCCGA CGCGCCATCC GAACTTCCGC GGCGTTCTGC CGGCCTCCAT CGCCGGAATC
TCCGCGCTAC TGAACGGCCA CGATCTGATC GTGGTCATCG GCGCCCCCGT GTTCCGCTAC
CACCAGTACC AGCCCGGCAG CTACCTGCCG GAAAATAGCC GGCTAATCCA CATCACCTGT
GACGCCGGCG AGGCAGCACG GGCACCGATG GGAGATGCCC TGGTTGCAGA CATCGGTCAG
ACGCTGCGGG CGCTGGCCGA CATAATTCCC CAATCGAAAC GGCCACCCCT TAGGCCGCGT
GTCATTCCGC CGGTCCCGGA CTCACAGGAT GATCTCCTGG CACCGGACGC TGTCTTTGAG
GTGATGAACG AGGTGGCGCC CGAGGACGTT GTCTACGTCA ACGAGTCGAC TTCCACCGTC
ACGGCCCTGT GGGAACGGGT GGAACTAAAG CATCCCGGAA GCTACTATTT CCCTGCATCG
GGCGGCCTTG GCTTTGGAAT GCCCGCAGCG GTCGGCGTAC AACTCGCGAA CGATCGGCGG
CGGGTCATCG CCGTCATCGG CGACGGCTCG GCAAACTACG GCATCACCGC TTTGTGGACG
GCCGCCCAAG AAAAAATCCC TGTTGTTTTC ATCATCCTGA ACAACGGCAC CTACGGGGCG
CTGCGCGCCT TCGCCAAACT CCTCAACGCG GAAAACGCCG CAGGCCTTGA CGTCCCGGGT
ATCTGCTTCT GCGCCATCGC CGAAGGCTAT GGCGTCGAGG CCCACAGGAT TACGAGCCTC
GAAAACTTCA AAGACAAGCT TTCCGCAGCC CTGCAGTCAG ACACCCCCAC TCTCCTGGAA
GTACCCACTT CCACCACCAG CCCCTTCTAA
 
Protein sequence
MTTVHAAAYE LLRSNRLTTI FGNPGSNELP FLDAMPADFR YILGLHEGVV VGMADGFAQA 
SGQAAFVNLH AASGTGNAMG ALTNAWYSHT PLVITAGQQV RPMIGLEAML SNVDAASLPR
PLVKWSAEPA QAPDVPRALS QAIHTATSDP KGPVYLSIPY DDWNQDTGNL SEHLSSRSVS
RAGNPSAEQL DDILSALREA ANPALVFGPD VDAARANHHA VRLAEKLAAP VWIAPSAPRC
PFPTRHPNFR GVLPASIAGI SALLNGHDLI VVIGAPVFRY HQYQPGSYLP ENSRLIHITC
DAGEAARAPM GDALVADIGQ TLRALADIIP QSKRPPLRPR VIPPVPDSQD DLLAPDAVFE
VMNEVAPEDV VYVNESTSTV TALWERVELK HPGSYYFPAS GGLGFGMPAA VGVQLANDRR
RVIAVIGDGS ANYGITALWT AAQEKIPVVF IILNNGTYGA LRAFAKLLNA ENAAGLDVPG
ICFCAIAEGY GVEAHRITSL ENFKDKLSAA LQSDTPTLLE VPTSTTSPF