Gene Arth_4026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4026 
Symbol 
ID4447827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4544804 
End bp4546387 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content68% 
IMG OID639691857 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_833501 
Protein GI116672568 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGTAA ACAAGTTCAA CCTCCCCGAT GTGGGCGAAG GGCTGACCGA GGCCGAGATC 
GTTTCGTGGA ACGTCAAGCC CGGTGACAGC GTCGCGATCA ATGACATCCT GTGCGAAATC
GAAACGGCAA AGTCCCTCGT GGAACTCCCT TCGCCGTTTG CGGGGACGGT CACCGAACTG
CTCGTTCCCG TGGGGGTCAC GGTCGACGTC GGCACTCCGA TCATCAGCGT GAGCGACGCC
GTGTCCGGCG ACCCCACGCC TGCGGATGCG CCTGTTCCGG TGGCCCCCGC AGCTGCAGCA
CAGACGCCCG CAGCTCCCAC CGCGAACGCC CCGATGTACG GGAAACTCTT CGAGGATCAC
GACGAGCAAG ACAGCGGGAC GCCAGGCGCG CAGGCTGTTC CCGCCTCCGG CAAGGCCGTA
GGTCCCCTGG TGGGCTCGGG TCCCAAGGCC GACGCCGTCA AACGCCGCCC GCGCAAGGCG
TCGCCGTCCG CGGCGGTACT GCCGGCACCG GCTGCGGCCA CCTCGCCGGT GGTTGAGCCC
GCCCCTTTGG CCGAGCCGGT CGAAACCCAA GGCATCTGGA TCAACGCCGG CGCAGCTTCC
GCCTCCGGCA CCCCCGACAC CGCGGGGGTT CCCGAAGAAG CTTCCGGGCG TCCCACCCTC
GGCGGTACCA TCAGCGGCCT GGTCAACCGG GTCCTGGCGA AGCCTCCGGT GCGCAAGATT
GCCCGCGACC TCGGGATCGA CCTCGCAGAC GTTGTCGCCA CGGGCGCGCG TGGCGAAGTC
ACCCGGGAGG ACCTGGTGAG CTACCAGGCC CAGCGCGACG CCGAGGTGGA CAAGGCGGAC
ACCTTCTGGG GCAAGTCCGG CCGGCCCCAG GACCAGCGGA TCGAGCGGAT TCCCGTCAAG
GGTGTCCGCA AGGCCACGGC CAAGGCGATG GTGGAGTCCG CGTTCGCCGC ACCGCACGTG
AGCATCTTCG TGGACGTCGA CGCCAGCCGC ACCATGGAGT TCGTCAAGCG GCTCAAGGTG
TCGCGTGACT TCGAAGGCAT CAAGGTCTCC CCGCTGTTGA TCCTGGCCAA GGCAGTGATC
TGGGCCGCCG CCCGCAACCC CAGCGTCAAC GCCACGTGGG TGGACAGCGC CGACGGCAGC
GACACGGCCG AGATCCACGT CAAGCACTTC ATGAACCTGG GCATCGCAGC GGCAACCCCG
CGCGGCCTGA TGGTGCCGAA CATCAAGAAC GCACAGGACC TATCGCTCAA GGAACTGGCC
CTGGCACTCA ACGACCTGGC CACCACTGCC CGCGCCGGCA AGACCCGGCC CGCGGAGATG
CAGGGCGGCA CCCTGACGGT CACCAACATC GGTGCCCTGG GCATCGACAC CGGCACCCCG
ATCATCAATC CCGGCGAGGT GGCCATCGTG GCGTTTGGCA CCATCAAGCA GAAGCCGTGG
GTCCTGGACG GCGAAGTCAT CCCGCGTTGG ATCACCACCC TGGGCGGTTC ATTCGACCAC
CGGGTGGTGG ACGGGGACCT CTCGGCGCGC TTCATGGCGG ACGTTGCGGC AATCCTTGAG
GAGCCGGCCC TCCTGCTGGA CTAG
 
Protein sequence
MTVNKFNLPD VGEGLTEAEI VSWNVKPGDS VAINDILCEI ETAKSLVELP SPFAGTVTEL 
LVPVGVTVDV GTPIISVSDA VSGDPTPADA PVPVAPAAAA QTPAAPTANA PMYGKLFEDH
DEQDSGTPGA QAVPASGKAV GPLVGSGPKA DAVKRRPRKA SPSAAVLPAP AAATSPVVEP
APLAEPVETQ GIWINAGAAS ASGTPDTAGV PEEASGRPTL GGTISGLVNR VLAKPPVRKI
ARDLGIDLAD VVATGARGEV TREDLVSYQA QRDAEVDKAD TFWGKSGRPQ DQRIERIPVK
GVRKATAKAM VESAFAAPHV SIFVDVDASR TMEFVKRLKV SRDFEGIKVS PLLILAKAVI
WAAARNPSVN ATWVDSADGS DTAEIHVKHF MNLGIAAATP RGLMVPNIKN AQDLSLKELA
LALNDLATTA RAGKTRPAEM QGGTLTVTNI GALGIDTGTP IINPGEVAIV AFGTIKQKPW
VLDGEVIPRW ITTLGGSFDH RVVDGDLSAR FMADVAAILE EPALLLD