Gene Arth_3414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3414 
Symbol 
ID4444144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3841065 
End bp3843833 
Gene Length2769 bp 
Protein Length922 aa 
Translation table11 
GC content66% 
IMG OID639691238 
Product2-oxoacid dehydrogenase subunit E1 
Protein accessionYP_832889 
Protein GI116671956 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCAG GAGAAGATAC CTCCCATATC CTCAGCGGGT TGACTAACCA GCTGCCTGAT 
CGTGATCCGG AAGAGACCGC CGAATGGATT GAGTCCCTGG ATACGCTGAT CAGGGAACAG
GGCACCGAGC GTGCCCAGTA CATCATGCGC AGTCTCCTGC AGCGTGCCGG CGCCCAGAGC
GTCGGGGTTC CGATGGTCAC CACCACGGAC TATGTGAACA CCATTCCCGC GGACCAGGAA
GCACCGTTCC CGGGCAACGA GGAATACGAG CGCCGCTACC GGGCGTACAT GCGCTGGAAC
GCCGCGGTCA TGGTGCACCG GTCCCAGCGC CCGAACATCG GGGTCGGCGG GCACATCTCC
ACCTACGCCG GGGCCGCGAC CCTGTACGAG GTCGGGTTCA ACCACTTCTT CCGCGGCAAG
GACCACCCCG GCGGCGGGGA CCAGGTCTTC TTCCAGGGCC ACGCCTCCCC GGGCATGTAC
GCCCGGGCGT TCATGGAAGG ACGCCTGACC GAGGAGGACC TGGACGGGTT CCGGCAGGAA
AAGTCCAAGG CCGGCCACGC CCTGTCCTCC TACCCGCACC CGCGGCTGAT GCCCGGGTTC
TGGGAATTCC CCACCGTGTC CATGGGCATC GGGCCGATGA ACGCGATCTA CCAGGCCCAG
TCCAACCGGT ACCTGCACAA CCGCGGCCTG AAAGACACCT CCGACCAGCA GGTCTGGGCG
TTCCTGGGCG ACGGGGAAAT GGACGAGCCC GAGTCCCGCG GCCTGCTCCA GCTCGCCGCG
AACGAGAACC TGGACAACCT GAACTTCGTG ATCAACTGCA ACCTCCAGCG CCTGGACGGG
CCGGTGCGCG GCAACGGGAA GATCATGCAG GAACTCGAAG CGTTCTTCCG CGGCGCGGGC
TGGAACGTCA TCAAGGTCGT CTGGGGCCGG GAATGGGATG ACCTCCTGGC CAAGGACAAC
GACGGGTCCC TGGTGAAGAT CATGAACGAG ACCCCGGACG GGGACTACCA GACCTACAAG
GCCGAATCCG GCGGGTTCGT CCGCGAACAC TTCTTCGGGA AGACCCCGCA GACCAAGGAC
ATGGTCGCGG ACCTGAGCGA TGACCAGATC TGGAACCTCA AGCGCGGCGG CCACGACTAC
CGCAAGGTCT ACGCCGCGTA CAAGGCAGCC ACCGAATTCA AGGGCAAACC CACCGTCATC
CTGGCCAAAA CGGTCAAGGG CTACGGCCTC GGCCCGCACT TCGAAGGCCG CAACGCCACC
CACCAGATGA AGAAACTCAC CCTCGACGAC CTCAAGGAAT TCAGGGACTA CCTAAGGATC
CCCATTTCCG ACGCCCGGCT GGAGGAGGAC CCGTACAGCC CGCCGTACTT CCACCCCGGC
GCCGAGGCTC CGGAGATCGC CTACCTCCTC GAGCGCCGCG CGGCACTCGG CGGCTACACG
CCCGAGCGCC GCCCCGACCA CAAGGCCATT GAACTGCCCG AAGCCAAGAC CTTCGACGTC
GCCAAGCGCG GCACCGGCAA GCAGCAGGCC GCCACCACCA TGGCCTTCGT CAGGCTCCTG
AAGGACCTGC TCCGCGACAA GAAGTTCGGG CACCGGATTG TTCCGATCGT GCCGGACGAA
TCCCGCACGT TCGGCATGGA CGCGTTCTTC CCCACGGCCA AGATCTACAA CCCGGGCGGC
CAGAACTACC TCTCCGTGGA CCGGGACCTG GTCCTGGCCT ACAAGGAATC CGCCCAGGGC
CAGCTGATCC ACCCCGGCAT CAACGAAGCC GGCGCCGTCG CAGCCTTCAC CGCCGCCGGC
ACCGCCTACG CCACCCACGG CGTCCCGCTG ATCCCGGTCT ACGTGTTCTA CTCCATGTTC
GGCTTCCAGC GCACCGGCGA CGCCTTCTGG GCCGCCGCGG ACCAAATGAC CCGCGGCTTC
ATCATCGGCG CCACCGCAGG CCGGACCACC CTCACCGGCG AAGGACTCCA GCACGCCGAC
GGCCACTCCC CCATCCTCGC CGCCACCAAC CCGGCCGTCG TCACCTACGA CCCCGCCTAC
GGCTACGAAA TGGGCCACAT CATCCGCGAC GGCATCGAGC GGATGTACGG ACCGGGGGCC
GCAAGCGGCG AAGGAACCGG AGCTGCATCC GATAAAAACC TGATGTACTA CCTCACCGTC
TACAACGAAC CCATCACCCA GCCCAACGAG CCTGAGAACC TCGACGTCAA CGGTGTACTG
AAAGGCATCT ACCGGGTATC GGCGTCGGGT GCGGAAGGTC CCAAGAGCCA GATCCTGGCC
TCGGGCGTTT CGGTGCCCTG GGCGCTGGAA GCCCAGCGGA TCCTGGCCGA GGACTGGGGT
GTCTCCGCGG ACGTCTGGTC CGTCACGTCA TGGAATGAAC TCCGCCGCGA CGGCCTCGCC
GCCGAGGAGG AAGCCTTCCT GAACCCCGGC GAACCGGCCC GGATTCCGTT TGTGGCCCAG
CAATTGGCTG ATGCGCAAGG CCCGGTCGTT GCGGTCTCGG ACTACATGAA GGCCGTCCCG
GACCAGATCC GCCAGTTCCT CCCGAACCAG TTCGCCTCGC TCGGCGCGGA CGGCTTCGGC
TTCTCCGACA CCCGCGCCGC AGCGCGCCGC TTCTTCAAGA ACGACACCCA CTCCATCGTG
GTGAAGACGC TGCAAATGCT CGCGGCGAGG GGCGACGTGG AGGAGGGGGC GCCGTCGTAC
GCCATGGACC GCTACAAACT CCTGGACGTG AACGCCGGAA CCACCGGAGG AGCCGGCGGC
GACGCCTGA
 
Protein sequence
MAAGEDTSHI LSGLTNQLPD RDPEETAEWI ESLDTLIREQ GTERAQYIMR SLLQRAGAQS 
VGVPMVTTTD YVNTIPADQE APFPGNEEYE RRYRAYMRWN AAVMVHRSQR PNIGVGGHIS
TYAGAATLYE VGFNHFFRGK DHPGGGDQVF FQGHASPGMY ARAFMEGRLT EEDLDGFRQE
KSKAGHALSS YPHPRLMPGF WEFPTVSMGI GPMNAIYQAQ SNRYLHNRGL KDTSDQQVWA
FLGDGEMDEP ESRGLLQLAA NENLDNLNFV INCNLQRLDG PVRGNGKIMQ ELEAFFRGAG
WNVIKVVWGR EWDDLLAKDN DGSLVKIMNE TPDGDYQTYK AESGGFVREH FFGKTPQTKD
MVADLSDDQI WNLKRGGHDY RKVYAAYKAA TEFKGKPTVI LAKTVKGYGL GPHFEGRNAT
HQMKKLTLDD LKEFRDYLRI PISDARLEED PYSPPYFHPG AEAPEIAYLL ERRAALGGYT
PERRPDHKAI ELPEAKTFDV AKRGTGKQQA ATTMAFVRLL KDLLRDKKFG HRIVPIVPDE
SRTFGMDAFF PTAKIYNPGG QNYLSVDRDL VLAYKESAQG QLIHPGINEA GAVAAFTAAG
TAYATHGVPL IPVYVFYSMF GFQRTGDAFW AAADQMTRGF IIGATAGRTT LTGEGLQHAD
GHSPILAATN PAVVTYDPAY GYEMGHIIRD GIERMYGPGA ASGEGTGAAS DKNLMYYLTV
YNEPITQPNE PENLDVNGVL KGIYRVSASG AEGPKSQILA SGVSVPWALE AQRILAEDWG
VSADVWSVTS WNELRRDGLA AEEEAFLNPG EPARIPFVAQ QLADAQGPVV AVSDYMKAVP
DQIRQFLPNQ FASLGADGFG FSDTRAAARR FFKNDTHSIV VKTLQMLAAR GDVEEGAPSY
AMDRYKLLDV NAGTTGGAGG DA