Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3414 |
Symbol | |
ID | 4444144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3841065 |
End bp | 3843833 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639691238 |
Product | 2-oxoacid dehydrogenase subunit E1 |
Protein accession | YP_832889 |
Protein GI | 116671956 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCAG GAGAAGATAC CTCCCATATC CTCAGCGGGT TGACTAACCA GCTGCCTGAT CGTGATCCGG AAGAGACCGC CGAATGGATT GAGTCCCTGG ATACGCTGAT CAGGGAACAG GGCACCGAGC GTGCCCAGTA CATCATGCGC AGTCTCCTGC AGCGTGCCGG CGCCCAGAGC GTCGGGGTTC CGATGGTCAC CACCACGGAC TATGTGAACA CCATTCCCGC GGACCAGGAA GCACCGTTCC CGGGCAACGA GGAATACGAG CGCCGCTACC GGGCGTACAT GCGCTGGAAC GCCGCGGTCA TGGTGCACCG GTCCCAGCGC CCGAACATCG GGGTCGGCGG GCACATCTCC ACCTACGCCG GGGCCGCGAC CCTGTACGAG GTCGGGTTCA ACCACTTCTT CCGCGGCAAG GACCACCCCG GCGGCGGGGA CCAGGTCTTC TTCCAGGGCC ACGCCTCCCC GGGCATGTAC GCCCGGGCGT TCATGGAAGG ACGCCTGACC GAGGAGGACC TGGACGGGTT CCGGCAGGAA AAGTCCAAGG CCGGCCACGC CCTGTCCTCC TACCCGCACC CGCGGCTGAT GCCCGGGTTC TGGGAATTCC CCACCGTGTC CATGGGCATC GGGCCGATGA ACGCGATCTA CCAGGCCCAG TCCAACCGGT ACCTGCACAA CCGCGGCCTG AAAGACACCT CCGACCAGCA GGTCTGGGCG TTCCTGGGCG ACGGGGAAAT GGACGAGCCC GAGTCCCGCG GCCTGCTCCA GCTCGCCGCG AACGAGAACC TGGACAACCT GAACTTCGTG ATCAACTGCA ACCTCCAGCG CCTGGACGGG CCGGTGCGCG GCAACGGGAA GATCATGCAG GAACTCGAAG CGTTCTTCCG CGGCGCGGGC TGGAACGTCA TCAAGGTCGT CTGGGGCCGG GAATGGGATG ACCTCCTGGC CAAGGACAAC GACGGGTCCC TGGTGAAGAT CATGAACGAG ACCCCGGACG GGGACTACCA GACCTACAAG GCCGAATCCG GCGGGTTCGT CCGCGAACAC TTCTTCGGGA AGACCCCGCA GACCAAGGAC ATGGTCGCGG ACCTGAGCGA TGACCAGATC TGGAACCTCA AGCGCGGCGG CCACGACTAC CGCAAGGTCT ACGCCGCGTA CAAGGCAGCC ACCGAATTCA AGGGCAAACC CACCGTCATC CTGGCCAAAA CGGTCAAGGG CTACGGCCTC GGCCCGCACT TCGAAGGCCG CAACGCCACC CACCAGATGA AGAAACTCAC CCTCGACGAC CTCAAGGAAT TCAGGGACTA CCTAAGGATC CCCATTTCCG ACGCCCGGCT GGAGGAGGAC CCGTACAGCC CGCCGTACTT CCACCCCGGC GCCGAGGCTC CGGAGATCGC CTACCTCCTC GAGCGCCGCG CGGCACTCGG CGGCTACACG CCCGAGCGCC GCCCCGACCA CAAGGCCATT GAACTGCCCG AAGCCAAGAC CTTCGACGTC GCCAAGCGCG GCACCGGCAA GCAGCAGGCC GCCACCACCA TGGCCTTCGT CAGGCTCCTG AAGGACCTGC TCCGCGACAA GAAGTTCGGG CACCGGATTG TTCCGATCGT GCCGGACGAA TCCCGCACGT TCGGCATGGA CGCGTTCTTC CCCACGGCCA AGATCTACAA CCCGGGCGGC CAGAACTACC TCTCCGTGGA CCGGGACCTG GTCCTGGCCT ACAAGGAATC CGCCCAGGGC CAGCTGATCC ACCCCGGCAT CAACGAAGCC GGCGCCGTCG CAGCCTTCAC CGCCGCCGGC ACCGCCTACG CCACCCACGG CGTCCCGCTG ATCCCGGTCT ACGTGTTCTA CTCCATGTTC GGCTTCCAGC GCACCGGCGA CGCCTTCTGG GCCGCCGCGG ACCAAATGAC CCGCGGCTTC ATCATCGGCG CCACCGCAGG CCGGACCACC CTCACCGGCG AAGGACTCCA GCACGCCGAC GGCCACTCCC CCATCCTCGC CGCCACCAAC CCGGCCGTCG TCACCTACGA CCCCGCCTAC GGCTACGAAA TGGGCCACAT CATCCGCGAC GGCATCGAGC GGATGTACGG ACCGGGGGCC GCAAGCGGCG AAGGAACCGG AGCTGCATCC GATAAAAACC TGATGTACTA CCTCACCGTC TACAACGAAC CCATCACCCA GCCCAACGAG CCTGAGAACC TCGACGTCAA CGGTGTACTG AAAGGCATCT ACCGGGTATC GGCGTCGGGT GCGGAAGGTC CCAAGAGCCA GATCCTGGCC TCGGGCGTTT CGGTGCCCTG GGCGCTGGAA GCCCAGCGGA TCCTGGCCGA GGACTGGGGT GTCTCCGCGG ACGTCTGGTC CGTCACGTCA TGGAATGAAC TCCGCCGCGA CGGCCTCGCC GCCGAGGAGG AAGCCTTCCT GAACCCCGGC GAACCGGCCC GGATTCCGTT TGTGGCCCAG CAATTGGCTG ATGCGCAAGG CCCGGTCGTT GCGGTCTCGG ACTACATGAA GGCCGTCCCG GACCAGATCC GCCAGTTCCT CCCGAACCAG TTCGCCTCGC TCGGCGCGGA CGGCTTCGGC TTCTCCGACA CCCGCGCCGC AGCGCGCCGC TTCTTCAAGA ACGACACCCA CTCCATCGTG GTGAAGACGC TGCAAATGCT CGCGGCGAGG GGCGACGTGG AGGAGGGGGC GCCGTCGTAC GCCATGGACC GCTACAAACT CCTGGACGTG AACGCCGGAA CCACCGGAGG AGCCGGCGGC GACGCCTGA
|
Protein sequence | MAAGEDTSHI LSGLTNQLPD RDPEETAEWI ESLDTLIREQ GTERAQYIMR SLLQRAGAQS VGVPMVTTTD YVNTIPADQE APFPGNEEYE RRYRAYMRWN AAVMVHRSQR PNIGVGGHIS TYAGAATLYE VGFNHFFRGK DHPGGGDQVF FQGHASPGMY ARAFMEGRLT EEDLDGFRQE KSKAGHALSS YPHPRLMPGF WEFPTVSMGI GPMNAIYQAQ SNRYLHNRGL KDTSDQQVWA FLGDGEMDEP ESRGLLQLAA NENLDNLNFV INCNLQRLDG PVRGNGKIMQ ELEAFFRGAG WNVIKVVWGR EWDDLLAKDN DGSLVKIMNE TPDGDYQTYK AESGGFVREH FFGKTPQTKD MVADLSDDQI WNLKRGGHDY RKVYAAYKAA TEFKGKPTVI LAKTVKGYGL GPHFEGRNAT HQMKKLTLDD LKEFRDYLRI PISDARLEED PYSPPYFHPG AEAPEIAYLL ERRAALGGYT PERRPDHKAI ELPEAKTFDV AKRGTGKQQA ATTMAFVRLL KDLLRDKKFG HRIVPIVPDE SRTFGMDAFF PTAKIYNPGG QNYLSVDRDL VLAYKESAQG QLIHPGINEA GAVAAFTAAG TAYATHGVPL IPVYVFYSMF GFQRTGDAFW AAADQMTRGF IIGATAGRTT LTGEGLQHAD GHSPILAATN PAVVTYDPAY GYEMGHIIRD GIERMYGPGA ASGEGTGAAS DKNLMYYLTV YNEPITQPNE PENLDVNGVL KGIYRVSASG AEGPKSQILA SGVSVPWALE AQRILAEDWG VSADVWSVTS WNELRRDGLA AEEEAFLNPG EPARIPFVAQ QLADAQGPVV AVSDYMKAVP DQIRQFLPNQ FASLGADGFG FSDTRAAARR FFKNDTHSIV VKTLQMLAAR GDVEEGAPSY AMDRYKLLDV NAGTTGGAGG DA
|
| |