Gene Amir_5003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_5003 
Symbol 
ID8329201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp5961381 
End bp5962493 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content75% 
IMG OID644945440 
Productpyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit 
Protein accessionYP_003102672 
Protein GI256379012 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03181] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.25658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCACAG CCACACCGGA CCGGACCCTG CTCCCCACCG AGGAACCGCT GGCGCTGCTG 
CGTCCCGACG GGTCCGCGGT CGAGGGCTCA CCGCTTCGGA TGCCCGACGA CGAGGTGCTG
CTGGAGCTGC ACCGCCGCAT GGTCGTCGGC CGCCGCTTCG ACACCCAGGC CACCGCGCTC
ACCCGCCAGG GCCGCCTCGC CGTCTACCCG TCCTCGCGCG GCCAGGAGGC GTGCCAGGTC
GGCGCGGTCC TGGCGATGCG CGAGCGCGAC TGGCTGTTCC CCACCTACCG CGACAGCGTC
GCCCTGGTCA CCAGGGGTGT GCCCGCCGCG GGCGCGCTGA CCCTGCTGCG CGGCGACTGG
CACCTCGGCT ACGACCCGCG CGAGCACCGC GTCGGACCGC AGTGCACGCC GCTGGCGACC
AACACCCCGC ACGCCGTCGG CTTCGCGCAC GCCGCCCGCT ACAAGGGCGA GGACACCGCC
GCGCTGGTGC TGCTCGGCGA CGGCGCGACC AGCGAGGGCG ACACGCACGA GGCGCTGAAC
TTCGCCGGGG TGTGGAAGGC GCCGGTGGTG TTCCTGGTGC AGAACAACGG CTACGCGATC
AGCGTGCCGC TGAGCAAGCA GACCGCCGCG CCCACGTTGG CGCACAAGGG GATCGGGTAC
GGCATCCGGT CGGTCCTGGT GGACGGCAAC GACGCGGCGG CGGTCCACGC GGTGGTGTCG
GAGGCGCTGG CGTCCGGTGA GCCGGTGCTC GTGGAAGCGC TTACCTACCG CATCGAGGCG
CACACCAACG CCGACGACGC GTCCCGCTAC CGGGACTCCG CCGAGGTCGC GCACTGGCTG
GCCCGCGACC CCGTCGACCG GCTCGCCTCG CACCTGGCCT CGCGCGGGCT GCTCGACCCG
GCGCGCCGAG ACTCGGTGGA CGCCGAGGCG GAGGAGTTCG CGGCGGCGCT GCGGGCCGAG
CTGAACGCGG ACGCGCGCGT GGACCCGGCG GACCTGTTCC GGCACGTGTA CGCCGAGCCG
ACCGCGCAGC TGCGCGAGCA GGCCGCGATG CTGGCGCGCG AACTGGACGC CGAGCACTCG
GGCGCCGACG ACCTGGACGG GGGACGGGCA TGA
 
Protein sequence
MATATPDRTL LPTEEPLALL RPDGSAVEGS PLRMPDDEVL LELHRRMVVG RRFDTQATAL 
TRQGRLAVYP SSRGQEACQV GAVLAMRERD WLFPTYRDSV ALVTRGVPAA GALTLLRGDW
HLGYDPREHR VGPQCTPLAT NTPHAVGFAH AARYKGEDTA ALVLLGDGAT SEGDTHEALN
FAGVWKAPVV FLVQNNGYAI SVPLSKQTAA PTLAHKGIGY GIRSVLVDGN DAAAVHAVVS
EALASGEPVL VEALTYRIEA HTNADDASRY RDSAEVAHWL ARDPVDRLAS HLASRGLLDP
ARRDSVDAEA EEFAAALRAE LNADARVDPA DLFRHVYAEP TAQLREQAAM LARELDAEHS
GADDLDGGRA