Gene M446_5900 details


Gene Information

Locus tag: M446_5900
Symbol:
ID: 6133632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 6487890
End bp: 6489329
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 74%
IMG OID: 641646007
Product: pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase
Protein accession: YP_001772619
Protein GI: 170743964
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form
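
The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 6487890
END_BP = 6489329
GENE_LENGTH_BP = 1440
PROTEIN_LENGTH_AA = 479


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```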


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.539647
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATCA ACGTGCTGAT GCCCGCCCTC TCGCCCACCA TGGAGAAGGG CAACCTCGCC 
AAGTGGCTGA AGAAGGAGGG CGACCCGGTC AAGTCCGGCG ACGTGCTGGC CGAGATCGAG
ACCGACAAGG CGACCATGGA GGTCGAGGCG GTCGACGAGG GCGTGCTCGC CAGGATCGTG
GTGCCCGAGG GCACCGCGGA CGTGCCGGTG AACGATCTCA TCGCCGTGAT CGCGGCGGAG
GGCGAGGACC CGGCGCGCGT CGGGGCCGGG GAGGGGGCGG CGCAAGGGGC GGCGAAGGGG
GCGGCGCCCC CGCCGCGGGA TGAGGACCGC ACGGAGGGCG GGGCGAGCCT CGCCTACGCG
CGCGTGAACG AGGCGCCGGA CGCCGCCAAG GCCGCGGCGA ACGGGGCCGC GGCGAAGCCG
AACGGCGCGG CGCCCGGCGG CGCCCCGCAG GGCGCCGCGC CGGCGGGGGG GCGGATCCTC
GCCTCGCCGC TCGCGCGCCG CATCGCCAAG CAGGAGGGGA TCGACCTCTC GCGCGTCCGC
GGCTCCGGCC CGCACGGCAG GGTGATCGAG CGCGACGTGC GCGCGGCCCT CAAGGAGGGC
CCGGCGCCGG CCGCCCCGGC GGGCGCCCCG GCGGCCGCGC CCGGCGGGGC GACCCCGCCC
GCCGCCAAAC CCGCGGCCGG CGCCCCGGCG GCCTCCGGCC TGACCGGCGA CCAGGTCAAG
GCGATGTTCG AGAGGGGGAG CTACGAGGAG GTCCCCCTCG ACGGGATGCG CAGGACCATC
GCCAAGCGCC TCGTCGAGTC GAAGCAGACC GTCCCGCACT TCTACCTCTC GCTCGATTGC
GAACTCGACG CGCTGCTGGC GCTGCGCGAG CAGGTCAATG CGGGCGCCGG CAAGGACCGG
GACGGCAAGC CGCTGTTCAA GCTCTCGGTG AACGACTTCG TCATCAAGGC GCTGGCGCTG
GCCCTGCAGC GGGTGCCGAA CGCCAACGCG GTCTGGGCCG AGGACCGGAT CCTGAAGTTC
CGCCACTCCG ACGTCGGCGT CGCGGTGGCG GTGGACGGGG GCCTGTTCAC GCCGGTGATC
CGCAGGGCCG AGCAGAAGAC ACTCTCGACC CTCTCGGCCG AGATGAAGGA CCTCGCCGGC
CGCGCCCGCA GCCGCAAGCT GAAGCCCGAG GAGTACCAGG GCGGCGCGAC GGCGGTGTCG
AATCTCGGCA TGTACGGCAT CAAGGAATTC GGCGCGGTCA TCAACCCGCC CCACGGCACG
ATCCTGGCGG TGGGGGCGGG CGAAGCCCGG GTCGTGGCCA GGAACGGCGC GCCGGCGGTG
GTGCAGGCCA TGACCGTGAC GCTCTCCTGC GACCACCGGG TGGTGGACGG GGCGCTCGGC
GCCGAATTGC TCGCCGCCTT CAAGAGCCTG ATCGAGAACC CGATGGGGAT GCTGGTGTGA
 
Protein sequence
MPINVLMPAL SPTMEKGNLA KWLKKEGDPV KSGDVLAEIE TDKATMEVEA VDEGVLARIV 
VPEGTADVPV NDLIAVIAAE GEDPARVGAG EGAAQGAAKG AAPPPRDEDR TEGGASLAYA
RVNEAPDAAK AAANGAAAKP NGAAPGGAPQ GAAPAGGRIL ASPLARRIAK QEGIDLSRVR
GSGPHGRVIE RDVRAALKEG PAPAAPAGAP AAAPGGATPP AAKPAAGAPA ASGLTGDQVK
AMFERGSYEE VPLDGMRRTI AKRLVESKQT VPHFYLSLDC ELDALLALRE QVNAGAGKDR
DGKPLFKLSV NDFVIKALAL ALQRVPNANA VWAEDRILKF RHSDVGVAVA VDGGLFTPVI
RRAEQKTLST LSAEMKDLAG RARSRKLKPE EYQGGATAVS NLGMYGIKEF GAVINPPHGT
ILAVGAGEAR VVARNGAPAV VQAMTVTLSC DHRVVDGALG AELLAAFKSL IENPMGMLV