Gene NATL1_04561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04561 
SymbolpdhC 
ID4779657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp417220 
End bp418590 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content43% 
IMG OID640083733 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_001014285 
Protein GI124025169 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01349] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.852087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACTC ATGACATTTT TATGCCTGCT TTAAGTTCAA CGATGACTGA GGGCAAAATA 
GTTGAGTGGC TAAAAAAACC TGGAGATAAA GTTGAAAGAG GTGAGTCAGT TTTAGTTGTT
GAGTCAGATA AAGCTGATAT GGATGTTGAG TCTTTCCAAG ATGGCTTTCT TGCTTCGATT
GTGATGCCTG CAGGCAGCTC TGCACCTGTT GGAGAGACAA TCGGATTGAT CGTTGAGACA
GAAGATGAGA TCGCTGCGGC TCAAGCAAAT TCGCCTTCTC CTTCCCCTCA ATCAGGAAGT
CAAGAAAAAG ATAGTTCATC GCCTCAAGTT CAAGAAAAAC AAGCCTCTGT CGACTCTCCT
AAAGCAACAG TAGTTACAAA GGCATCTCCT GCACCTCTTG TTTCAGAATC CTCTGTAAAT
CAGGATCAGT TTTTAAATGA TGGTCGAATA GTCGCTTCCC CAAGAGCTAA AAAACTTGCT
TCTCAAATGG GAGTGGATTT GGCAACTGTT AGGGGCTCAG GACCTCATGG ACGAATACAA
GCTGAGGATG TTCAAAGTGC AAAAGGGCAA CCCATAAGCG TTCCTTGGAT TGCAGAAAGT
AATGCTCCGG CGAAAATAGT TTCTGATGTG CCTCGCGTAG AAAAAAAATC TGTTGACGCT
GGTAAGCCAC CTGCTCCAGG GAAAAGTTTT GGATCTAGAG GGGAAACAAT TGCATTTAAT
ACTCTTCAAC AAGCTGTAAA TCGGAACATG GAGGAAAGTT TAAATACTCC TTGTTTCAGA
GTCGGATATT CAATTCTTAC TGATGAATTG GATGATCTTT ATAAACAAGT TAAACCTGAT
GGAGTAACTA TGACTGCTTT ACTTGCTAAA GCAGTTGGCT TAACGCTGGC TAGACACCCC
CAGGTGAATG CAGCTTTTAG TTCTGAGGGG ATTGCCTATC CTTCACAAAT AAATGTAGCC
GTTGCAGTTG CGATGGAGGA CGGAGGGTTG ATAACTCCAG TGCTGCAAAA TGCTGATAAG
ACGAGCCTTA CTGATTTATC CCTACAATGG GCTGATCTTG TTAAGCGAGC TAGGAATAAG
CAATTAGAAC CGCAAGAATA TAGCAGTGGA ACGTTTACAC TCTCGAATCT AGGTATGTTT
GGAGTGGATC GTTTTGATGC AATTCTGCCC CCAGGGACTG GAGCAATTTT AGCGGTAGGA
GCTTCATTGT CTAAAGTTGT TGCTTCTAAA GATGGTTCGA TTTCAATCAA AAAACAAATG
CAAGTAAATC TTACCGCTGA TCACAGAGTG ATATATGGGG CTGATGGAGC ACTATTCCTC
AAGGATTTGG CATACTTAAT TGAAAAGAAC CCTTATAGCC TCTCGTCTTG A
 
Protein sequence
MATHDIFMPA LSSTMTEGKI VEWLKKPGDK VERGESVLVV ESDKADMDVE SFQDGFLASI 
VMPAGSSAPV GETIGLIVET EDEIAAAQAN SPSPSPQSGS QEKDSSSPQV QEKQASVDSP
KATVVTKASP APLVSESSVN QDQFLNDGRI VASPRAKKLA SQMGVDLATV RGSGPHGRIQ
AEDVQSAKGQ PISVPWIAES NAPAKIVSDV PRVEKKSVDA GKPPAPGKSF GSRGETIAFN
TLQQAVNRNM EESLNTPCFR VGYSILTDEL DDLYKQVKPD GVTMTALLAK AVGLTLARHP
QVNAAFSSEG IAYPSQINVA VAVAMEDGGL ITPVLQNADK TSLTDLSLQW ADLVKRARNK
QLEPQEYSSG TFTLSNLGMF GVDRFDAILP PGTGAILAVG ASLSKVVASK DGSISIKKQM
QVNLTADHRV IYGADGALFL KDLAYLIEKN PYSLSS