Gene Noca_4509 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4509 
Symbol 
ID4597028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4768856 
End bp4770052 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content72% 
IMG OID639779120 
Productpyruvate dehydrogenase (acetyl-transferring) 
Protein accessionYP_925693 
Protein GI119718728 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03181] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.751763 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCCAGC AGGCAGCGGG AGGCAACCCC GCACACACGG CTGGCTTCGG CCCCGACCTC 
GCCGAGGTCT TCGGGCCCGC CCAGCAGGAC GGCGGACCCG AGCTGGTCCA GCTGCTCACC
CCCGAGGGAG AGCGGGTCGA GCACCCCGAC TTCTCCTTCG ATCTCGGCGA CGACACCATC
CTCGGCTTCT ACCGCGACAT GGTGCTGACC CGGCGCATCG ACACCGAGGC GACCGCGCTG
CAGCGGCACG GCGAGCTCGG CATCTGGGCC CAGCTGCTCG GCCAGGAGGC CGCGCAGATC
GGCGCCGGCC GCGCGCTGCG CCCGCAGGAC TTCGTGTTCC CGACCTACCG CGAGCACGGC
GTCGCCTGGT GCCGCGGCAT CGACCCGCTG GATCTGCTCG GCCTGTTCCG CGGCGTCGAC
CAGGGCTCGT GGGACCCGAA GGACAAGAAC TTCGGCCTTT ACACGATCGT GATCGGCGCC
CAGACCCTGC ACGCCACCGG CTACGCCATG GGCATGCAGC GCGACGGCGT CGTCGGCACC
GGCGACCCCG ACCGCGACGC CGCGGTGATC GCGCACTTCG GCGACGGCGC GTCCTCGCAG
GGCGACGTCA ACGAGTCGTT CGTCTTCGCC GCCTCCTACA ACGCGCCGGT GGTGTTCTTC
TGCCAGAACA ACCAGTGGGC GATCTCCGAG CCGTTCGAGC GGCAGAGTCG GATCCCGCTC
TACCAGCGGG CGCTCGGCTT CGGCTTCCCC GGCGTGCGCG TCGACGGCAA CGACGTGCTC
GCGACGTACG CCGTCACCCA GGCGGCGCTC GACCGGGCCC GCGACGGCCA GGGCCCGACG
TTCGTGGAGG CGTACACCTA CCGGATGGGC GCGCACACCA CGACCGACGA CCCGACCCGC
TACCGGCTCT CCGACGACCT GGAGCGCTGG AAGCTCAAGG ACCCGATCGC CCGCGTCGAG
GCCTACCTGC GCCGCAACGG CATCGCCGAC GACGCGTTCT TCGCCGGCGT CCAGGAGGAG
GCCGCCGACC TCGGCGTCCG ACTGCGCGAG GGCTGCCGGG CGCTGCCCGA CCCCTCGCCG
CTGAGCATCT TCGACCACGT CTACACAGAG CTCACCGAGG AGCTCGAGCA GCAGCGGGAG
GCGTTCGCCG CCTACCTTGC CAGCTTCGAT GGCGCGGACT TCGAGGGGGC GCACTGA
 
Protein sequence
MTQQAAGGNP AHTAGFGPDL AEVFGPAQQD GGPELVQLLT PEGERVEHPD FSFDLGDDTI 
LGFYRDMVLT RRIDTEATAL QRHGELGIWA QLLGQEAAQI GAGRALRPQD FVFPTYREHG
VAWCRGIDPL DLLGLFRGVD QGSWDPKDKN FGLYTIVIGA QTLHATGYAM GMQRDGVVGT
GDPDRDAAVI AHFGDGASSQ GDVNESFVFA ASYNAPVVFF CQNNQWAISE PFERQSRIPL
YQRALGFGFP GVRVDGNDVL ATYAVTQAAL DRARDGQGPT FVEAYTYRMG AHTTTDDPTR
YRLSDDLERW KLKDPIARVE AYLRRNGIAD DAFFAGVQEE AADLGVRLRE GCRALPDPSP
LSIFDHVYTE LTEELEQQRE AFAAYLASFD GADFEGAH