Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2499 |
Symbol | |
ID | 5670895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2975412 |
End bp | 2976488 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241416 |
Product | pyruvate dehydrogenase (acetyl-transferring) |
Protein accession | YP_001506837 |
Protein GI | 158314329 |
COG category | [C] Energy production and conversion |
COG ID | [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.556081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTTGA TGGCGGAACG CAGTGAGAAC GGTGCGCCGT TCGACACGGG TGTTCGGCTG CTTGCGCCGG ACGGCACTCT CGTCGACGAC CCGCGTTTCA CTGTGCGTGC CGACTCCCGG CAGACGGAGT CGTTCTACCG GGAGATGGTG CGGGCGCGCC GCCTCGACGA GGAGGCCACG GCGCTGCAAC GCCAGGGCGA ACTGGTCCTG TGGATTCCGC TGCGGGGCCA GGAGGCCGCC CAGGTGGGCT CGGCGGCTGC CGCCGAGCCG GCGGACTTCC TCTTCCCCAG CTATCGCGAG CATGCCGTGG TCTGGCACCG CGGCATCCCC CCGGTCGAGG CGCTGCGCCT GCTGCGGGGC GTCACGCACG GCGGCTGGGA TCCCGAGGCG TACAACGTGG CCAACTACGT GCTCGTGCTG GCGTCGCAGA CCCTGCACGC GGTGGGCTAC GGCCTGGGCG TCCGCCTCGA CGGCGCAGCC GGCCAGGTGG TGATGGTCTA CCTCGGTGAC GGCGCGATGA GCCAGGGCGA CGCGAACGAG GCGTTCGTCT GGGCCGCGAG CTTCGGCGCG CCGGTGGTGT TCTTCTGCCA GAACAACCAG TGGGCGATCT CCACACCCAG CGCGCGCCAG TCACCCGTGC CGCTGGCACG CCGCGCGGCG GGCTTCGGCT TTCCGGGCGT GCGGGTGGAC GGCAACGACG TGCTGGCGGT GCATGCGGTG ACCACCTGGG CCCTGGAGCA CGCCCGCTCG GGTCAGGGCC CCGTCCTGAT CGAGGCGAAC ACCTACCGGA TGGCGCCGCA CACGACGTCC GACGACGCCA GCAGGTACCA GGAGGCCGCC GAGGTCGCGG CGTGGCGGGC GCGCGACCCG ATCGACCGGG TCGCGCTGCT GCTCGGCCAC ACGCACGACC CGGCCTGGTT CGAGGGTGTG CGGGCGGAGG CGGAGGAAGC CGCGGCCACG CTGCGCCGGG AGTGCCTGGC CCTGCCCGAC CCGGCCCCGC GGACCCTGGT GGATCATGTG CTGGTGGGTG GCTCCGCGCT ACTTCGGGAG CAGCGGGGTG AGATCTGGGA GGGTTGA
|
Protein sequence | MTLMAERSEN GAPFDTGVRL LAPDGTLVDD PRFTVRADSR QTESFYREMV RARRLDEEAT ALQRQGELVL WIPLRGQEAA QVGSAAAAEP ADFLFPSYRE HAVVWHRGIP PVEALRLLRG VTHGGWDPEA YNVANYVLVL ASQTLHAVGY GLGVRLDGAA GQVVMVYLGD GAMSQGDANE AFVWAASFGA PVVFFCQNNQ WAISTPSARQ SPVPLARRAA GFGFPGVRVD GNDVLAVHAV TTWALEHARS GQGPVLIEAN TYRMAPHTTS DDASRYQEAA EVAAWRARDP IDRVALLLGH THDPAWFEGV RAEAEEAAAT LRRECLALPD PAPRTLVDHV LVGGSALLRE QRGEIWEG
|
| |