Gene Information |
Locus tag | Franean1_4743 |
Symbol | |
ID | 5673085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5664812 |
End bp | 5665822 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243600 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_001509016 |
Protein GI | 158316508 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
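
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch makes explicit. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5664812, 5665822)
residues = protein_length_aa(length)
print(length, residues)
```

With the values in this record, the computed lengths agree with the Gene Length and Protein Length fields.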
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGGC CGAACATCTA TCTGCAGGAC GTCACCCTGC GGGACGGCAT GCACGCCATC CGGCACCGGG TCGACCCGGA GCGGGTCGGC GCCATCGTGG CCGCTCTCGA CAAGGCGGGC GTCCGGGCGA TCGAGGTCAC CCACGGCGAC GGGCTGGCCG GCTCCAGCCT GACGTACGGC CCCGGCAGCC ACACGAACTG GGAGTGGATC GAGGCGGCGG TCACCAACGC CTCGCAGGCG ACGATCACCA CGCTGCTGCT ACCGGGCGTG GGCACGATCG CCGAGCTGCG CCGCGCGCAC GCGATGGGGG TCGGCTCGGT CCGCGTCGCG ACGCACTGCA CCGAGGCGGA CGTCGCCGCC CAGCACATCG CCGCGGCGCG CGAGCTCGGC ATGGACGTCT CCGGCTTCCT GATGATGAGC CACATGGCCG AGCCGGCCGA GCTGGCCGCC CAGGCCAAGC TGATGGAGTC GTACGGGGCG CACTGTGTCT ACGTCACCGA CTCGGGCGGC CGGCTGACCA CCGACCGCGT GCGCGAGCGG GTCCGTGCCT ACCGCGACGT CCTGCGCCCG GACACCCAGA TCGGCATCCA CGCGCACGAG AACCTCTCGC TGTCGGTCGC CAACTCGTTC GCCGCGGTCG AGGAGGGCGC CTACCGCGTC GACGCGTCGC TCGCCGGCCA GGGAGCCGGC GCCGGCAACT GCCCGATCGA GCCGTTCGTC GCGGTCGCGC TGCTGCTGGG CTGGGATCTC GACTGCGACC TGCTCGCGCT GGAGGACGCG GCCGAGGACC TGGTCCGGCC GTTGCAGGAC CGTCCGGTCC GGGTCGACCG CGAGACGCTC ACGCTCGGCT TCGCGGGCGT GTACTCCAGT TTCCTCCGGC ACGCCGAGAT CGCCGCCGAG ACCTACGGCG TGGACGCCCG CAGCATCCTG ATCGAGGCGG GCCGGCGAAA GCTGGTCGGC GGCCAGGAGG ACATGCTCGT CGACATCGCC CTGGCGATAC AGCCCAAGTA G
|
Protein sequence | MSGPNIYLQD VTLRDGMHAI RHRVDPERVG AIVAALDKAG VRAIEVTHGD GLAGSSLTYG PGSHTNWEWI EAAVTNASQA TITTLLLPGV GTIAELRRAH AMGVGSVRVA THCTEADVAA QHIAAARELG MDVSGFLMMS HMAEPAELAA QAKLMESYGA HCVYVTDSGG RLTTDRVRER VRAYRDVLRP DTQIGIHAHE NLSLSVANSF AAVEEGAYRV DASLAGQGAG AGNCPIEPFV AVALLLGWDL DCDLLALEDA AEDLVRPLQD RPVRVDRETL TLGFAGVYSS FLRHAEIAAE TYGVDARSIL IEAGRRKLVG GQEDMLVDIA LAIQPK
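
The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The sketch below shows one possible way to remove the spacing, compute a GC fraction, and check for a terminal stop codon. The helper names are hypothetical, and the stop-codon set (TAA, TAG, TGA) is the one used by translation table 11; the short input is only the first two blocks and the last two blocks of the gene sequence, not the whole gene.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(displayed: str) -> str:
    """Remove the whitespace used for 10-character display blocks and uppercase."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First two display blocks of the gene sequence above.
head = clean_sequence("ATGAGCGGGC CGAACATCTA")
# Last two display blocks of the gene sequence above.
tail = clean_sequence("AGCCCAAGTA G")

print(head, gc_fraction(head))
print(tail, ends_with_stop(tail))
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value that can be compared with the GC content field of the record.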
|
| |