Gene Information |
Locus tag | Franean1_4534 |
Symbol | |
ID | 5672883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5410246 |
End bp | 5411211 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641243399 |
Product | Citryl-CoA lyase |
Protein accession | YP_001508815 |
Protein GI | 158316307 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2301] Citrate lyase beta subunit |
TIGRFAM ID | |
| |
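The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5410246
end_bp = 5411211

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that does not code for a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 966
print(protein_length_aa)  # 321
```

Both results agree with the Gene Length (966 bp) and Protein Length (321 aa) fields.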
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.668521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTGCTC ATGCTCGGTC CGCTCGTGCT CGCCGGTCGT GTCTGGCTGT TCCGGCTTCG AACGTGAAGA TGTTGGGGAA GGCGCAGGGT CTTCCTGCCG ACCAGATCTT CTGTGATCTG GAGGACTCGG TTGCCCCGGG GGCGAAGGAG TCGGCGCGGG GGAACGTCGT GTCGGTGTTG AACGAGGGTG ACTGGGCGGG TAAGACCCGG GTTGTGCGGG TCAATGACCT GACGACGAAG TGGACCTATC GTGATGTCGT GACGGTGGTC GAGGGTGCGG GGGCGAACCT GGACTGTGTG ATGCTGCCGA AGGTGCAGAC CGCCGCGCAG GTGCAGTGGC TGGATCTGTT GCTGACGCAG ATCGAGGAGG TGATGGGTTT CGAGGTCGGG CGGATCGGTA TCGAGGCGCA GATCGAGAAC GCGCTGGGTC TGTCGAACGT GAAGGAGATC GCGTTCGCCA GCCCGCGGAT CGAGACGATC ATCTTTGGTC CGGCGGATTT CATGGCGTCG ATGAACATGC CGTCGCTGGT GGTGGGTGCG CTGAACCCGG ATTATCCGGG GGACCCGTTC CACTATGTGC TGTTCAAGAT TTTGGAGGCG GCGCGGGCGC GGGGGGTGCA GGCGATCGAC GGGCCGTTCC TGCAGATCCG GGATGTGGAG GCGTTCCGTG GGGTGGCGAA GAAGTCCGCG GCGTTGGGTT ATGACGGTAA GTGGGTGCTG CATCCGGGGC AGATCGATGC CGCGAACGAG GTGTACGCCC CCCGGCAGGA GGATTACGAC CACGCTGAGC TGATCCTGGA CGCCTATGCC TGGCACACCT CGGACGAGGG TGGGCTGCGG GGCGCGGTGA TGCTCGGGGA CGAGATGATC GACGAGGCGA GCCGGAAGAT GGCCGAGGTG ATCGCGGGCA AGGGGCGTGC CGCGGGTATG GCGCGTACCG CGGCCTTCGA GCCGCCGGCC GGCTGA
|
Protein sequence | MSAHARSARA RRSCLAVPAS NVKMLGKAQG LPADQIFCDL EDSVAPGAKE SARGNVVSVL NEGDWAGKTR VVRVNDLTTK WTYRDVVTVV EGAGANLDCV MLPKVQTAAQ VQWLDLLLTQ IEEVMGFEVG RIGIEAQIEN ALGLSNVKEI AFASPRIETI IFGPADFMAS MNMPSLVVGA LNPDYPGDPF HYVLFKILEA ARARGVQAID GPFLQIRDVE AFRGVAKKSA ALGYDGKWVL HPGQIDAANE VYAPRQEDYD HAELILDAYA WHTSDEGGLR GAVMLGDEMI DEASRKMAEV IAGKGRAAGM ARTAAFEPPA G
|
| |
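The relationship between the gene sequence, the GC content field, and the protein sequence can be explored with a short script. This is a minimal sketch, not the database's own pipeline. It assumes that the gene sequence is already given in coding orientation (it begins with GTG and ends with TGA even though the gene lies on the minus strand), that the blanks between 10-base groups are display formatting only, and that a leading ATG, GTG, or TTG is read as methionine, which is a simplification of the start codons allowed by translation table 11. The function names `clean`, `gc_fraction`, and `translate_cds` are hypothetical helpers.

```python
from itertools import product

# Amino acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}

# Simplification: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [s[i:i + 3] for i in range(0, len(s), 3)]
    # An alternative start codon such as GTG still initiates with methionine.
    first = "M" if codons[0] in START_CODONS else CODON_TABLE[codons[0]]
    protein = [first]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
head = "GTGTCTGCTC ATGCTCGGTC CGCTCGTGCT"
print(translate_cds(head))  # MSAHARSARA
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate_cds` allows the same comparison against the GC content and protein sequence fields.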