Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4572 |
Symbol | |
ID | 5672919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5454842 |
End bp | 5455981 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243435 |
Product | L-carnitine dehydratase/bile acid-inducible protein F |
Protein accession | YP_001508851 |
Protein GI | 158316343 |
COG category | [C] Energy production and conversion |
COG ID | [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.417277 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAGG CGCTGGGCGA CGTCACGGTC GTGTGCCTCA GCGCGCTCGG GCCGGTCCCG TTCGCGACCA TGCTGCTCGC CGACCTGGGC GCCCGGGTGA TCCGGATCGA CCGGGCCGAC CGGCCCGGTG GCGTCACCGG CCTGCGGCTC GAGGACGATC CCCGTACCCG GGGGCAGCGC GGCATCGGGG TCGACGTCCG GCATCCGTCG GGCCGGTCCG TGGTGCTGCG GCTGGTCGAG ACGGCGGACG TGTTCCTGGA GGGAATGCGC CCCGGTGTCG CCGAGCGGCT CGGGCTCGGC CCGGCCGAGC TGCTGGCCGT CAACCCCCGC CTGGTGTACG GGCGGGCGAC GGGGTGGGGG CAGTCGGGCC CGCGCGCTCA ACAGGCCGGG CACGACATCA ATTACGCCGG GCTCGCCGGC GGCCTGTACC CGACCGGACC GGCCGAGCTG CCGCCGCTGC CGCCGCTCAA TCTGCTGGCC GACTTCGCCG GCGGCGGTTC CTACCTCGCC CTCGGCGTGC TCGCGGCCCT GCACCACCGC ACGGGGACCG GACGCGGCCA GGTGGTCGAC GCGGCCATGG TCGACGGCGT CGCCAACCTC ACGGCGATGA TGCACGGGAT GCTCGCGGCC GGCCTGTGGA GCGATCGCCG CGGCGACAAC CTGCTCGACG GCGGCGCCCC GTTCTACCGC ACCTACCGCA CCGCCGACGA CGGCTTCGTC GCCGTGGGCG CGCTGGAGCC GCAGTTCTAC CGGCTGCTGC TGGAGAACCT CGGGCTCGAC CCCGCGCGGT GGCCGCAGCA CGACCGGTCG ACCTGGCCCG AGCAGGAGCG CGTCCTGGGG GATCTGTTCG CCGCGCGCAC CCGGGACGAG TGGACGAAGC TGTTCGACGG CGTCGACGCC TGTGTGACGC CCGTGCTGAG CCTGGCGGAG GCGGCGGCCT CGGCCGAGCT GCGCGAGCGC GCGACCTTCG TCGAGTGGGA CGGCGTCGCG CAGCCGGCGC CGGCACCCCG GCTGTCGGCG TCCCCGGCCG TCGAACGTCC CCGGTCGGGC TGGTGCAGCC ATTCGGCCGA GATTCTGACC GAGCTCGGGC TGACCGAGAC GGAGCGGGCG GCGCTGCGCG ACGCGGGTGT GATCGCTTAG
|
Protein sequence | MTQALGDVTV VCLSALGPVP FATMLLADLG ARVIRIDRAD RPGGVTGLRL EDDPRTRGQR GIGVDVRHPS GRSVVLRLVE TADVFLEGMR PGVAERLGLG PAELLAVNPR LVYGRATGWG QSGPRAQQAG HDINYAGLAG GLYPTGPAEL PPLPPLNLLA DFAGGGSYLA LGVLAALHHR TGTGRGQVVD AAMVDGVANL TAMMHGMLAA GLWSDRRGDN LLDGGAPFYR TYRTADDGFV AVGALEPQFY RLLLENLGLD PARWPQHDRS TWPEQERVLG DLFAARTRDE WTKLFDGVDA CVTPVLSLAE AAASAELRER ATFVEWDGVA QPAPAPRLSA SPAVERPRSG WCSHSAEILT ELGLTETERA ALRDAGVIA
|
| |