Gene Information |
Locus tag | Franean1_4521 |
Symbol | |
ID | 5672870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5393603 |
End bp | 5394772 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243386 |
Product | L-carnitine dehydratase/bile acid-inducible protein F |
Protein accession | YP_001508802 |
Protein GI | 158316294 |
COG category | [C] Energy production and conversion |
COG ID | [COG1804] Predicted acyl-CoA transferases/carnitine dehydratase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.461003 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGC CGGTTGGCTC CGGCCCGCTC GCAGGGACGC GCGTCCTTGA GATCGCGGGC CGGGGCCCCG GCCCCTTCGC GGGCATGCTC CTGGCGGACA TGGGAGCGGA GATCCTGCGC ATCGACCGGC CGGATCCCCC CGCCCGTCCG CAGGCGCCCG ACCACCGCCT GGAGCTGTAC AACCGCGGAC GACGCTCCGT GGTCGTCGAT CTCAAGCACG GAGCCGGCCC CGAGGTGGTG CTGCGTCTGG TCGAACGGGC GGACGCGATC TACGAGGGGT ACCGGCCCGG CGTCGCCGAA CGTCTCGGGA TCGGCCCGGA CGCGTGCCTG CAGCGAAACC CACGGATCGT CTACGGGCGC GGCACGGGCT GGGGGCAGAC CGGGCCCCTG GCGGACAAGG CCGGCCACGA CATCAACTAC ATCGCGCTCG CCGGGGCGCT CGACCCCCTC GGCTACGCGG GCGGGCCGCC CGCCATCCCG CTCAACCTGA TCGGTGACTA CGCGGGCGGA GGCATGATGC TCGCCTTCGG CATCGTGTGC GCGCTGCTGG AGGCGCGCGG CTCGGGCCGG GGCCAGGTGG TCGACGCGGC GATGGTGGAC GGGGCGTCGC TGCTCATGAC CCTGTACCAC GGACGTCGGA TGATGGGCAC GTGGAGCGAC GAACGCGGCA CGAACTACGT GGACTCCGGC GCGCCGTTCT ACAACGTCTA CGAGACGAGC GACCACCGGT ACGTCGCGAT CGGCGCGATC GAGCCGAAGT TCCAGCGGGT GCTGTTCGAC GCCATCGGCG TGGACCTCGA CGGCATCACC ATGCCGGTGT CGACGATCGA CCGTACCGAC TGGGCCGAAC TGCGGCGCCG GCTCGCGGCG GTCTTCGTGA CCCGCACCCG CGACGAGTGG ACCAAGCTGC TCGCCGACGT CGACGGCTGC TACTCGCCCG TGCTGTCCCT CGGGGAGGTG TCCGCACATC CCCATCACCA GGCGCGTGCC GGCTTCCCGG AACTCGACGG AATCCCCCAT CCCGCGCCGG CGCCCCGGTT CGGGCGGACC CCGGCGGCCA TCCGGACGGC CCCGCCCAGG CCCGGGGAGC ACACCCGGGA AGCCCTGGCG GACTGGGGCT TCACCGACGA CGAACTGCAG ACGCTGCACC GGGACGGCGC AATCCGGTAG
|
Protein sequence | MSLPVGSGPL AGTRVLEIAG RGPGPFAGML LADMGAEILR IDRPDPPARP QAPDHRLELY NRGRRSVVVD LKHGAGPEVV LRLVERADAI YEGYRPGVAE RLGIGPDACL QRNPRIVYGR GTGWGQTGPL ADKAGHDINY IALAGALDPL GYAGGPPAIP LNLIGDYAGG GMMLAFGIVC ALLEARGSGR GQVVDAAMVD GASLLMTLYH GRRMMGTWSD ERGTNYVDSG APFYNVYETS DHRYVAIGAI EPKFQRVLFD AIGVDLDGIT MPVSTIDRTD WAELRRRLAA VFVTRTRDEW TKLLADVDGC YSPVLSLGEV SAHPHHQARA GFPELDGIPH PAPAPRFGRT PAAIRTAPPR PGEHTREALA DWGFTDDELQ TLHRDGAIR
|
| |
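The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence above ends in TAG). The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
START_BP = 5393603
END_BP = 5394772
GENE_LENGTH_BP = 1170
PROTEIN_LENGTH_AA = 389


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon.

    Assumes an unspliced CDS whose length includes the stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5394772 - 5393603 + 1 gives 1170 bp, and 1170 / 3 - 1 gives 389 aa, matching the Gene Length and Protein Length fields.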