Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4716 |
Symbol | |
ID | 5673058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5630060 |
End bp | 5631421 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243573 |
Product | DitF protein |
Protein accession | YP_001508989 |
Protein GI | 158316481 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.300284 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTCCCC GGGTACGAGC CGCCCTATGT CGTGGCCCGG GTGGTGCTCG CCGAGCAGCC CGACCTGCAC CTGTTGACCA ATGTGGTCGA CTGCAACGTC GAGGCGGTCA GCACCGGTAT GGAGGTCGAG GTGACCTTCG AACAGCGAGG GGACGTGTTC GTACCCATGT TCCGGCCGGT CGTATGAACT CCGCCGGTCC CGTGAACGTC GAGCGCCAGT CGATCGTCTC CGGCATCGGG CGGTCACCGT CCGGGCGGCG GCTGAACCGG TCCGCAATGG ATCTGACTCT GGACGCCTGT CTGGCGGCCA TCGCTGATGC CGGTCTGACC CCGCGCGACG TCGACGGGCT CACCTCCTGG CCCGACCACG CCGCCCCGCA CGGCTTCGGC GGGCCCCGGG TCGGTGAGCT GCACACGCTC CTGCGCCTGG ACCTGTCGTG GATCCTGGGC TGCGGGGACG GCGCCAACGT CATCGGCATC CTCGGGATCG CCGCCCACGC GGTGGCCACA GGCCTCGCCC GGCATGTGCT GGTCTACCGG ACCGTCGGCG AGGCGACCAG CCAGGGCACC GGCCGCCGCC CGGCGGTGAT GGCCGACCCC GGGGCCGCGC CGTGGAAGGC CACCTACGGG GTCGGTTCGC CGGTGCAGTT CGCGGCCCTG TGGGCCCAGC ACCATTTCGA TCGTTACGGC ACCACCCGGG AGCAGCTCGG CTGGGTCGCG GTGAACGACC GGCGTAACGC CGCCGGTAAC CCGGACGCGA TCTACCGCGA CCCGATGACG ATCGACGACT ACCTCGCGGG GCGGATGATC AGCGAACCGC TGTGTCTGTT CGACTGCGAC GTGCCGGCGG ACGGGTCGAT CGCCTTCGTT GTCTCGCGCG CCGACCACCG CCGTGACGTC GACCGGCCCG TCTTCTTCGA AGCCCTGGGT GGTGGGCGGC CGATGACGTC GAGCTGGGAG TTCTGGCCGG ACCTTGACGT CATGGCCGCG ATGAAGGCCG CCGAGCAGCT GTGGTCGCGT ACCTCGCTGC GGCCCGGCGA CGTCGACGTC GCCGGTCTCT ACGACGGCTT CAGCATCTTC GTCCTGTACT GGCTGGAGGC GCTCGGGTTC TGCGGCCGCG GCGAGTCCGG GCCGTTCGTC GAAGGCGGCA CCCGCATCGC CCGTGACGGT GAGCTGCCAC TCAACACCTC CGGCGGCCAA CTGTCGGAGG GCCGCTACCT CGGCTTCGGT CTGGCCTACG AGACCTTCCT GCAGTTACGG AACCAGGCCG GCACCCGGCA GGTCACCGAC GCCGAGGTCG GCCTCGTCAC GGGCGGCGGC GGCCCGCTCG CCCAGGCATT CCTCTTCACC AACGACCGCT GA
|
Protein sequence | MGPRVRAALC RGPGGARRAA RPAPVDQCGR LQRRGGQHRY GGRGDLRTAR GRVRTHVPAG RMNSAGPVNV ERQSIVSGIG RSPSGRRLNR SAMDLTLDAC LAAIADAGLT PRDVDGLTSW PDHAAPHGFG GPRVGELHTL LRLDLSWILG CGDGANVIGI LGIAAHAVAT GLARHVLVYR TVGEATSQGT GRRPAVMADP GAAPWKATYG VGSPVQFAAL WAQHHFDRYG TTREQLGWVA VNDRRNAAGN PDAIYRDPMT IDDYLAGRMI SEPLCLFDCD VPADGSIAFV VSRADHRRDV DRPVFFEALG GGRPMTSSWE FWPDLDVMAA MKAAEQLWSR TSLRPGDVDV AGLYDGFSIF VLYWLEALGF CGRGESGPFV EGGTRIARDG ELPLNTSGGQ LSEGRYLGFG LAYETFLQLR NQAGTRQVTD AEVGLVTGGG GPLAQAFLFT NDR
|
| |