Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3310 |
Symbol | |
ID | 5671682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3921927 |
End bp | 3923108 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242199 |
Product | hypothetical protein |
Protein accession | YP_001507619 |
Protein GI | 158315111 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.408434 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCTG ACGTCATGAA CATCGAGCGC CGGGCGATCG TCTCCGGCAT CGGGCGGTCC CAGTCCGGCC GGAGGTTGAA CCGGCCCGCG ATCGATCTGA CCCTGGACGC CTGTCTCGCG GCCATCGCCG ACGCGGGTCT CACCCCGCGC GACATCGACG GGCTCACCTC CTGGCCCGAC CACCCCGCCC CGCACGGCTT CGGCGGCCCC CGGGTCGGCG ACCTGCACAC CCTCCTGCGG CTGGACCTGT CGTGGATCCT CGGCTGCGGC GACGGCGCCA ACGTCATCGG CATCCTCGGA ATCGCCGCCC ACGCGGTGGC CACGGGCCTC GCCCGGCATG TGCTGGTCTA CCGGACCGTC GGGGAGGCGA CAAGCCAGGG GACCGGCCGC CGTCCGGCCG TGATGGCCGG ACCCGGGGCC GGGGCCGCGC CGTGGAAGAC GACCTACGGG GTGGGTTCGC CCGTCCAGTT CGCCGCGCTG TGGGCCCAGC ACCACTTCGA CCGCTACGGC ACCACCCGGG AGCAGCTCGG CTGGGTCGCC GTGAACGACC GGCGCAACGC CGCCGACAAC CCGGACGCGA TCTACCGCGA CCCGATGACG ATCGACGACT ACCTCGCGGC ACGGATGATC AGCGAACCGC TGTGCCTGTT CGACTGCGAC ATCCCGGCGG ACGGGTCGAT CGCCTTCGTC GTCTCGCACG CCGACCACCG GCGCGACGTC GACCGGCCCG TCTTCTTCGA GGCCCTCGGC GGCGGGCGGC CGATGACCTC GAGCTGGGAG TTCTGGCCGG ACTTCGACAT CATGGCCGCG ACGAAAGCCG CCGAGCAGCT CTGGTCGCGT ACCTCGCTGC GGCCCGCCGA CGTCGACGTC GCGGGTCTCT ACGACGGGTT CAGCATCTTC GTCCTGCTGT GGCTCGAGGC ACTCGGCTTC TGCGGCCGCG GCGAGGCGGG CCCGTTCGTG GAAGGTGGCA CGCGCATCGC CCGCACCGGT GATCTGCCGC TCAACACCTC CGGCGGCCAA CTGTCAGAGG GCCGCTACCT CGGCTTCGGC CTGGCCTACG AGACCTTCCT GCAGTTACGG AACCAGGCCG GCACCCGGCA GGTCACCGAC GCCGAGGTCG GCCTCGTCAC AGGCGGCGGC GGCCCGCTCG CCCAGGCGTT CCTGTTCACC GGTGACCGCT GA
|
Protein sequence | MSADVMNIER RAIVSGIGRS QSGRRLNRPA IDLTLDACLA AIADAGLTPR DIDGLTSWPD HPAPHGFGGP RVGDLHTLLR LDLSWILGCG DGANVIGILG IAAHAVATGL ARHVLVYRTV GEATSQGTGR RPAVMAGPGA GAAPWKTTYG VGSPVQFAAL WAQHHFDRYG TTREQLGWVA VNDRRNAADN PDAIYRDPMT IDDYLAARMI SEPLCLFDCD IPADGSIAFV VSHADHRRDV DRPVFFEALG GGRPMTSSWE FWPDFDIMAA TKAAEQLWSR TSLRPADVDV AGLYDGFSIF VLLWLEALGF CGRGEAGPFV EGGTRIARTG DLPLNTSGGQ LSEGRYLGFG LAYETFLQLR NQAGTRQVTD AEVGLVTGGG GPLAQAFLFT GDR
|
| |