Gene Information |
Locus tag | Franean1_4213 |
Symbol | |
ID | 5672568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5018124 |
End bp | 5019374 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641243086 |
Product | 4-phytase |
Protein accession | YP_001508503 |
Protein GI | 158315995 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
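The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(5018124, 5019374)
print(length, protein_length_aa(length))  # 1251 416
```

Under these assumptions the result agrees with the Gene Length (1251 bp) and Protein Length (416 aa) fields.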
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.798604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.63897 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCGAGT CCTTCGCGCT GAAGCTCCGT CCGGACCTGG CTTTCTCCGA CGGCACCCCA TTCGACGCCG CGGCGGTGAA GTTCAACTGG GACCGGCTCA AGGACCCGGC CAGCGCCTCG CCCAGCGCGA CGGAAGCGGC GATGGTCGCC TCGGCCGAGG TAGTCGACGA CGTCACCCTG AAAGTCACGA TGACCACTCC CGTGACCGCG TACACGCAGG CGATCGTCGG CTCGGCGATG AACTGGATCG CCTCACCGGC GGCCCTGCAG AAGGGTCAGC AGGCCTTCGA CGAGAGCCCT GTCGGCGCCG GCCCCTTCAC TCTGCAGAGC TGGACCAGGC AGGCCGAGAT CAGGCTCGTC AAGAACCCCC GCTACTGGGA CGCACCCAAG CCCTACCTCG CCGGCATCAC GATGCGCGCG GTGCTCGACG CCGACCAGCG TTACAACACC CTGATCAGTG ACGGCGCCGA TGTTGCCGTC GAGACGAACT GGATCAACCT GGCCAAGGCC GAGAAGGCGG GTCTGCCGAC CGACCTCCTG CCGCTCAGCG GTGGCTACTT CATCGCCCTG AACACGCGCA GAGAGCCGTT CAACGATATT CGCGCCCGAC AGGCCGTGGC CGCGGCACTC GACATCGATG CGCTGAACCT CGCCGTCTAC AACGGCGAAG GCCAGGTGGC TGACACGCTG TTCACCAAGA ACTCCCCCTT CTACTCGGAC AAGCCACTGA CGACCGTGGA CCAGGCGAAG GCCCAGAAAC TCTTCGACGA GCTGGCCGCC GAGGGCAAGC CCGTGTCCTT CACGTTCTCC ACCTATCCGT CCAGCGAGAA CAGGGCGATC GCGGAGAACG TCCAGGCCCA GCTCGACAGC TTCAAGAACG TCAAGGTCGA GGTCGCGACC GTCGACTACT CGCAGGTCGG CGCGATGCGC ACGACCCACG ACTTCGACGC GATCGTATCC GCCGCGGCCT TCCAGGACCC CGAGCCGCGG CTGTTGGCGA ACTTCACCGG GAACTCGCCG GCGAACATGC CCGGCCCCGT GGACCCGGAG CTCGACAAGA ATCTGCTGGC CGGCCGGACC GGAACGTCGT TGGAGCAGCG TAAGGCGGCC TACGACGCGG CGCAGGCGCG GTTGACCGAG GCGATGCCGG CCATCTTCCT CACCCGGTCG GCGCCTGCCG TCATCACGGG CAAGAACGTG GGCGGCATCG TGCAGTACGG CGCGGGTTCC CTGCTACCCG AGGATCTGTG A
Protein sequence | MAESFALKLR PDLAFSDGTP FDAAAVKFNW DRLKDPASAS PSATEAAMVA SAEVVDDVTL KVTMTTPVTA YTQAIVGSAM NWIASPAALQ KGQQAFDESP VGAGPFTLQS WTRQAEIRLV KNPRYWDAPK PYLAGITMRA VLDADQRYNT LISDGADVAV ETNWINLAKA EKAGLPTDLL PLSGGYFIAL NTRREPFNDI RARQAVAAAL DIDALNLAVY NGEGQVADTL FTKNSPFYSD KPLTTVDQAK AQKLFDELAA EGKPVSFTFS TYPSSENRAI AENVQAQLDS FKNVKVEVAT VDYSQVGAMR TTHDFDAIVS AAAFQDPEPR LLANFTGNSP ANMPGPVDPE LDKNLLAGRT GTSLEQRKAA YDAAQARLTE AMPAIFLTRS APAVITGKNV GGIVQYGAGS LLPEDL
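The relationship between the gene sequence, the translation table field, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: it assumes the sequence is pasted as 10-base blocks separated by spaces, uses the codon-to-amino-acid mapping of NCBI translation table 11 (identical to the standard code for sense codons), and ignores alternative start codons. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGCCGAGT CCTTCGCGCT GAAGCTCCGT"))  # MAESFALKLR
```

Applying `translate` to the full gene sequence should give a string comparable to the protein sequence field once its spaces are removed, and `gc_fraction` gives a value that can be compared with the GC content field.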