Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4801 |
Symbol | |
ID | 5673142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5731955 |
End bp | 5733463 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243657 |
Product | alkaline phosphatase |
Protein accession | YP_001509073 |
Protein GI | 158316565 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3540] Phosphodiesterase/alkaline phosphatase D |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.447394 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGG TGAGCCGACG GTCCGTCGTT CTCGGCGGAA TTGCGGGCGT GGGAACGGTG CTGGCGGGCG CCGCGGCGGC CCGTGCCGCG TCCTATCCCT TCACGCTGGG CGTTGCCTCG GGCGAGCCCA GCGCGGACGG ATTCGTGCTG TGGACCCGCC TCGCGCCCAG TCCCCTCGCC GCGGACGGGC TCGGCGGCAT GTCGAGCGGC GCCGTCACCG TGGAGTGGCA GGTCGCCACC GACCAGTACT TCACCCAGAT CGCCGCCAGC GGCTCGGTCT CCGCCGTCCA GGCCTGGGCG CACAGCGTGC ATGTCGAGGT CGGCGGCCTG CAGCCCAACC GGGAGTACTG GTACCGCTTC CGCGCCTCCG GCCAGATCTC GCCGGTCGGC CGGGCCCGGA CCGCCCCGGC CGTCGGCTCC AGCCCCGTCC TGAAGATGCT GTTCACCTCG TGCTCGCACT ACGAGGCCGG CTACTTCACC GCCTACCGCC GGATGGCCGA GGAGAACCCG GACCTCATCC TGCACCTCGG GGACTACATC TACGAGGGCG GGGCCGGGTC CGGCGTGCGC TCGCACGTGC CCAGCGCCGA GATCAGCTCG CTGGCCGACT ACCGCGTCCG GCACGCTCTC TACAAATCCG ACGCCGACCT GCAGGCCGCG CACGCCGCCG CGCCGTGGAT ACCGGTCTGG GACGACCACG AGGTCGAGAA CAACTACGCC GACCTCGTCC GCAACGACAC CAGTCCGGCC GGCGACTTCA CCGCCCGCCG GGCGGCCGCC TACAAGGCGT ACTACGAGCA CATGCCGCTG CGGTCGGCGC AGGTTCCCGT CAACGAGAAC CTGCAGCTCT ACCGGCGCCT GCGCTGGGGC AGCCTGGCCA CCTTCCACAT GCTCGACACC CGGCAGCACC GGGACGACCA GGCGTGCGGT GACGGCACGA AGGTCTGCGC CGCGGCCGAC GACCCGGCAC GCACGCTGAC CGGGGCGACG CAGGAGGCCT GGCTGCTCGA CGGCCTCGGC CAGCGCCTGG GTACCTGGGA CATCATCGGC CAGCAGGTGT TCTTCGCCCA GCGCCTCGCC GCCTCCGACG GCTCGAAAAG CATGGACGCC TGGGACGGTT ACACCGCCAA CCGCGGCCGG ATCCAGGCGG GCTGGCAGGC CAGCGGCAAC ACCAGCACGG TCGTGCTCAC CGGAGACGTC CACCAGCACT GGGCGGCCGA CATCATGGAC AACTACGCGA CCCAGAACAA GGTGATCGGC ACCGAGCTGG TGTCCACCTC GATCACCTCA GGCGGGGACG GCGCCGGTGC CGGGACCGGC CTGTCCAGCC TCAACCCGCA TGTGAAGTTC AACTGGAACC GGCGCGGCTA CGTCCGCACC GTCACCACAC CCACCCAGAT GACGGTGGAC TTCCGCGCGC TCAACCAGGT CACGGTCCGT GGCAGCGCGG CCACCACCGT GCAGAGCTAC GTGATCGAGG CCGGCAACCC CGGTCTCCAG ACGGTGTGA
|
Protein sequence | MNQVSRRSVV LGGIAGVGTV LAGAAAARAA SYPFTLGVAS GEPSADGFVL WTRLAPSPLA ADGLGGMSSG AVTVEWQVAT DQYFTQIAAS GSVSAVQAWA HSVHVEVGGL QPNREYWYRF RASGQISPVG RARTAPAVGS SPVLKMLFTS CSHYEAGYFT AYRRMAEENP DLILHLGDYI YEGGAGSGVR SHVPSAEISS LADYRVRHAL YKSDADLQAA HAAAPWIPVW DDHEVENNYA DLVRNDTSPA GDFTARRAAA YKAYYEHMPL RSAQVPVNEN LQLYRRLRWG SLATFHMLDT RQHRDDQACG DGTKVCAAAD DPARTLTGAT QEAWLLDGLG QRLGTWDIIG QQVFFAQRLA ASDGSKSMDA WDGYTANRGR IQAGWQASGN TSTVVLTGDV HQHWAADIMD NYATQNKVIG TELVSTSITS GGDGAGAGTG LSSLNPHVKF NWNRRGYVRT VTTPTQMTVD FRALNQVTVR GSAATTVQSY VIEAGNPGLQ TV
|
| |