Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1990 |
Symbol | |
ID | 5670391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2393006 |
End bp | 2394283 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240911 |
Product | metallophosphoesterase |
Protein accession | YP_001506333 |
Protein GI | 158313825 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.217213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.336367 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTCG TCCACGCCGC GGACATCCAT CTCGACAGCC CCCTTCGGGG ACTGACCCGG CTGGGTGACG GCGACCTGGC CCACCTGCTG CGGCAGGCGA CCCGGCGGGC CCTGGCGAAC CTCGTCGACC TCAGCGTCGA CCGGGCGGCG GACGCGCTCC TGCTCGCCGG GGACGTCTAC GACGGAACCT GGCGGGACTA CGCGACCGGC CGGTTCTTCG TCGAGCAGAT GGGCCGCCTG CGCGACGCCG GCATCCCCGT CTACATGATC TCCGGCAACC ACGACGCCGA GAGCGAGATC ACCCGGTCGC TGACCCTGCC CCCGAACGTC CGGGTGTTCG CCTCGGACCG GCCGGGCACG CATGTGGCGG ACGACCTCGG GCTGGCCGTG CACGGGCAGA GCTACGCGAC CGCCGCCGTC CACGACAACC TCGTTCAGCG CTACCCGGAC GCCCTGGCCG GGCTGGTGAA CGTCGGCCTG CTGCACACTG CCGCCGACGG CGCCGAAGGC CATGCCAACT ACGCCCCGTG CTCGGAGGAG GACCTGGCCC GCACGGGCTA CGACTACTTC GCCCTAGGCC ATGTCCACTC CCATCGCGTC GTTCACGGCG GCCCACTTCC CGGCGGCACG GCGACCCGCG GGGACGGGCC CGGGGGCCGC CAGGTCGCCG CGTTCAGCGG CAATCTCCAG GGCCGCACCC CGCGGGAGAG CGGGCCGAAG GGCGCCCTCG TCGTCGAGAT CCCACAGGAC GGCCCCGCCC GCATCGAGCA CGTCCCCTGC GACGTGGCCC GCTGGGCTGT CCTCACCGTC GACACCGCCG GCGCCGACAC CCTCGACGAC GTCCTGGGCC GGGTCACGGC CGAGCTGCGG GCGGCGCGGG ACTCCGCGGG GGACCGTCCG GTCGTCGCGC GCACCGTGCT CACCGGCGCG TCGAGGGCCG CCGCGGGTCT TGCCGACGCC GAACGGCTAC GCGAGGAGCT CCGCACGGTC GCCGACGGCC TGCAGGTCTG CCTGGAGAAG ATCGTCAACC GGGTGACCGA CCCGCGGCCG GCCGGCGCCG TCGACCCCGA GCTGGTCTCC GCCGTACGCG CGGCCTGCGA CGATCTGGGC ACCCGCCCCG AGGACCTCGC GCGGTGGATC GGCCCCCTGG ACCGCGAGGT CGGACGGCTG CTACGCGGAG CCGACCTGCT CGACCTGGGC GATCCCCGCA CACTCGCCGA CCTCGCCCGC CGCGCCGGGG ACGGCCTGCT CGCGCGCCTC TCCGGAGACG ACGCCTGA
|
Protein sequence | MRLVHAADIH LDSPLRGLTR LGDGDLAHLL RQATRRALAN LVDLSVDRAA DALLLAGDVY DGTWRDYATG RFFVEQMGRL RDAGIPVYMI SGNHDAESEI TRSLTLPPNV RVFASDRPGT HVADDLGLAV HGQSYATAAV HDNLVQRYPD ALAGLVNVGL LHTAADGAEG HANYAPCSEE DLARTGYDYF ALGHVHSHRV VHGGPLPGGT ATRGDGPGGR QVAAFSGNLQ GRTPRESGPK GALVVEIPQD GPARIEHVPC DVARWAVLTV DTAGADTLDD VLGRVTAELR AARDSAGDRP VVARTVLTGA SRAAAGLADA ERLREELRTV ADGLQVCLEK IVNRVTDPRP AGAVDPELVS AVRAACDDLG TRPEDLARWI GPLDREVGRL LRGADLLDLG DPRTLADLAR RAGDGLLARL SGDDA
|
| |