Gene Information |
Locus tag | Franean1_4900 |
Symbol | |
ID | 5673240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5883970 |
End bp | 5886114 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641243755 |
Product | FHA domain-containing protein |
Protein accession | YP_001509171 |
Protein GI | 158316663 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
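The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon (the listed gene sequence ends in TGA). The function names `gene_length` and `protein_length` are illustrative only and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(5883970, 5886114)
    print(length, "bp")                   # compare with the Gene Length field
    print(protein_length(length), "aa")   # compare with the Protein Length field
```

With the Start bp and End bp values from this record, the two results agree with the Gene Length (2145 bp) and Protein Length (714 aa) fields.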
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.542451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0483191 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCGCCG TGCTCCCGCC AGGGCCGTCA CCGGGGCCGG CCGGGCCCGC CGGGAGCGGT GACGGCCCCC GCATCGGCCC GGTCTACCTC GCCGGGCCGG CGTACGAGGC CTCGTCCGGC GCGGCGGAAC CCACACGGCC CGCCGAGCCC ACCGCGCCGC CCACGCACCC CGCACCGCCT GCGTACTCCG CGCCGTTCTC GCCGCCCGCG CCGCCCGCGC ACTCCGTCCC GCCGGCGGCG CCGCCGGCGC ACCGGGCGGA GGCGCCGACC TCCGGGAGCT CCTGGTTCGA GCCGGGACGT CCCAGCTACC CGGCGCTGCA CCTGCCCACC CCGTCGGCGC CGACCCCCGC CTCGCCGGAG CCGGGCCAGT CGCCGGAGCC GGCGTCCGCG GGACCGCCGA TCGCGGACGA GCCAGCGGCG AGTCCGTGGA CAGCGAGTCC GTGGGCGGCA GCTCCGCCGG CACCACACCC GCAGGCACTG GACCCGCAGG CACTGGACCC GCAGGCACTG GACCCGCAGG CACTGGACCC GCTGGGACCG GGCACACAGG CGACGGCATG GGTACCGGAC TCATGGGCCC CTGACCGATG GACGCCTGAC CGGTGGTCCC CGCGGCCGTC GGAGCCGTCC GAGCCGTCGG CTCCGACACC GCTGACACCG ACAGCGGCAC CGGCACCGGC ACCGACCGCA GCAGCCCCGG AGCCGCTCGA GACTGCTACG GGGCCGGCCC GCGATCCGGT CGGGCGAACC GGCCCGCTCC AGGAACGCCT CTACGCCCGC GGCAGCCGGC GCAGGTCCGC GCGCGGCCCG GCCGGCGCCA CCGACGGCCG GCCGGCGATA TCCGCGGGCA CGCCGACCCG GGCCCAGGCG GCCATGGCCT GGGCGCAGCG CTGGTACCTG CGGGACGTGC TCATCGCCTG GATCGCGGCG CGGGCCGTGG TGGGCGCGGC GCTGGCGCTG ACCAGGTTCG TCGCGGACAC CGTCGCCGGT GACAACGGGG CCACGCTGGA CACCACCGAC CTGCTCGGCT GGGACGCCGG CTGGTACCTC AACATCGCCG ACAACGGCTA CGACGCAGCC GGCGCCGAGA GCCGCCGTTT CTTCCCGCTG CTCCCCCTGC TGGTCCGGCT CTTCACCGCC CTGCCCGGCC TGGGCGGCCA CGGCGGCCAG GTGCTGCTCG CCCTGGTGAA CCTGATCGCT GTCGTGTTCG CCCTCGCTCT GGTCGGCATC GCGCGGGTCG AGGGGTTCGA CGACGACACG GTGCGACGGG TGATCTGGAT CTCCGCGCTG GCCCCCCCGG CGTTCGTCCT GGTGATGGGC TACGCGGAGG CCCTGGCGGG GCTGCTGGCC GTCACGGCGT TCCTGGGCGC GCGGACGCGG CGCTGGGAGC TGGCGGTGGT CGCCGGCCTG CTCGGCGGCC TGTGCCGGCC GCTGGGCCTG CTTCTCGCCG TCCCCGTCGC GATCGAGGCC GCGCGGGGGC TGCCGCTACC CCTGGCCCGG CGTCTCGGCG CCCCACCACC GAACCGCCTT GGTGCCCCGC AGCCGAACCG CCTTGGTGCC CCGCAGCCGA ACCGCCTTGG TGCCCCGCAG CCGGACCGCC TCGGTCCGGC TGAGCCGGAA GACCTCGGTG CCCCGTCGGC ACCACGCGCC GCCGCACGGG AGGCGCTTGC GCGGCTGTGC GCGGTCGCCG CGCCCGTGGC CGGGGCGGGG ATCTACCTGC TGTGGTCGGC GGCCATCCAC GGCGACGGCC TCGCTCCGTT GACCATGCAG CGGGACGCGG CCCGCCACGG CAGCACGGCC 
AATCCGCTGC TGACCGTCAT CGACGCGGCA AAGGGCGCCC TGCACGGCGA GCTCGGCACC GCGCTGCACG TTCCCTGGCT GCTGCTGGCG CTCGTCGCGC TGGTCGTCAT GGCCCGTGCC CTGCCGGTGT CCTACTCCGT GTGGTCGGCC CTGGTGCTCG CCGCGGTGCT GACGGGCAGC AACCTCGACT CATCGGAGCG ATACCTGTAC GGGGCTTTCC CGTTCCTGCT CGTCGCCGCG CTGGTGACGG CACGGCGCGA GGTGTGGATG CTGGTGATCA CCGTCTCGAC CGCGGCGATG ACCGTGTACG CGACACTGGC GTTCACGCTG TCCTATGTTC CCTGA
|
Protein sequence | MPAVLPPGPS PGPAGPAGSG DGPRIGPVYL AGPAYEASSG AAEPTRPAEP TAPPTHPAPP AYSAPFSPPA PPAHSVPPAA PPAHRAEAPT SGSSWFEPGR PSYPALHLPT PSAPTPASPE PGQSPEPASA GPPIADEPAA SPWTASPWAA APPAPHPQAL DPQALDPQAL DPQALDPLGP GTQATAWVPD SWAPDRWTPD RWSPRPSEPS EPSAPTPLTP TAAPAPAPTA AAPEPLETAT GPARDPVGRT GPLQERLYAR GSRRRSARGP AGATDGRPAI SAGTPTRAQA AMAWAQRWYL RDVLIAWIAA RAVVGAALAL TRFVADTVAG DNGATLDTTD LLGWDAGWYL NIADNGYDAA GAESRRFFPL LPLLVRLFTA LPGLGGHGGQ VLLALVNLIA VVFALALVGI ARVEGFDDDT VRRVIWISAL APPAFVLVMG YAEALAGLLA VTAFLGARTR RWELAVVAGL LGGLCRPLGL LLAVPVAIEA ARGLPLPLAR RLGAPPPNRL GAPQPNRLGA PQPNRLGAPQ PDRLGPAEPE DLGAPSAPRA AAREALARLC AVAAPVAGAG IYLLWSAAIH GDGLAPLTMQ RDAARHGSTA NPLLTVIDAA KGALHGELGT ALHVPWLLLA LVALVVMARA LPVSYSVWSA LVLAAVLTGS NLDSSERYLY GAFPFLLVAA LVTARREVWM LVITVSTAAM TVYATLAFTL SYVP
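The relation between the gene sequence and the protein sequence can be sketched as a translation under table 11, the value in the Translation table field. The example below is a minimal sketch, not the database's own pipeline. It assumes that table 11 uses the standard amino acid assignments, that any permitted start codon (here TTG) is read as methionine when it opens the CDS, and that translation stops at the first stop codon. The helper name `translate` and the constant names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS read in frame 1, ignoring the spaces between 10-mers."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break     # stop codon ends the protein
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed in this record.
    print(translate("TTGCCCGCCG TGCTCCCGCC AGGGCCGTCA"))
```

Applied to the first three 10-base groups of the gene sequence, this gives the same residues as the first group of the protein sequence, MPAVLPPGPS, which is why the protein begins with M although the gene begins with TTG.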