Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4926 |
Symbol | |
ID | 5673266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5914807 |
End bp | 5916411 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243781 |
Product | sulfatase |
Protein accession | YP_001509197 |
Protein GI | 158316689 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.131881 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGGAC GACATCGGCC GCTCGGAACA CGCGCCCGGA TCGGGACGGC CAGCCTCCTG GTCGCGGCCC TCGGCGCGGC GGTGTTCGCC GCCGCGGGCG GCCCGGAGCC ATCGACGCCG CCCGGGCTGT CGGCCACGCC GTTGGCAGCC GACACGCAGC GGCCCAACTT CGTCTTCATC CCCGCCGACG ACCTCGACGC GACCACCTCG CCCTACTGGG AGGCGATGCC GAGGACGGCC GCGCTCATCC GCGACGCCGG CCTGACCTTC ACCGAGAGCT TCGCGCCCAC CCCGATCTGC TGCCCGGCCC GCGGGTCGCT GCTCACCGGA AAGTACGGGC ACAACACCGG GGTGCTCACC AACAGCGGCG ACGAGGGCGG CTGGGCGACG TTCGCGGCGA ACGGCAACGA GGAGCGCACC TTCGCGAAGT ACCTGCAGGA CAGCGGCTAC AACACCGCGC TCGTCGGCAA GTACATGAAC GGCATCGAGG ACGCGCCCGA CCACGTTCCT CCGGGCTGGA CGGAGTGGTA CGGCAGCGTC GACAACTTCT TCTACACCGG CTACAACTAC GCGCTGAACG AGAACGGGAC GATCGTGCAC TACGGCGGCC CGTCCGATCC CGCGAACTAC TCCACCGACG TCGTCGCCGC GAAGTCGGTG GACTTCCTCG AGCGGGCGGC GGCGAAGGAC GAGCCGTTCA TGCTCTACAC CGCCTCGACC GCCCCGCACC TGCCGCTGCC GCCAGCGCCG CGCGACAGCA ACAATCCGTT CACGGACGAT CTCGCGCCAC GCTCGCCCAA CTACCAGGAG CCGGACGTCA GCGACAAGCC CGCGTGGCTG CGGACGAGCG CCGGGGTCCG CAGCGCCCAG GTGAACCTGA TCAACGACAA CGACTACCGG AACAGGATGG GATCGCTCCT CGCGCTCGAC GACATGGTCG GCGACATCGT CACGACGTTG CGCGACACCG GCGAGCTCGA CCACACCTAC CTGGTCTTCA CCTCGGACAA CGGCTACAAC CTCGGCGCGC ACCGGCTGAT CCACAAGATG GCGCCGTACG AGGAGTCGCT GCGGGTCCCG CTGGTCGTCG CCGGGCCTGG GGTGACCAGG GGAACCGACG ACCACATGGT CGCCGCGATC GACATCGCGC CGACGTTCCT GGAGCTGGCC GGGGTGCCCG TCCCGGCGGA CGTCGACGGC ATGTCACTCG CGCCGCTGCT GCGCGGACAG GACCCGGCGC AGTGGCGCTC GGACCTGCTG GGCCAGTACG CCGGCCCGGG CGGCCAGGGT GACGACGGCA TCGCCGCCGA GCAGGTGCCC GGCCAGCCGA TCGTGGCGGC CGCCACCGAC CCGGTCGCCC ACTACCTGGA CATCCCAGCC TGGAGCGGGC TGCGCACCGA CCGGTACACG TATGTGCGCT GGTACGACAC GGACCGGACC GTGGTCCACG AGCGCGAGCT CTACGACCTG TCCAACGATC CTTACGAGCT CACGAACCTG CTGGCGACCC CGGCGGGACG GGCGGCGAAC GCCGAGCTCG TCGCACGCCT CGACAGCCGT CTGGACACGC TCGCCGCATG CGCCGGAGCG ACCTGCCGGA CGTAA
|
Protein sequence | MPGRHRPLGT RARIGTASLL VAALGAAVFA AAGGPEPSTP PGLSATPLAA DTQRPNFVFI PADDLDATTS PYWEAMPRTA ALIRDAGLTF TESFAPTPIC CPARGSLLTG KYGHNTGVLT NSGDEGGWAT FAANGNEERT FAKYLQDSGY NTALVGKYMN GIEDAPDHVP PGWTEWYGSV DNFFYTGYNY ALNENGTIVH YGGPSDPANY STDVVAAKSV DFLERAAAKD EPFMLYTAST APHLPLPPAP RDSNNPFTDD LAPRSPNYQE PDVSDKPAWL RTSAGVRSAQ VNLINDNDYR NRMGSLLALD DMVGDIVTTL RDTGELDHTY LVFTSDNGYN LGAHRLIHKM APYEESLRVP LVVAGPGVTR GTDDHMVAAI DIAPTFLELA GVPVPADVDG MSLAPLLRGQ DPAQWRSDLL GQYAGPGGQG DDGIAAEQVP GQPIVAAATD PVAHYLDIPA WSGLRTDRYT YVRWYDTDRT VVHERELYDL SNDPYELTNL LATPAGRAAN AELVARLDSR LDTLAACAGA TCRT
|
| |