Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4176 |
Symbol | |
ID | 5703964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4743480 |
End bp | 4745084 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641273603 |
Product | 4-phytase |
Protein accession | YP_001538956 |
Protein GI | 159039703 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00061687 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCGTTC GTAGAATCGC CGCCTGGACC GCCCTCCCGC TCGCGGTGAC CCTGGGCCTG GTGGCCTGCG GCTCGGGCGG TGACGGTGGA TCCGGCGGGA CCAACCCCGA CGCGGCTGTG CGGATCGAGA TCGCCGAGCC GCAGCACCTG GTACCGACCA ACACCACCGA GACGAGTGGC TCACAGGTGC TCGCCGCCCT GTTCAGCCCG CTGGTCAACT ACGACGAGGC CAAGCAGGCC TACGAGGTGG CGGCCGAGTC GGTGACGTCC GAGGACAACG TCACCTGGAC GATCCGACTC AAGGACGGCT ACACCTTCCA CAACGGCGAG CAGGTCACCG CCGACAACTA CATCGATGCC TGGAACTACG GCGCCTACGC CCCGAACGGC CAGGGCTCGA GCTACTTCTT CGAGAAGATC GCCGGGTACG AGGACCTCCA GGGCGAGACC CCGGCGGCCC AGACGATGTC CGGCCTACAG AAGGTCGACG ACCTGACCTT CACTGTGACC CTGTCGCAGC CGTACTCCGG GTTCCGGACC ATGCTCGGAT ACAACGCCTT CTACCCGCTC CCGGAGGCCG CGTTCTCGGA GCCGGGCGTG CTCGACGAGA GCTACGAGCA GGCGCCGATC GGTCAGGGCC CGTTCAAGAT GAAGGGCACC TGGCAGCACG ACTCCCGGGT CGAGGTCGAG CGGTACGACG CCTTCCCCGG CGAGCAGCCG AAGGTGGCGG GCGTCGAGTT CCGGATCTAC CAGCAGCCGG CGTCCGCGTA CGCGGACGTG TTGTCGGACA ACCTCGACGT GATCAAGGCG ATCCCGACCG AGAACCTGTC GACGGCTCCC ACAGACCTGG GTGACCGGTT CAAGACCAGC CCCGCATCGT CCTTCCAGTT CCTGTCCTTC CCGACGTACC AGGAAGAGTT CAGCAACCCG GACGTGCGCA AGGCGATCTC GATGGCGATC GACCGGGAGG AGATCACGAA GGCGATCTTC AAGGACTCGC AGACACCGGC CCGCTCGTTC GTCTCCCCGG CCGTCGCGGG CTACCGCGAG AACACCATCG GCGCCGCCGG TACCTTCGAC CCGGCGCAGG CCAAGACGCT CTACCAGAGC GCGGGCGGCC CGGACCGGAT CGAGATCTCG TACAACGGCG ACGGCGGCCA CAAGGACTGG GTCGACGCCA CCTGCAACCA GCTCAAGGCG AACCTGGGTG TGGACTGCGT CGGCAGCGCC GAGCCGAAGT TCGCGGACCT GCTGACGAAG GTCAAGGCGG AGGAGCCGGT CGGCCTGTTC CGGATGGGTT GGGTCATGGA CTACCCGTCC ATGGAGAACT ACCTCGGCCC GCTGTACAGC AGCACCGGCT CGGCGAACTT CTACGGCTAC CGCAACCCCG AGTTCGACAA GCTGGTCGCC GAGGGCTCGG CGGCAGCCAC CGACGCGGAG GCGATCGAGA AGTACCAGCA GGCCGAGGAT CTGCTGGCCG AGGACATGCC GGTGATCCCG CTGCGGTTCG GCCAGAACAT CTTCGGGCAC TCGACCCAGG TCGCGAACGT GGAGATGGAC CTGTTCAACC GGGTCGACCT GCTGAAGATC GAGGCTGTTC AGTAG
|
Protein sequence | MRVRRIAAWT ALPLAVTLGL VACGSGGDGG SGGTNPDAAV RIEIAEPQHL VPTNTTETSG SQVLAALFSP LVNYDEAKQA YEVAAESVTS EDNVTWTIRL KDGYTFHNGE QVTADNYIDA WNYGAYAPNG QGSSYFFEKI AGYEDLQGET PAAQTMSGLQ KVDDLTFTVT LSQPYSGFRT MLGYNAFYPL PEAAFSEPGV LDESYEQAPI GQGPFKMKGT WQHDSRVEVE RYDAFPGEQP KVAGVEFRIY QQPASAYADV LSDNLDVIKA IPTENLSTAP TDLGDRFKTS PASSFQFLSF PTYQEEFSNP DVRKAISMAI DREEITKAIF KDSQTPARSF VSPAVAGYRE NTIGAAGTFD PAQAKTLYQS AGGPDRIEIS YNGDGGHKDW VDATCNQLKA NLGVDCVGSA EPKFADLLTK VKAEEPVGLF RMGWVMDYPS MENYLGPLYS STGSANFYGY RNPEFDKLVA EGSAAATDAE AIEKYQQAED LLAEDMPVIP LRFGQNIFGH STQVANVEMD LFNRVDLLKI EAVQ
|
| |