Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_3823 |
Symbol | |
ID | 5060301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4379480 |
End bp | 4381105 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640476081 |
Product | 4-phytase |
Protein accession | YP_001160632 |
Protein GI | 145596335 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGGGA AATTCCTGAA GGTGGCGGTC GCGGCGACCG CCACCGCTAT GTTGGCCACT GCCTGCGGTG GCGGCGGCTC GAGCGATGCT GATGGTGACG CCAGTGGCAC GCTTCGTGTA TATGCGTCAG AGCCGGCGTA CCTGCTGCCG TCGAACGCCG ACGACGAGCC GTCGATCTAC GTGATCCGTC AGCTCTACCG CGGTCTGGTC AAGTACAACG CCGAGACCAG TGAGTCGGAG ATGGATCTGG CCGAGTCGAT CACCTCGGAC GACCAGAAGC TCTGGACGAT CAAGCTGAAG GACGGCTACA CCTTCGACAA CGGTGAGCCG GTCGACGCCG ACTCCTTCAT CCGGTCGTGG AACTTCGCCG CCTACGCGCC GAACGCCCAG AACAACGGCT ACTTCATGAA GCGGATCGCC GGTATCGACG AGGTCCTGCC CAAGGACCCG GACGGCGAGG GCCCGGCGGA GGCCCCGGCA CCGACGGTCG AGACGATGTC CGGCCTGACG AAGGTCGACG ACCTCACCTT CACCGTCGAG CTCAAGGAGC CCTTCACCGG CTTCCCGACC ATTGTCGGGT ACTCGGGCTT CTTCCCGATG GCCCAGGCCT GCGTCGACGA CGCGGATGCC TGCAACGAGA CCCCGATCGG TAACGGCCCT TACAAGATCG ACGGTAGCTG GGAACACGAC GTCGAGATCA ACCTGGTCCG CAGCGAGACC TGGAAGGGCG AGCCGGGCAA GCCCGAGGCG ATCAACTACC GGATCTTCGC CGACGTGGAC GGCGCCTACG CCGCCTTCCA GGCCGGCGAG CTGGACGTGA TGTACACGAT CCCGCCGGCG CGCTTCAAGG ACGCCAAGGC CAGCTACGGC GACCGGCTGT ACGAGCAGGC GGGCGACAGC CTCAACTACG TCGGCATGCC GCTGTACGAC GACAGCTTCA AGGACAAGCG GATCCGCCAG GCGATCTCGC TGGCGATCGA CCGGCAGTCC ATCGTTGACG CCGTCTTCGA CGGACGGTGG ACTCCCGCCA CCGGCTTCGT CGCGCCGATC TTCGAGGGCG CTCGCGAGGG TATCTGCGCC TACTGCGAGA AGGACGTCGA GAAGGCCAAG GAACTGCTCG CGGCGGCCGG TGGCTGGCCG GAGGGCAAGA AGCTGACCCT GTGGGCCAAC GCGGGTGCTG GCCACGACGC CTGGCTCCAG GCCGTCGGCG ACCAGGTCAA GGCCGCGCTG GGCATCGACT ACGAGCTGAA GGTCAACCTG CAGTTCGCCG AGTACCTGGA CGTGGCGGAC AACCGGGAGT TCACCGGCCC GTTCCGGCTC GGCTGGGGCC CGGACTACCC GTTCCTGGAG ACCTACCTGA CTCCGCTGTA CAGCACCGGC AACGACAGCA ACAACAGCAC CTTCAGCAAC CCCGAGTTCG ACAACCTGCT GAAGCAGGGC GACGCCGCTC CGACCATGGA GGAGGCCATC ACCTTCTACC AGCAGGCTGA GGACATCCTG GCTGAGGAGA TGCCGGTCAT CCCGATGTTC TGGCGCAAGG AAGCGGCGGT CTACAGCGAG AACGTGGACG CCTTTGTCTG GAACCAGGTC ATGGGCGCCG ACTACGGTGC GACCTCACTG AAGTAG
|
Protein sequence | MRGKFLKVAV AATATAMLAT ACGGGGSSDA DGDASGTLRV YASEPAYLLP SNADDEPSIY VIRQLYRGLV KYNAETSESE MDLAESITSD DQKLWTIKLK DGYTFDNGEP VDADSFIRSW NFAAYAPNAQ NNGYFMKRIA GIDEVLPKDP DGEGPAEAPA PTVETMSGLT KVDDLTFTVE LKEPFTGFPT IVGYSGFFPM AQACVDDADA CNETPIGNGP YKIDGSWEHD VEINLVRSET WKGEPGKPEA INYRIFADVD GAYAAFQAGE LDVMYTIPPA RFKDAKASYG DRLYEQAGDS LNYVGMPLYD DSFKDKRIRQ AISLAIDRQS IVDAVFDGRW TPATGFVAPI FEGAREGICA YCEKDVEKAK ELLAAAGGWP EGKKLTLWAN AGAGHDAWLQ AVGDQVKAAL GIDYELKVNL QFAEYLDVAD NREFTGPFRL GWGPDYPFLE TYLTPLYSTG NDSNNSTFSN PEFDNLLKQG DAAPTMEEAI TFYQQAEDIL AEEMPVIPMF WRKEAAVYSE NVDAFVWNQV MGADYGATSL K
|
| |