Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_1682 |
Symbol | |
ID | 7204526 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 384775 |
End bp | 386170 |
Gene Length | 1396 bp |
Protein Length | 437 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | anthranilate synthase. anthranilate synthetase. chorismate lyase |
Protein accession | XP_002185694 |
Protein GI | 219120925 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0944619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGACCCGATG AGCCAGCGTG GGTCGACGAA GACGATGTTG GATTGGAAGT AAATTTGCTC GCAACGAGCC GATTACGAAA GTTAAGGAAA AGTAAAGCAG AAGTAGCTGC TCTTGCCCTT TGCGGTACCG ACCTTGAGGA GCGTTTAAGG CAGCGATATC AACGGACAAC CCAAGCGACG GCTCGTACAG ACTGGGCTCG TTTAAGTGAT AGTTCGACTT TGATTGACAA AGACGCCGGC AATAACGACG AGGACTCGGA CTCGGACTTC AAGTCGTTAT CGACTTCTCT GTTGCTCCAC TCGACCTCGA CTAGACGCCT CCCTCCGAAC ACTTTAAATC TTGTGCGTTG CCCTGACGCA AATCAATCCG ACCCAAATCA ATCAGTTCTC CAGGCGGTAC ATTTCCACCC TGCATCTGAT GCTGACCAAC CACTCCTTTT AACTGCGGGT CTCGACAAAA CGCTCCGATT TTTTCAAGTG GGAGTAGACA AAAGTGAAAA AGTGCATGGA ATTCATTGTA AGTTGATGTC GTTGCTTTGT TCGAAAATAC GCACATTTAC AATTTCTGTG ATCTCAATTG TCCCCTTCAT TAGTCCCTAA ACTCCCGATT TATAGTGCTT CTTTTTTGGG ATCGTCAGGT AATGTTGTCG TGAGTGGGCG ACGGTCGTTT TTCTATATTT ACGACACTGT CGCAGGAAAG CTGGAACTCA TTCCCAAAAT TTTAGGAAGG GAAGAGCGAA GCTTGGAAAA ATGCTTTCCC TCTCCCGATG GTCGTACTAT TGCCTTTGTT GGAAATGATG GATACATTAT TTTGTTTGAT GCACACGCTA AGCAATGGAT TGCTGATTTG AAGATTAGCG GTAGCGTCAG AGCAATTACG TTTACTCCTG ATGGTGAGTA CATTCTAGCC AGTGGAAGTG ACGGAGAGAT CTATCGATGG GATCTTCGAA CGCGTCGATG TGTTGAACGA TTCACAAACC AAGATGGGAC GATCACATCT TATCTGGCAG CTTCCTCCCG TCATCTAGCT GTGGGAGCTG AAAGTGGTGT TGTGAACATG TTTTCGGAGC ACAGCATGGG ATACAGAGCG GTCGGCCATG TCGTGGTGTC TGAACGCAAT CCAATGAAGT CCATATTGAA CCTCCACACG TCGGCTGATA TGGTGCGGTT CAACGGCGAT GGACAAATTC TAGCATTTTC GTCTCGCCGT GAAAAAAATA GCATGAAGTT GCTTCATGTA CCGAGCGCCA CTGTCTTCTC AAACTGGCCC ACCTCTAAGA CTCCATTAGG TTTCGTTTGG TCAATGGACT TTTCCCCAGA AAGCAAATAC CTTGCCGTGG GCAATGACAA AGGAAAGTGC CTCTTATACC GTCTCATGCA CTATCAAGAA CAGTAG
|
Protein sequence | RPDEPAWVDE DDVGLEVNLL ATSRLRKLRK SKAEVAALAL CGTDLEERLR QRYQRTTQAT ARTDWARLSD SSTLIDKDAG NNDEDSDSDF KSLSTSLLLH STSTRRLPPN TLNLVRCPDA NQSDPNQSVL QAVHFHPASD ADQPLLLTAG LDKTLRFFQV GVDKSEKVHG IHFPKLPIYS ASFLGSSGNV VVSGRRSFFY IYDTVAGKLE LIPKILGREE RSLEKCFPSP DGRTIAFVGN DGYIILFDAH AKQWIADLKI SGSVRAITFT PDGEYILASG SDGEIYRWDL RTRRCVERFT NQDGTITSYL AASSRHLAVG AESGVVNMFS EHSMGYRAVG HVVVSERNPM KSILNLHTSA DMVRFNGDGQ ILAFSSRREK NSMKLLHVPS ATVFSNWPTS KTPLGFVWSM DFSPESKYLA VGNDKGKCLL YRLMHYQ
|
| |