Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_17238 |
Symbol | |
ID | 7196073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 297936 |
End bp | 300860 |
Gene Length | 2925 bp |
Protein Length | 958 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | anthranilate synthase |
Protein accession | XP_002177062 |
Protein GI | 219110621 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.438773 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCATTCCC AAGCGCAACG TTGTATTGCG AAATGAGATA TCTGTTCTTG TCGCTCGTCG CGTGGATTGC AACAACATCG TCCTCTTCGG CCTTTGTACC GCTCGCGCGT CGGATCGGAT GGGCGACACC GCCGACAACC TCGTCTCCGA CATTGCGTTT TTCTACCAAT ACACAATCAA GTAGCGCCAC CAACAACAAT GTAGTAGTCG TGAAAGAGGG CGGCCGGGGC GTTCCTACGG CGGGACAGCA GGCAGCCGAG CAAGGCCTGA CGTTGGGAGC CCCTCCGGTC CGTCCCCAAG GTGGACATTT TCTTACCAAA GGAGGGGTTC AGGTCACCGC ACACGTTTCG AGGCTCGAGT TCTCTCCTCA GCTCCAGGCC GGTACCTCCG CACAGGCCAT TGAAGATTTG GTCACACAGT TGGATTCTCA AAAGGGTGTT CTCCTGACGA GCTCGTACGA ATTTCCGGGA CGGTACGCTC GTTGGTCACT GGGTTTCGTT GACCCCCCTT TGGAAATCTC TGGACGAGGA CAGGCGTGCA CTATTACTGC ACTGAACGCA CGGGGTAGAG TGCTGATGCC CGCTATTGAA GCTGCCATGA AAACTTTGCG GGAACAAGAA ATCCTCGAAC AAGTTAAAAT TGTCAAGGAA CACGAAGAAA CGACGCGCGT GGAGGTTCAA GTTGTGCCGC CTTCCGAAGT TGGTACATTC AATGAAGAAG AACGCAGCCG TCAGCCTTCT CTGTTTTCGG TTGTACGCGC GTTGGTAGAC TTGTTCAGCT ATCAGGCGGG GGATCGGCAA TTAGGCCTGT ATGGTGCTTT CGGCTACGAT TTGACGTTTC AATTCGAACC CATCGACTTG GCGCAAGAAC GCGATTCCGA ACAACGGGAT TTGCTCCTTT ACCTACCCGA TACTATGCTG GTTGTTGATC AAGACAAGCG CGACGCTTGG AGAGTATGCT ACGACTTTAA CGTCGATCAA AAGTCAACGC AGGGAATACC TCGAACGGGG ACACCACAAC CATTTCAAGC ATACGCCACC AGTACCGATT TTGTTGAACG CGATACACCG CCTAGCGAAT TCGCCAACTC CGTCCTCAAA GCCAAGGAAG AATTCAAGGT TGGTAACTTG TTTGAAGCGG TTTTATCGCA GACCTTTCGC GAAAAATTGA CTGTCGAACA GCCTCCATCG ACGCTCTTTC GTCGGTTGCG CGCACGAAAT CCTGCTCCGT ACGGCTTCCT CATAAACCTA GGCGAGCAAG AGTATCTGGT CGGGGCCAGT CCCGAAATGT TTGTCCGCTG CGAGGCTACA AACGACAACG ATTACCGACC GGGTGCAATT CGTGTGGAGA CCTGTCCAAT TTCAGGAACG GTCGCCCGCG GAGCCGATGC TTTAGAAGAC GCACAGCGCG TTAAATCACT CATGATGAAC GCAAAGGAAG AGTCTGAGCT TACCATGTGT ACTGATGTTG ATCGCAATGA CAAGTCGCGA ATTTGCGAAC CCGGTTCAGT CCAGGTGATC GGAAGACGGC AGATTGAAAT GTATAGTCGA TTGATTCACA CCGTAGATCA TGTAGAAGGC TATTTACGTC CGGAGTTTGA TGCACTGGAT GCTTTTCTTT GCCACACTTG GGCTGTCACG GTGACAGGTG CCCCCAAAAC GTGGGCGATT CAGTTCGTCG AAGATAACGA GCGTTCTCCT CGATGTTGGT ATGGTGGCGC CGTTGGGATG GTCGGTTTTG ATGGTGGCCT GAATACAGGG CTCACGTTAC GAACGGTGCG CGTCAAAAAC GGTATTGCTG AAGTACGAGC TGGAGCAACA CTCTTATTCG ACTCAGAACC GGAAGCCGAA GAAAAGGAGA CGGAACTGAA AGCATCCGCT ATGATTGACG CGATTGTGAG GGCGGGACCG GAAGACAGTA TCGAGACCTC TATTCTGTTG AAGAAAAAAC CCAAGAAAAT GTACCAAGGG ATGTCGTTGG TACTGATTGA TCACGAGGAC TCATTTGTAC ATACGCTTGG TAACTACCTG CGGCAAACAG GTGCGCAGGT GACGACACTT CGTAGCGGCC CGTCCGCCAT CAAGACCTTG GAAGCTATGA TAGCAAACGA AAAACAGCCG GATTTGGTTG TTTTGTCTCC GGGTCCAGGT AATCCCTCCG ATTTTGGATT GTCAACGACT ATTTCTTTTC TTGAAAAGCA TCGAATAGCC GCATTTGGCG TCTGTCTGGG CCTACAAGGA ATGGTAGAGC ATTTTGGTGG GACCTTGGGG GTTCTTAGCT ATCCTATGCA TGGCAAGCCG TCGACTATAT CGCTTACTCC CGCAGGGAAG GAAGAGAACA GTATCTTCAC CTCGCTTCCC GATTCTTTTG AAGTTGCACG GTATCATTCT CTACACGGCA TTCGTGAGCA GATGCCATCC TGCTTGGAGG TGACTGCTAC AAGTGAAGAC GGCGTCGTGA TGGGCATTCA GCACAAAACA TTGCCGTTTG CTGCCGTACA ATTTCATCCG GAGTCTATTC TCACAAGCCC TGCTACAGGC ATGACAATTC TGCAAAACGC GTTGACAGTA TTGAAGTATG AGAAAGAAGT TGACGCTATT GTAAAGGATG CTGGCCGTCA AACAGGCTCT GAGCTTGTTG GTGAGCTAGA AAAGCTTAAT GTTGAAGCAT TGAAGGACCG TTTGGAGACA GCCGGACTTT TGTCATCGGG TTCAAAAAGC GACCTAGTAG TCAAACTGGC ACTCTGGACG CACAAGAGCA AGGAAGCAAA TGCAGGAAGA CTTAATTTGG AAGGGATGAC TGTTATCGAG CTTAAAGAGC TTAAAAATAG CTTAGGAATC AAAGGTTCAG CATCATCTAA GATTGAGTTG CTGAAGCTAT TGAAACGTTG CCTCCAAATA AAAAGATAAT GAAAACTAAC TTGTT
|
Protein sequence | MRYLFLSLVA WIATTSSSSA FVPLARRIGW ATPPTTSSPT LRFSTNTQSS SATNNNVVVV KEGGRGVPTA GQQAAEQGLT LGAPPVRPQG GHFLTKGGVQ VTAHVSRLEF SPQLQAGTSA QAIEDLVTQL DSQKGVLLTS SYEFPGRYAR WSLGFVDPPL EISGRGQACT ITALNARGRV LMPAIEAAMK TLREQEILEQ VKIVKEHEET TRVEVQVVPP SEVGTFNEEE RSRQPSLFSV VRALVDLFSY QAGDRQLGLY GAFGYDLTFQ FEPIDLAQER DSEQRDLLLY LPDTMLVVDQ DKRDAWRVCY DFNVDQKSTQ GIPRTGTPQP FQAYATSTDF VERDTPPSEF ANSVLKAKEE FKVGNLFEAV LSQTFREKLT VEQPPSTLFR RLRARNPAPY GFLINLGEQE YLVGASPEMF VRCEATNDND YRPGAIRVET CPISGTVARG ADALEDAQRV KSLMMNAKEE SELTMCTDVD RNDKSRICEP GSVQVIGRRQ IEMYSRLIHT VDHVEGYLRP EFDALDAFLC HTWAVTVTGA PKTWAIQFVE DNERSPRCWY GGAVGMVGFD GGLNTGLTLR TVRVKNGIAE VRAGATLLFD SEPEAEEKET ELKASAMIDA IVRAGPEDSI ETSILLKKKP KKMYQGMSLV LIDHEDSFVH TLGNYLRQTG AQVTTLRSGP SAIKTLEAMI ANEKQPDLVV LSPGPGNPSD FGLSTTISFL EKHRIAAFGV CLGLQGMVEH FGGTLGVLSY PMHGKPSTIS LTPAGKEENS IFTSLPDSFE VARYHSLHGI REQMPSCLEV TATSEDGVVM GIQHKTLPFA AVQFHPESIL TSPATGMTIL QNALTVLKYE KEVDAIVKDA GRQTGSELVG ELEKLNVEAL KDRLETAGLL SSGSKSDLVV KLALWTHKSK EANAGRLNLE GMTVIELKEL KNSLGIKGSA SSKIELLKLL KRCLQIKR
|
| |