Gene PHATRDRAFT_17238 details

Gene Information

Locus tag: PHATRDRAFT_17238
Symbol:
ID: 7196073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 297936
End bp: 300860
Gene Length: 2925 bp
Protein Length: 958 aa
Translation table:
GC content: 50%
IMG OID:
Product: anthranilate synthase
Protein accession: XP_002177062
Protein GI: 219110621
COG category:
COG ID:
TIGRFAM ID:
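
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the coding region ends in a single stop codon; the record states neither, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 297936
END_BP = 300860
GENE_LENGTH_BP = 2925
PROTEIN_LENGTH_AA = 958


def inclusive_span(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumption)."""
    return 3 * (protein_aa + 1)


span = inclusive_span(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)
flanking = GENE_LENGTH_BP - cds

print(span, cds, flanking)  # 2925 2877 48
```

Under these assumptions the coordinate span equals the reported gene length, and 48 bases of the gene lie outside the coding region.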


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.438773
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTTCATTCCC AAGCGCAACG TTGTATTGCG AAATGAGATA TCTGTTCTTG TCGCTCGTCG 
CGTGGATTGC AACAACATCG TCCTCTTCGG CCTTTGTACC GCTCGCGCGT CGGATCGGAT
GGGCGACACC GCCGACAACC TCGTCTCCGA CATTGCGTTT TTCTACCAAT ACACAATCAA
GTAGCGCCAC CAACAACAAT GTAGTAGTCG TGAAAGAGGG CGGCCGGGGC GTTCCTACGG
CGGGACAGCA GGCAGCCGAG CAAGGCCTGA CGTTGGGAGC CCCTCCGGTC CGTCCCCAAG
GTGGACATTT TCTTACCAAA GGAGGGGTTC AGGTCACCGC ACACGTTTCG AGGCTCGAGT
TCTCTCCTCA GCTCCAGGCC GGTACCTCCG CACAGGCCAT TGAAGATTTG GTCACACAGT
TGGATTCTCA AAAGGGTGTT CTCCTGACGA GCTCGTACGA ATTTCCGGGA CGGTACGCTC
GTTGGTCACT GGGTTTCGTT GACCCCCCTT TGGAAATCTC TGGACGAGGA CAGGCGTGCA
CTATTACTGC ACTGAACGCA CGGGGTAGAG TGCTGATGCC CGCTATTGAA GCTGCCATGA
AAACTTTGCG GGAACAAGAA ATCCTCGAAC AAGTTAAAAT TGTCAAGGAA CACGAAGAAA
CGACGCGCGT GGAGGTTCAA GTTGTGCCGC CTTCCGAAGT TGGTACATTC AATGAAGAAG
AACGCAGCCG TCAGCCTTCT CTGTTTTCGG TTGTACGCGC GTTGGTAGAC TTGTTCAGCT
ATCAGGCGGG GGATCGGCAA TTAGGCCTGT ATGGTGCTTT CGGCTACGAT TTGACGTTTC
AATTCGAACC CATCGACTTG GCGCAAGAAC GCGATTCCGA ACAACGGGAT TTGCTCCTTT
ACCTACCCGA TACTATGCTG GTTGTTGATC AAGACAAGCG CGACGCTTGG AGAGTATGCT
ACGACTTTAA CGTCGATCAA AAGTCAACGC AGGGAATACC TCGAACGGGG ACACCACAAC
CATTTCAAGC ATACGCCACC AGTACCGATT TTGTTGAACG CGATACACCG CCTAGCGAAT
TCGCCAACTC CGTCCTCAAA GCCAAGGAAG AATTCAAGGT TGGTAACTTG TTTGAAGCGG
TTTTATCGCA GACCTTTCGC GAAAAATTGA CTGTCGAACA GCCTCCATCG ACGCTCTTTC
GTCGGTTGCG CGCACGAAAT CCTGCTCCGT ACGGCTTCCT CATAAACCTA GGCGAGCAAG
AGTATCTGGT CGGGGCCAGT CCCGAAATGT TTGTCCGCTG CGAGGCTACA AACGACAACG
ATTACCGACC GGGTGCAATT CGTGTGGAGA CCTGTCCAAT TTCAGGAACG GTCGCCCGCG
GAGCCGATGC TTTAGAAGAC GCACAGCGCG TTAAATCACT CATGATGAAC GCAAAGGAAG
AGTCTGAGCT TACCATGTGT ACTGATGTTG ATCGCAATGA CAAGTCGCGA ATTTGCGAAC
CCGGTTCAGT CCAGGTGATC GGAAGACGGC AGATTGAAAT GTATAGTCGA TTGATTCACA
CCGTAGATCA TGTAGAAGGC TATTTACGTC CGGAGTTTGA TGCACTGGAT GCTTTTCTTT
GCCACACTTG GGCTGTCACG GTGACAGGTG CCCCCAAAAC GTGGGCGATT CAGTTCGTCG
AAGATAACGA GCGTTCTCCT CGATGTTGGT ATGGTGGCGC CGTTGGGATG GTCGGTTTTG
ATGGTGGCCT GAATACAGGG CTCACGTTAC GAACGGTGCG CGTCAAAAAC GGTATTGCTG
AAGTACGAGC TGGAGCAACA CTCTTATTCG ACTCAGAACC GGAAGCCGAA GAAAAGGAGA
CGGAACTGAA AGCATCCGCT ATGATTGACG CGATTGTGAG GGCGGGACCG GAAGACAGTA
TCGAGACCTC TATTCTGTTG AAGAAAAAAC CCAAGAAAAT GTACCAAGGG ATGTCGTTGG
TACTGATTGA TCACGAGGAC TCATTTGTAC ATACGCTTGG TAACTACCTG CGGCAAACAG
GTGCGCAGGT GACGACACTT CGTAGCGGCC CGTCCGCCAT CAAGACCTTG GAAGCTATGA
TAGCAAACGA AAAACAGCCG GATTTGGTTG TTTTGTCTCC GGGTCCAGGT AATCCCTCCG
ATTTTGGATT GTCAACGACT ATTTCTTTTC TTGAAAAGCA TCGAATAGCC GCATTTGGCG
TCTGTCTGGG CCTACAAGGA ATGGTAGAGC ATTTTGGTGG GACCTTGGGG GTTCTTAGCT
ATCCTATGCA TGGCAAGCCG TCGACTATAT CGCTTACTCC CGCAGGGAAG GAAGAGAACA
GTATCTTCAC CTCGCTTCCC GATTCTTTTG AAGTTGCACG GTATCATTCT CTACACGGCA
TTCGTGAGCA GATGCCATCC TGCTTGGAGG TGACTGCTAC AAGTGAAGAC GGCGTCGTGA
TGGGCATTCA GCACAAAACA TTGCCGTTTG CTGCCGTACA ATTTCATCCG GAGTCTATTC
TCACAAGCCC TGCTACAGGC ATGACAATTC TGCAAAACGC GTTGACAGTA TTGAAGTATG
AGAAAGAAGT TGACGCTATT GTAAAGGATG CTGGCCGTCA AACAGGCTCT GAGCTTGTTG
GTGAGCTAGA AAAGCTTAAT GTTGAAGCAT TGAAGGACCG TTTGGAGACA GCCGGACTTT
TGTCATCGGG TTCAAAAAGC GACCTAGTAG TCAAACTGGC ACTCTGGACG CACAAGAGCA
AGGAAGCAAA TGCAGGAAGA CTTAATTTGG AAGGGATGAC TGTTATCGAG CTTAAAGAGC
TTAAAAATAG CTTAGGAATC AAAGGTTCAG CATCATCTAA GATTGAGTTG CTGAAGCTAT
TGAAACGTTG CCTCCAAATA AAAAGATAAT GAAAACTAAC TTGTT
 
Protein sequence
MRYLFLSLVA WIATTSSSSA FVPLARRIGW ATPPTTSSPT LRFSTNTQSS SATNNNVVVV 
KEGGRGVPTA GQQAAEQGLT LGAPPVRPQG GHFLTKGGVQ VTAHVSRLEF SPQLQAGTSA
QAIEDLVTQL DSQKGVLLTS SYEFPGRYAR WSLGFVDPPL EISGRGQACT ITALNARGRV
LMPAIEAAMK TLREQEILEQ VKIVKEHEET TRVEVQVVPP SEVGTFNEEE RSRQPSLFSV
VRALVDLFSY QAGDRQLGLY GAFGYDLTFQ FEPIDLAQER DSEQRDLLLY LPDTMLVVDQ
DKRDAWRVCY DFNVDQKSTQ GIPRTGTPQP FQAYATSTDF VERDTPPSEF ANSVLKAKEE
FKVGNLFEAV LSQTFREKLT VEQPPSTLFR RLRARNPAPY GFLINLGEQE YLVGASPEMF
VRCEATNDND YRPGAIRVET CPISGTVARG ADALEDAQRV KSLMMNAKEE SELTMCTDVD
RNDKSRICEP GSVQVIGRRQ IEMYSRLIHT VDHVEGYLRP EFDALDAFLC HTWAVTVTGA
PKTWAIQFVE DNERSPRCWY GGAVGMVGFD GGLNTGLTLR TVRVKNGIAE VRAGATLLFD
SEPEAEEKET ELKASAMIDA IVRAGPEDSI ETSILLKKKP KKMYQGMSLV LIDHEDSFVH
TLGNYLRQTG AQVTTLRSGP SAIKTLEAMI ANEKQPDLVV LSPGPGNPSD FGLSTTISFL
EKHRIAAFGV CLGLQGMVEH FGGTLGVLSY PMHGKPSTIS LTPAGKEENS IFTSLPDSFE
VARYHSLHGI REQMPSCLEV TATSEDGVVM GIQHKTLPFA AVQFHPESIL TSPATGMTIL
QNALTVLKYE KEVDAIVKDA GRQTGSELVG ELEKLNVEAL KDRLETAGLL SSGSKSDLVV
KLALWTHKSK EANAGRLNLE GMTVIELKEL KNSLGIKGSA SSKIELLKLL KRCLQIKR
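
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard genetic code (the Translation table field in this record is empty) and simply starts at the first ATG in the input, which is a simplification rather than a gene-finding method. The function names are hypothetical.

```python
# Standard genetic code (assumed), built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate_first_orf(dna: str) -> str:
    """Translate from the first ATG up to the first in-frame stop codon."""
    dna = clean(dna)
    start = dna.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed line of the gene sequence above.
first_line = "TTTCATTCCC AAGCGCAACG TTGTATTGCG AAATGAGATA TCTGTTCTTG TCGCTCGTCG"
print(translate_first_orf(first_line))  # MRYLFLSLV
```

For the first line of the gene sequence, the output matches the first nine residues of the protein sequence, with the start codon located 32 bases into the displayed sequence.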