Gene Information |
Locus tag | PHATRDRAFT_29758 |
Symbol | |
ID | 7194860 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 545623 |
End bp | 548698 |
Gene Length | 3076 bp |
Protein Length | 895 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183121 |
Protein GI | 219125718 |
COG category | |
COG ID | |
TIGRFAM ID | |
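The Gene Length value above follows from the Start bp and End bp fields if both coordinates are read as 1-based and inclusive. That reading is an assumption about this record's convention, though it matches the reported 3076 bp. A minimal sketch of the arithmetic, with `gene_length` as a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp / End bp fields of this record.
length = gene_length(545623, 548698)
print(length)
```

The 895 aa protein needs 895 × 3 + 3 = 2688 coding bases including the stop codon. That is less than the gene length, which is consistent with the record marking the gene as spliced.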
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GCGCTCGCTA GTAGTATTGT GAGAGTCAAC TGGCCGTACG TATCGTGTCT GTGCATTATA GTTATCACCA CTACCAGAAC TACCACTGTT ACCGTTAACA TGGGGTCCGG CAACCACGAC GACAAGACGG CGGGGCGGGT CCTCTTGCCC GCACACGTTG TACCCACTCG GTAAGTGCCT GGCAGCGTCT CGGGGGGGGG GAGGCGTTGG GGTCGCGACG CTATTCGGTC CAACCTCGTG GCGGCGCGTC TACACACACA CACACACGTA TACGTATATA CATACGCTCT CCCTCACACA CACACACACT CGCTGTAATC AGTTACGATT TGGCGCTGAC CCCAAATATA GAAGCCTTTA CCTTTACGGG TACGGTGGAC ATTACCTTCC GGATTGACGG TAGTTTGTTG AACGAGACCA ATAACAAGTC GATTACCTTG CACGCCAAGG AACTCTTGTT CTCAACCGCG TCTTACCACT TGTTGGATGG CCCCGACGCG ACGCCCGTGA CGGCGGAACA AATGAACGTC AATCTCAAGG CTACCACGGT GGAGTTTCTC TTTCCCGAGC CCATTCCGCC CGACGCCTCC ACACTCAAAC TCACCGTTGC CTACACGGGA TTCCTCAACG ACCAAATGGC CGGCTTTTAC CGGTCAACCT ACACCGACAT ACAGGGACAA TCCAAAATTA TGGTGTCCAC GCAGTTCGAA GCCCTCGATG CCCGTCGCTG CTTTCCCTGT GTCGACGAAC CCTCGCGCAA GGCCGTGTTC GGGGTTACCC TTACCGTACC CGCGCATTTG ACCTGTCTCT CCAACATGCC CGAAGCCAAG GTTACCGCCA TCAACGCACA GCAGAAGTGT GTCACCTTTA TGGACTCGGT CGTCATGTCC ACCTACCTCC TCGCCTTTGT CGTGGGCGAA TTCGATTTCC TCCAGACCCG CTCCGCGCAC GGTGTTCTCA TCAAAGTCTA CACGCCGCCG GGGAAGGCCG CGGCGGGACA ATTCGCCCTC GACGCCGCCG CCCGCGCCTT GGACGCCTAC AACGACTTTT TCAATCTACC CTACCCTCTG CCCAAACTAG ACATGGTCGC CATTCCCGAA TTCGCCGCCG GTGCCATGGA AAACTGGGGA CTCGTCACCT ACCGCGAAGT CGATTTGCTC ATTGACCCCG TCAAGGCCAG TACCATGCAG AAACAACGCG TCGCCGTGGT TGTCACGCAC GAACTCGCCC ACCAGTGGTT CGGAAACCTC GTCACCATGG CCTGGTGGGA CGATTTGTGG CTCAACGAAG GATTCGCATC GTGGGCCGAA AACTGGGCCA CCAACGTACT GTATCCGGAA TATCGAATGT GGGATCAGTT CACCACGGGG CATTTGAGTA CGGCATTGCG GTTGGATGCT CTGCAAAGTT CACACCCCAT TCAGGTACCC ATTGCACACG CCGAAGAAGT GGAACAAGTC TTTGACGCGA TTTCCTACTG CAAGGGGGGC AGTGTGGTGC GCATGATCAA GGCCGTAATT GGCTTGTCTG CCTTCCAGGA CGGACTGGGT GCCTACATGA AAAAACACGC CTACGGAAAC ACGGAAACGT ACGATTTGTG GAATGCCTGG GAGGCCTCCT CGGGCATGCC CATTGGTGAA ATGATGAAGT CCTGGACGGA GCAAATGGGA TTTCCGTTGG TGCGTGTGCG GAAGGAAGAC TTTGCGGACG ACAAGGTTGT GCTGGAGTTG GACCAGACGT GGTTTTTGTC GGATGGATCC GATATGCAGT CCGACAAGGT TTGGACTATT CCCATCTTGA 
CCTGCACGGG CGCAGGGGCG CAAGCCGATA TGACCTTGAT GCGCGACCGC ACAGCCACGG TCACGATTCC GTTTGATCCC AAGGACACGG CGCCCCGGTG GATCAAGCTC AATGCCGGTC AAGAAGTCCC GATGCGTGTT TTGCCGGGCG TGGAAATGCT TCGACGCATG TTAGTTGCCA TTGCGTCCAA GTCGATGAGC GCAATTGATC GCGCGGGGGT GCTGAATGAT TCAATGGCTG TTGTCAAGGC TGGTCACATG TCGCCGGAAG CCATGATGAC GCTTTTGAAA AGTTACAAGG ATGAGGATGA GTACGTTGTT TGGGAAGGGC TGTCGGATGC GTTGGGTGGC TTGGATGCGG TCCTCTCGGA CGACGAGAAC ATGACGGGCT ACTTTCGAGT GTTTGCCAAG ACTATGGTTG TGAATCTTAT GAATAAGGTT GGCTGGGAGG CGTCCGATTC GGATGAGCAT CTGACTAAGT TGTTGCGTGG GATTATGATC AACCTGCTTG GTGCCTTCGC CTACGACGAC GAGAGTGTTC AACAAGAGGC GAAGAAGCGC TTTGAGGCTT TCCTGGAAGA CGCCAACGAT ATAGAGTCGC TCCCCAGTGA CATGCGCACC GCCGTCTTCA AGATTGTTCT AAAAAATGGC AGTGCCAAGG AATACGAACA AGTGAAAGCT TACTTTGCCA CGGCATCGGA CAACGCCGAG CGCAAGCATG TTCTTAATTC GCTCGGGTGC ATTCAGGACG ATGCGTTAAA ACTTGCTACC ATGGAATGGT CGCTTTCGGG TGAAATTAAG TTGCAGGACT TTTTTTACCT CATGGGATCG GTAGGCCGGT CTTCAAAACA GGGGCGTGAG ATTGCTTGGA AGTTCTTCCA GGAAAACTTT GAGCGCATTC GCATTCTGCT GCAAAAGGCA CACCCCGCTT TGATGGACGC TTGCATTGTC ATGTGCGCCG GCGGCTTTTG TTCGGAAGAA AGAGCGGACG AAATCGACAC GTTTTTTCAA GCCCATCCCC TGCCGTCCAG TACACGCAAG ATTGCGCAAA CGACCGAACA CATGCGGGCG AACGGCAAGT TCTTGCGAGT CCTGAAAGCC AGTGACTTGG CCAAGGCGGA GTTTTGGGAA AAATTGTAAA GTCCAGAATT CGTTACACAA ATTACTGCGC GCTCACAGTC AAGTTCGTCG AGCTTGGCAC CTACAATAGT TTACGGGTCG ACCGGAAACG AAACGACCAC AGACTGTGAA CCTCTAGAAA TTTCGAAACT AGGCTT
|
Protein sequence | MGSGNHDDKT AGRVLLPAHV VPTRYDLALT PNIEAFTFTG TVDITFRIDG SLLNETNNKS ITLHAKELLF STASYHLLDG PDATPVTAEQ MNVNLKATTV EFLFPEPIPP DASTLKLTVA YTGFLNDQMA GFYRSTYTDI QGQSKIMVST QFEALDARRC FPCVDEPSRK AVFGVTLTVP AHLTCLSNMP EAKVTAINAQ QKCVTFMDSV VMSTYLLAFV VGEFDFLQTR SAHGVLIKVY TPPGKAAAGQ FALDAAARAL DAYNDFFNLP YPLPKLDMVA IPEFAAGAME NWGLVTYREV DLLIDPVKAS TMQKQRVAVV VTHELAHQWF GNLVTMAWWD DLWLNEGFAS WAENWATNVL YPEYRMWDQF TTGHLSTALR LDALQSSHPI QVPIAHAEEV EQVFDAISYC KGGSVVRMIK AVIGLSAFQD GLGAYMKKHA YGNTETYDLW NAWEASSGMP IGEMMKSWTE QMGFPLVRVR KEDFADDKVV LELDQTWFLS DGSDMQSDKV WTIPILTCTG AGAQADMTLM RDRTATVTIP FDPKDTAPRW IKLNAGQEVP MRVLPGVEML RRMLVAIASK SMSAIDRAGV LNDSMAVVKA GHMSPEAMMT LLKSYKDEDE YVVWEGLSDA LGGLDAVLSD DENMTGYFRV FAKTMVVNLM NKVGWEASDS DEHLTKLLRG IMINLLGAFA YDDESVQQEA KKRFEAFLED ANDIESLPSD MRTAVFKIVL KNGSAKEYEQ VKAYFATASD NAERKHVLNS LGCIQDDALK LATMEWSLSG EIKLQDFFYL MGSVGRSSKQ GREIAWKFFQ ENFERIRILL QKAHPALMDA CIVMCAGGFC SEERADEIDT FFQAHPLPSS TRKIAQTTEH MRANGKFLRV LKASDLAKAE FWEKL
|