Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42513 |
Symbol | |
ID | 7196063 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 266059 |
End bp | 270510 |
Gene Length | 4452 bp |
Protein Length | 1379 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176556 |
Protein GI | 219109603 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGA ACCGTTCGAT TCGTGACGGG TCGCCGAACG TTGCTAACGA CCATCCCGGA GACACGGAAC GGAGGAAGAG TCCCATCTCG ACCCCTGCGG CTTCACCGTG TCACGACGAA ACCGACGACG CTCCCGTGGC AACGCTAGAT ACCGCTCCAG AATTTGACGG CGGCACGAAT GAGAGTTGGA TGTCTCCCGT TTTGGAAACA CCCAGAGCTC GAGCGTACTC TTCGGAAGAA CGAGGTGCAA TATCCGCGTC GGTGGGACGG CCTCCCCTCA TCGTCACGAC TTTGGCCGAC ACGGACAACA CGAGCGTCAC CGAGGGCACC GCACCCGTGT CTCCCGCGAC GTCGACTATT CCCGTGCACC GACGTGTCGA CAGCAGCGAC GCCAATTCTC TGCCTTCGCT GTCCGGACCC ATACAACCGG CCCATCCCTT CTTCAGTCCA TCCTTTTTCG AAGAATCCGA CGCCAGCGAT GGTAGTATTG TGTTTCAACC ACCAACGCCG CAACGCGATA CTTCCAACCC AGATTCTTTG CGATTCACTG GTCCGCAACA TGTTCGGCGG GACAGCCGTG GACGGGATGG GAGGGATCGC TTTCCTAGTT TCGATAGTCT CGGGAGTTCC GGATCCATTC GCATCGTTCC TCCCAGTAAT AGTAGTCAAA TCATGCGAGG AAAAGAAAAG TTCTCCGCAT CGTCAAACTC CTTACCCGCA AATCCAGCTC CGGTGATACC CAGCCTGCCG ACTATTGGAC AACTGCCGTA CCACGAACGA CGAAAACTCA TGGCACAACG TGCCAAAGAT CAGCAACAAC AACTACAGCT GCAACAACAA CAACTACAGC TGCAACAACA TCGCCCGCAT ATACCTCCGA ATTTAGCACC CATGCCGCTG CCGGCTCCGA CTTTGCAGCA TCTTCCACAC CAGCAGCAGC ACCAGAATCT TCCTCAGATT ACACCACACA CCAAAGAGGT TGCTTCTCCG TATGGACAAA ATGTATCGTT TCCTCCCCAT CAACCTCAGC ATCCACTACC GCAGGCTCAT CCTCACCACT ATATTCAAAA CTTGCCATCG GGCTACGCAC CGCATGCCGG AATACCGTTG AGAGCATCGC ATGGGCCACC ACCGCCGCCC CTTTATGCTT ACCCAGTACC TCAAGTACCG CTAGGCCCGT ATCCACCGCC ACCTCTTCAG CAGTACTACG GGCACCTTCC ACCACCACCA TCGCAGCAGC AGCAACTGCA ATCACTCTGG ACACGAAACA ACCCTCTCCA AACCGCGGAC AAGGTAGCAT CGGATCCTCG TCAGCAGCAA CATTTATACC GAACGTCCGG CAATCACCAT CAAATACCTC TTCCCCCACG TGGAGGAAGT CATTCGCGGA ACAATTCGTC GACAATTGGA CTGTCTTCGA ATATGTCCTC ATCGGATGGC GATGTGGAAC GGAAGCCGTC GGCACGAGTG CGTCCCGAAA TTGCCCCAGC CCAGCCTCCA TTACCACCCC CTCCCCCTCC ACCTGGTTTG CCGCCAAATT ACGCTCATAT GCGAACGGAC AGCTCCGGAA GTCTCTCGTC CTTAGGTAGT TTCGACCGAC CAGCGAAGAT AGAACCACGA AAAGCTAGTT TCTTGGAAAA ACTCAATCCG TGGTCACCGA AAGTACCAAA CGTAAACGAT TATCATCGTA AAAATCAGCA GTTTTTGCGG CGGGCAAGTG TAGAACGCGG AAAACTTTGG AGTACGAGCC CACAAACACC AGCTGGACGA CGGTACGTGC ATTACTTGGA GATCTCTGCG CATCTTTGTT GACTTCCGTC GTTTACTCAT TTGCCTTTCA TAAATCCGAA AGAAAACCAT CAGTTTCCAC CGAAGGACCA CCGCCCACAC GCGGATCACA CAAGCGGTTA AACTCCATAG ATGGTGACGA CTGGGAGGAG AGACCTGAAT CTCTTGATAC CGTGCCCTAC GGCAGCAAGC AGAATTCGGC CGATCCAAGT GACCAACTAA CAGTGAGTTC CGACGTCAAT TCCAGGTATG TAGACATTTT ACTTTTTTAC CAATTCATTG TTGTTGGAGC GAGTACTCAT AGTCTGATGG CTGTACAGTG AGGACAATGA TGTAGATTCT CTGGACTCGC CACAGTTTTA CGGGCATGGT TCCCGACGTG GATATGACGG AGGTGCTGAG CCGAATGAGC GCTCAAATCT ACTGCCGCCT CCTGGCTTGA ACACGAGTGA ATACTACGAC AAGTCAAATG CATCTGAGTC TCGTGCTGGC TTAAGCAGCG AAAGCTCCCA ACGCCGACAG AGCCGTCGTA CGAATTGGCC AAACGAATTT CAGTCGGCTT CAACGGAACG CAGGGGAAAT ACGACGGGAG CCAAAGAAAT GGAATATCTG GTACGCCTTG GCCTTGTGTG CCTGCCTTGA TAAATGAAGT TTTGGATTTC ACTCAACACC ATATATTCCT TTGATAGACC GAGAAAGAAA GACGAAAGCT GAATCGAAAG AAGAAGAAAA AGAAGAAGAA ACACGAACGA CACTCGAGAA AAGCTCGTGT GCAGAGTGGT TCCTCGGAGG AAGAATCCAG CAGCTCTGTG TCGGCATCAT CAGCGTCTTC TCACGAATAT CAACGATGGA TGAAGAAGCG TGCACGGATG CTGGAGAAAG AAAGATCTCG ATTGATCAAG CAATGGAGAG CTGAAGCATT TGCGGAAGAA CGGTCTACGC AGCAGCACAG TCGTTGGTAC CGTCGTTTCA GCCGCTATCA AAAAGAACAG TTTGGCGAAT GGGTCAGCCA GCTTTTCCGA TTCTTTATTT GGCTAGAATC CTTTGTTGCC AATCTTCCGC TTACGATTGG TGCAATTGCT CTAGCAGTCG CCAACCTTGG TGTTGACTGG TTCAAATTTG CTGAAGAAAA CATGGATTCC TGCGAACCGG TGCACTTTCA TTCATCTCAG TGCACATTCC CCGAATTTCC TGGTTGTTTT TATTGCGACA CTAGTGCTAG AATGTACAAA GTTGCGCTGA ATTTCCATTT TGCTTGTTCG ATTATCGCAG GAGTCATAGC GTCGACTTTT ATTGCCAAAC TCATTTTGGC CCGCCGTGTG GTATTCGACG AACTGAGTTC TCCAACTACG GCAACACCAG CGGGTTTGCT TTGCATGACT CTGAATGTGG TTTTTGCCGG ACGAGGACTG ATAGGACAGG TAGTGGTCTC GCTAGCTGGC TTTATTCATC TCTGTCTTGC AATCTGGTTC ATTTACATGG CGCTAGCATA CCGCATCATG CCTGAACCAA GTTGGTTCCC AAACACGGTT TCTATTGGAC TTTCGGCAGT GAAAATATGG TTGTACTATC CAATGGCTGG GCATTTCCTT ATGGCGGTGT GTACTACTTG TTTTTGTTGC TACTAAACAG AAAGGGCGTT ACAAATTTTC TCATGGCGCT GGCTCTTTTT TGCTTCAGAT ATCCCTCTCG TTGAACTTTT TCTTTTTCCC GATCAGTCTT ATTCGTGTTG CCATGAATAG AAAAATTTCG GCAACAGTAG GGTGGATGCA AATGTCCGCC CCAAACATAA GTCTTTATGC AATGACGCTC ATGGCCCAGC CTTCCTTTAA GGAAGAACAC CCAGATATCA ATCGGTTTCA AGTAGTCCAT CGCATGGTAT ATCTACCTTG CATGCATTTT TTCTTTGGCC TTTGCATAGT GGGAATGCTA GCTAGCGTCC ACAGCTTGTT GGTTCGATGG ACTGAGTTCC GAAAGATTCC ATTTTCTCCA GCTCATGCTG CTTTTTGTGT TCCGACCTTA TCTCACGCGA ACGCTATCCA GGCGTACCGA GCAGCCGTCA ATTCATTTTC AAAGGTGCCT GTTGGAAGTC CGTTTCGCAG CTTCCTTTAT GTTTACTGGG TCTTTGTTCT CATAGCTGGA ACGTTCCTGA CACTTTGGAT TGCGACGAAA TTTATGTGGA GCTTACCAGG TTGGACTCAT ATTGATACGG CAGGCGAAAT GGAACCGCCA GCCCCATACG AAACAGCCAT GACGTCATCT AACCTAATTA CGACCGGAGA AAGCTTGGTG CAGCCGTTCA TCAGCCCAGC GATTCTACAG GCCAATGAAA CGGGTGCTTT GGTAGTTTCC CGCGACCAAT ATGGAGCTCA AGTCTACCGA CGAACGCGAA TGGTGACTGC GCTCGGTTTC GAACCGATTA TGAACCAACT GCAAATGGAC GTAGAGCGCG AACTACTTTT GGACTGGGTC GGAAAGAATC CTCCGCGACG GCGACACCGG ACACTAAGCG TACCGGGAAT TGACTTTACA TACGGAGCCA CGGGCGCTTT CGGTGCGGGC AACGCCGGTG TGTACGGAAT GGACGAGGGA ACAGGGTCAC CGTGGTTTTC TCGTCCGAGA GCGAACACCA GCTCTCCCAA TGTGAGTCAT CGGTATACTT AA
|
Protein sequence | MSTNRSIRDG SPNVANDHPG DTERRKSPIS TPAASPCHDE TDDAPVATLD TAPEFDGGTN ESWMSPVLET PRARAYSSEE RGAISASVGR PPLIVTTLAD TDNTSVTEGT APVSPATSTI PVHRRVDSSD ANSLPSLSGP IQPAHPFFSP SFFEESDASD GSIVFQPPTP QRDTSNPDSL RFTGPQHVRR DSRGRDGRDR FPSFDSLGSS GSIRIVPPSN SSQIMRGKEK FSASSNSLPA NPAPVIPSLP TIGQLPYHER RKLMAQRAKD QQQQLQLQQQ QLQLQQHRPH IPPNLAPMPL PAPTLQHLPH QQQHQNLPQI TPHTKEVASP YGQNVSFPPH QPQHPLPQAH PHHYIQNLPS GYAPHAGIPL RASHGPPPPP LYAYPVPQVP LGPYPPPPLQ QYYGHLPPPP SQQQQLQSLW TRNNPLQTAD KVASDPRQQQ HLYRTSGNHH QIPLPPRGGS HSRNNSSTIG LSSNMSSSDG DVERKPSARV RPEIAPAQPP LPPPPPPPGL PPNYAHMRTD SSGSLSSLGS FDRPAKIEPR KASFLEKLNP WSPKVPNVND YHRKNQQFLR RASVERGKLW STSPQTPAGR RKPSVSTEGP PPTRGSHKRL NSIDGDDWEE RPESLDTVPY GSKQNSADPS DQLTVSSDVN SSEDNDVDSL DSPQFYGHGS RRGYDGGAEP NERSNLLPPP GLNTSEYYDK SNASESRAGL SSESSQRRQS RRTNWPNEFQ SASTERRGNT TGAKEMEYLT EKERRKLNRK KKKKKKKHER HSRKARVQSG SSEEESSSSV SASSASSHEY QRWMKKRARM LEKERSRLIK QWRAEAFAEE RSTQQHSRWY RRFSRYQKEQ FGEWVSQLFR FFIWLESFVA NLPLTIGAIA LAVANLGVDW FKFAEENMDS CEPVHFHSSQ CTFPEFPGCF YCDTSARMYK VALNFHFACS IIAGVIASTF IAKLILARRV VFDELSSPTT ATPAGLLCMT LNVVFAGRGL IGQVVVSLAG FIHLCLAIWF IYMALAYRIM PEPSWFPNTV SIGLSAVKIW LYYPMAGHFL MAISLSLNFF FFPISLIRVA MNRKISATVG WMQMSAPNIS LYAMTLMAQP SFKEEHPDIN RFQVVHRMVY LPCMHFFFGL CIVGMLASVH SLLVRWTEFR KIPFSPAHAA FCVPTLSHAN AIQAYRAAVN SFSKVPVGSP FRSFLYVYWV FVLIAGTFLT LWIATKFMWS LPGWTHIDTA GEMEPPAPYE TAMTSSNLIT TGESLVQPFI SPAILQANET GALVVSRDQY GAQVYRRTRM VTALGFEPIM NQLQMDVERE LLLDWVGKNP PRRRHRTLSV PGIDFTYGAT GAFGAGNAGV YGMDEGTGSP WFSRPRANTS SPNVSHRYT
|
| |