Gene Information |
Locus tag | PHATRDRAFT_45988 |
Symbol | |
ID | 7201053 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 874798 |
End bp | 878063 |
Gene Length | 3266 bp |
Protein Length | 693 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180338 |
Protein GI | 219119143 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
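The reported gene length can be cross-checked against the start and end coordinates above. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed 3266 bp); the function name `gene_length` is hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the length of a feature in bp.

    Assumes 1-based, inclusive coordinates (start_bp <= end_bp),
    so both endpoints count toward the length.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record for PHATRDRAFT_45988.
length = gene_length(874798, 878063)
print(length)  # 3266
```

The strand does not affect this calculation, since the coordinates are given on the replicon regardless of orientation.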
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.284744 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTAACGGTGG CACCGGAAAT GCGCGTCCTC TCTTATTGTA GAGACCGCCT CATTCGGAAG CCAGCTGAAC CAATTTTTGG CACGAGCGAG CCTTATAACA CGAAACGAAG CATGTCGAGC TGCTTCAATC TTGCGCTATG TGTGCTTGCA GCGGTGTGTC TTCCGTGGTC CGCAACGGCC CATTTCAACT GCGGAACACG AGATCCTTCT CCTTTTGAGC AGCGCTTGGA TCAAGTCCGC ATCAACCATT TCAAACAGTC CACGGAAGGC CGGCGATTGA TTACAGATTC TTGCGAAGAA CTTTGCGTCC AGTGCGTGGA AATTGACGTA TATTTCCATT TGAGCGCCGT TCCTGCTCCA TCGGCCGATG ATAGTGATCG GTTCTTTTTC CCGCATCCCC TCGAATCGGT TGATCGCTTT GCGGAAAGTG ATACGACACT GACTATCGAA GACTTTGCCT CATTGCAAGG TATTTACAGC CTGATTGATG ACAATATGCG GGTTCTCAAC GAGCGGTACG CGGAGTCTCC ATTCACATTC ACTTGGAGAA ACTCTGATCC TGCGAGTGCC AGTGTTTCCG TCAATACGGA TTTGGTGGAC TTTGTCGTCG ACACCATGTT TGACGAGAAT GGAGTTGCGT CTGAACTGCA TACCGGGGAT GCCAGTGTAC TGAATGTCTA CTTGACGTAC AGACAATGTG CTATCTCAAA CCAGCTCGAC CCTGATACCG GCGAGCCGCT GCTCTCGTGT GGCCTCCTCG GCATTGCTGT TTTCCCAAGT TTTCAGCAAT CCAACCGAAA CGCCGATGGT GTGTACGTTA ACTACAGCAC GCTTACCGGT GGAGGGTATG TATGTCAACG AGTCTGTAAT TTTGCTCTTC TGCGTGCTGA ATGATTGCTC GCCACAAATT ACCGTACTAA CAGCAAAATA CTTGCATGCT TTCTCTCTGC TTTCAAGCTT TCCAAACAAC GACGCTGGGT TGACACTTGT TCATGAAGTT GGGCACTGGC TTGGACTGTA TCACACGTTT CAGAATAGTG CTGCCCAGGA GGGAACCGAC CCGTGCTCAC CGCAGAACGG GAACGACTTC GTCGCCGACA CACCTACACA GTTGGCATCG TCCCAGGACC TATACAACTG CTCATTGAGC TTTTATGAGG GTGAAGAGAT CCCGGACTCG TGCCCCAACT TGGCTGGAAG CGACCCTGTG TTCAACTACA TGAACTATGT CTCGAACGAA GAGTGCTGGC CCCCTGGTGT TGGGGAGTTC ACGTGTGGAC AGTACGAGCG CATGTACATG CAGTGGCTAC TGTACCGCAG ATCCGACGAG CCTTGTCAAG ACAACGAAAT GGAGATTGAG ATTTCGATGG AGATCAACCG AAGGTTTACT AGTGAAAACG CATTCTACCT GACCTACGTG GATACGGGCG AAGTGTTGCT GAACTCGACT CGTGATTTTG AGGCCCTTGG ACCTCCTTTC CAGACCGAAG TGTTGTCCGA TTTCTGTGCT CCCGTGGGAC AATACTCGTT CGTGCTTGTG GATGCCGCTC GGGACGGATT TTTGGACGGC GGTTTTCTCG AAGTGTCCGT GAACGGGGAG TTGGTCGGAA GTGTATCGGG CAACTTTGGA GAGTCGGCCA CGATTGACTT TGGCACACCA GATGGAGATA GCGGTGCCAA TTTTGTTGGG GGCTCCAACA GTGATGCTAG GCGTCGTTGT TCGACTCCTG CGATCGTTTT CTTCTGCGGG ATAATATACT TGTTCGTTTA AGGTTTCTTA ATGTAGCATC TGTCATCGGA 
AAAATCTTCT TGTTCATTGC TAACAGTAAG CGCCTTGTTT TTTTGCTTCG GTCGAGATCT AGATCCTGGA GTTCATTTGT GGCCACGGAT CAACTGCTTG CCACTAAACA AGTTTACAAT TGGCATTTCA ATTTGGCGAC ATTAAAACAG TTGTTTTTTT GCTTCGGTCG AGATCTAGAT CCTGGAGTTC ATTTGTGGCC ACGGATCAAC TGCTTGCCAC TAAACAAGTT TACAATTGGC ATTTCAATTT GGCGACATTA AAACAGTTTT TACTTCTTTC ACTCTCTAGA TCGCGAGCTT TCCTAAACAT TCAGCAGCGT ACAGCTAGCG GAAAAGTCTT GAGCGCCTCG TCACTGTTGC TTTCTTACTT TTGGTCCTCA CATCAAAAGT TTTGTTCAAC ATTTCCATAA TTGACAGTGA GGTGATAGCT TTTGAAGTCT AACTGTAAGT GTGAGCGGCA ATGGGAAATC GATTAACAGG AACCATGAGG GTCTACCTCT CTCTCTGGAT AGGTCAAGAA AGGACAGGAG CATCCAGGTC TTTCTCTGAA GCGTACAGAG ATGAACAAAT TGAATTTTTA GGTTTGTGAA TGAGTCCCAT TTCCTCGCAG ACAGCAAAGA GGGCTTTTCT TGGTGTCGCA CCAAAAAAAT TTCTTCCCCC TGCTCTGTCA ACCAGGATCG ATGAGAGCTG CTATGGGTGG AATAGAATAG CCTAAGTTCA GAGAGTCATG TGTTGTCCGA TTTCTGTGCT CCCGTGGGAC AATACTCGTT CGTGCTTGTG GATGCCGCTC GGGACGGATT TTTGGACGGC GGTTTTCTCG AAGTGTCCGT GAACGGGGAG TTGGTCGGAA GTGTATCGGG CAACTTTGGA GAGTCGGCCA CGATTGACTT TGGCAGTTCT GCGGGAGGTC CGGGCCCCGT CGGCTTCCCA AGCAGCATAA CGAACAGCCC GGTGGAGGCA AATGCGCCTG TGTCCAGCCC AGCGGACGGG GATACCTTTG TTCCCACACC CGTCGGTAAT CCATCTTTTG TCAGCAAAGC GGCGTACCGC GGTTGGGCTG CGGCTCTTGG TGTGGCCCTG GCGGGAACGG GCTGGACGGT TGATTTGTGA CACTAAATCG AAACCGTATC ACTGTCTTGT TTCGTATTTT GTGCATGAGG CTGTTTTTAT TCGTCCGCCG AGACCGAGAG CCAGAGCATA GCGTCGATGG ACACAACAAA CAAACATTGC GGTGGCGACG AAGTTTTTCA GTGTAAACTA TGTAATCTTT GTATGACTTG ACAATCTTTG CTTTGGAAAG CATGATGCCT GATGTGTTCA CAAGAGAGAC TTCGACCAGG GCCCGTATTC CCAAACCCGA GACTTTATTT CGAATAGGCG ATGTCTCGAC GTAACGGCAC CATGGAGCAT TCTCGACAAC TTTCGTACTC CTGTAGGATC GTTCGACTGT TCCCAAGTTC GGGGTT
|
Protein sequence | MRVLSYCRDR LIRKPAEPIF GTSEPYNTKR SMSSCFNLAL CVLAAVCLPW SATAHFNCGT RDPSPFEQRL DQVRINHFKQ STEGRRLITD SCEELCVQCV EIDVYFHLSA VPAPSADDSD RFFFPHPLES VDRFAESDTT LTIEDFASLQ GIYSLIDDNM RVLNERYAES PFTFTWRNSD PASASVSVNT DLVDFVVDTM FDENGVASEL HTGDASVLNV YLTYRQCAIS NQLDPDTGEP LLSCGLLGIA VFPSFQQSNR NADGVYVNYS TLTGGGFPNN DAGLTLVHEV GHWLGLYHTF QNSAAQEGTD PCSPQNGNDF VADTPTQLAS SQDLYNCSLS FYEGEEIPDS CPNLAGSDPV FNYMNYVSNE ECWPPGVGEF TCGQYERMYM QWLLYRRSDE PCQDNEMEIE ISMEINRRFT SENAFYLTYV DTGEVLLNST RDFEALGPPF QTEVLSDFCA PVGQYSFVLV DAARDGFLDG GFLEVSVNGE LVGSVSGNFG ESATIDFGTP DGDSGANFVG GSNSDARRRC STPAIVFFCG IIYLSRAFLN IQQRTASGKN SLSSESHVLS DFCAPVGQYS FVLVDAARDG FLDGGFLEVS VNGELVGSVS GNFGESATID FGSSAGGPGP VGFPSSITNS PVEANAPVSS PADGDTFVPT PVGNPSFVSK AAYRGWAAAL GVALAGTGWT VDL
|
| |