Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40608 |
Symbol | |
ID | 7198384 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 435360 |
End bp | 438320 |
Gene Length | 2961 bp |
Protein Length | 911 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184615 |
Protein GI | 219128847 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.158773 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA AGTCTACTCC CAAAGACCTC ATTGACTCGT TTCCGCACAG CAAACTCACA CCGATTGCTA CCGCGACAAC CGAACCCGAT TACTTGTCGC TCCATCAGCT GCAGTATGAA ATCAACGACA ATGCGGAGAC CCTTTCCTCC ACTCTTGGAG ACGGCCAACA CGGTCACCTT TTTCTCGTAA TTTCCGAAAC CGAGTACCTC GAAATGACCG ACGGCGTTCC ATGCATTCCT CCTGTGCAAC CGCCTTTCGA CCCAGTTCAC GCTGCCAACG CCACAGCTCC TCAAATTATC GAAGCTAACC ATCAGAACGA CAAACGTCAA AAGCTTTTTG ACCTTTACCA CAACGCCATT AAAATCAACT CCTTGAAGCC ATTCCCATTG AATACATTGA ATCTCTCGGT CATCCTACAC GAGGCTTTAA CAAAGTCTCT CCCCTCGAAA TCCTCTCTCA TCTCTGGGAA AATTTTGGTA AAATTCAGGC TTCGGATCTC ATCGCTAACG ACGAACGCAT GAAAGCCGCC TGGCATCCAC CAACGCCTAT CCAGCAACTT TTCCAGCAGC TTGACAAAGG CAATCAGTTT ATCATCGCGT CTGGCCAAGT CATGGACGAA CGAATTATCG CTCGCATCGG CTACCAGATC ATCGAAAAAA CCGGGCTCTT TGATCTTGCT TCTCGCGACT GGCATTATAA AGATGAAGCC GATAAAACTT TGGCAAATTT CAAAAAACAT TTCCAGAAGG CCAACAAGGA TCTCGCCCTC ACCGCCACCA GCAGCTCTGC AGGTTATCAC ACCGCAAATC AGAGTACTGT CACCAAGGGA AAATCGTATT GCTGGACACA CGGCATCGTT CACAACACGA AGCACACCAG TGCGACATGT GAAAAACAGG CCCCGGGGCA CAAAACTGGC GCTACCTTGC ACGACAAACA AGGCGGGTCG ACCAAGACCT ATCAATACAC GCCACCGGTG CCCAAATAGG AAAGGGGGAC GGCCAAACTG TTGAGTGTGC CGCTGAATAC TTATCATAAC AAAAATAAGT CTTCAGTTGC ACCAAACACT CCTCCGTTAG CTTCCTCCCC GCCATTTTTT CCTCCCGACG CCATTGCAGA CACTGGCTGT ACCGGACATT TTTTGAGCAC CAACATTGCT CACATACACT GCCAACCGAC AGTCCCCGGC ATCAACGTGG TCCTCCCTGA TGGTCGCACA ATCACTTCGA GTCACATCAC CGAACTCAAC ATTCCCTCGC TTCCGCCGGG AGCTCGTACC GCCCATATCT TTCCTGGTCT CTCGAATGGA TCCCTCATTT CCATCGGCCA ACTTTGTGAC CACGGCTGTA CCGCCACGTT CACATCCGAC TCAGTCCGCA TTGAGCTCAA TAACACTGTC GTTCTCCGCG GCGGCCGTTC TCCTTACACC CGATTGTGGA CCCTCGACTC CCCTGTAACG CCAAATCCTC CCGCCACTGA ATTGCATGCG CCTTTGCACG ACAAAAATTT TGCGAATCAC CTCGGAGACC ACTCAGGGAC CCTTGCCGAC CGCATTGCCT TTGTTCATGC ATCCTTATTC TCGCCACAAC TTTCGACATG GTGCAAGGCC ATTGACAAAG GCCGCCTCAC AACCTTTCCG GACATCACGT CTGCACAGGT AAAACGGCAC CCCCCACAGT CCGTCCCTAT GGTCAAGGGA CACCTTGACC AGCAACGGTC CAACCTACGC TCAACCAAGC CCAAGGTCAC CCTGTCTGCC TCTGTTGATC CTGATGACAT CAATTTCGAC ACCAATCCTG TCGTACAAGA CCCTCCAGCC GCCAGGACGC AGTTTTTGTA CGCCGATTTC GCCGAAGTCA CCGGAAAAAT TTTTACTGAC CCTACCGGCC GTTTCGTTAC CACTTCAAGC TCCGGCAATG CATACATGCT AGTGGTTTAT GACTACGATA GCAATTTTAT TCATGTCAAA GCCATGAAGA ACCGCACCGG TCCCGAGATT TTGAGCGCCT ACAAGCGTGC TCACGCCATG CTGTCCTCCA AAGGTTTGCG CCCCCAACTC CAACGCTTAG ACAACGAAGC CTCAACTGCG TTACAACAAT TCATGTCCTC TGTTGACATT GATTTTCAAT TAGCTCCTCC GCACGTGCAC CGTCGGAACG CCGCCGAACG GGCAATCCGC ACGTTCAAAA ACCACTTCAT TGCAGGTTTG TGCAGCACCA ACAAGAACTT TCCGCTCCAC CTTTGGGATT GCTTACTCCC ACAAGCCATC ATGACTCTCA ACCTTCTTCG AGGGTCTCGT ATCAACCCAA ATCTGTCGTC CTGGGCCCAA CTCCACGGCT CGTTCGACTA CAATCATACC CCTTTGGCTC CCCCGGGCAT CCGCGTGCTT GTACACGAAA AACCGTCAAT TCGCAGAACT TGGGCCCCCC ACGCAGCCGA CGGTTGGTAC GTTGGCCCCG CCATGAATCA TTACCGATGC TATCGCGTCT GGGTCAAGGA GACCACCAGC GAACGCATTT CGGACACTCT GACCTGGTTT CCCAGCCAAG TCAAAATGCC CAGCACCTCG TCTCGCGACA CAATTGTCGC CGCCGCTCAC GATCTTGCCC ATGCTCTGGC ACATCCCTCT CCTGCGTCGC CTTTATCACC TCTTTCGGTC AACGAACGCG AAGCCCTCTC GCAACTTTCA GATATTTTTT CGAAAGCCGC TAACCCAGTT GACTCGTCCC TCCCAGTTGC TCCCACGGCA ACCCTAAGTC CGCCAACTTC ATCTACTTCT TCACCTTGTC AAGTCCGCTT CCGAGACCCG GTCACTGAAT CACTTCCGAG GGTGCCGACC GCCACAGCCG CCCTTCCGCA GTCACTTCCG AGGGTGCCTC CCCCGGACTC CGAGGCTGAG ACATACAAGC TTGTCACCTG CAACCCTCGC CAAGCACGTC GTAGGGCCGC TCGCAAACTG A
|
Protein sequence | MTTKSTPKDL IDSFPHSKLT PIATATTEPD YLSLHQLQYE INDNAETLSS TLGDGQHGHL FLVISETEYL EMTDGVPCIP PVQPPFDPVH AANATAPQII EANHQNDKRQ KLFDLYHNAI KINSLKPFPL NTLNLSVILH EALTKSLPSK SSLISGKILQ LFQQLDKGNQ FIIASGQVMD ERIIARIGYQ IIEKTGLFDL ASRDWHYKDE ADKTLANFKK HFQKANKDLA LTATSSSAGY HTANQSTVTK GKSYCWTHGI VHNTKHTSAT CEKQAPGHKT GATLHDKQGG STKTYQYTPP SSVAPNTPPL ASSPPFFPPD AIADTGCTGH FLSTNIAHIH CQPTVPGINV VLPDGRTITS SHITELNIPS LPPGARTAHI FPGLSNGSLI SIGQLCDHGC TATFTSDSVR IELNNTVVLR GGRSPYTRLW TLDSPVTPNP PATELHAPLH DKNFANHLGD HSGTLADRIA FVHASLFSPQ LSTWCKAIDK GRLTTFPDIT SAQVKRHPPQ SVPMVKGHLD QQRSNLRSTK PKVTLSASVD PDDINFDTNP VVQDPPAART QFLYADFAEV TGKIFTDPTG RFVTTSSSGN AYMLVVYDYD SNFIHVKAMK NRTGPEILSA YKRAHAMLSS KGLRPQLQRL DNEASTALQQ FMSSVDIDFQ LAPPHVHRRN AAERAIRTFK NHFIAGLCST NKNFPLHLWD CLLPQAIMTL NLLRGSRINP NLSSWAQLHG SFDYNHTPLA PPGIRVLVHE KPSIRRTWAP HAADGWYVGP AMNHYRCYRV WVKETTSERI SDTLTWFPSQ VKMPSTSSRD TIVAAAHDLA HALAHPSPAS PLSPLSVNER EALSQLSDIF SKAANPSASE TRSLNHFRGC RPPQPPFRSH FRGCLPRTPR LRHTSLSPAT LAKHVVGPLA N
|
| |