Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_26387 |
Symbol | |
ID | 7199857 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 42771 |
End bp | 46028 |
Gene Length | 3258 bp |
Protein Length | 1053 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178916 |
Protein GI | 219116242 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.421794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGTCG AACAGCTGCA CGTTGTCTTG CAGCAATCGT TTTCGGCCGA TGCGTCGATT CGAAACCCTG CCGAACAAAC CATTAAAAAC CTCAAAAACT TGCCCGGTGC CGTCAATCTA CTCTTGCAGG TCGCTACGGA AAAGCAGGTA TGCTGAACAC AGCTTAACGG AGCTCTGCGA CGAGTGTAGC CCTCATCCGT CTCACGTAAT CGTACATAAT GTACGCGTTT CTTCCTCATT GGCCAGGTCC GTTTCGAAGT CCGACAAGCC GCTGCCATTC AACTCAAAAA TATTTGCCGC GAAGGCTGGG CGGAACGTAT TCATTACGCT CCGTATGCTG AAGAAGCCAC GAAACCAGCT CTGCTCGCCG ATGAAGACAA AGCAGTTGTG AGGGTCGGCC TGCTCAAGAC GCTCCTCGAC GAACCAGAAA AGAGTATCCG AGATTTGCTC GCGGAAACCT TACACACGGT GGTGATCCAC GACTTTCCCG AAAAATGGCC TCAGCTCATT CCCACGCTCC TCGCGAGTAT TCAAACGGGT GTCGGTGACA TGGGAAAACA CGGATTGCAG GTACACAATG CTCTCCTTGC ACTCCGCAAA GTTTGCAAGC GATACGAATA CAAAAGCAAG GAGCAACGCG GACCCCTCAA CGAAATTGTC CAATCGAGTT TTCCGCTCTT GCTCCCGTTA GCGCAGCAGC TATCTGCCGA AAACGAAAAC TCGCTGGAAG CCGCCATGAT GCTCAAACAG ATTCTCAAAA TTTTTTGGTC CAGTACGCAG TTCTATTTGC CCGGTGGCGA CGGATCGGAA ACGTCCTCCA TTGGGTTGGC ACGACCGGAA CAGCTGCAGC CTTGGTTTGA TGTTGTGAGA AGCGCTTTAC AAAAGCCCCT ACCAGAAGCG TCGACGGGAC TTGAACCACG TAACCAGCCA GTCGATGTCG ATGCTCGCAA CGCGTGGCCT TGGTGGAAGG TTAAAAAGTG GTCAGTGCAA ATTATGAGCC GACTGTTTTC TCGCTACGGT ATTCCAAGCT ACGCGGACGA TCAGGAAGCC AAGGATTTTG CCGTTTTCTT CAGTCAAAAC GTGGCGCCGC AATTCTTGGG GCCTGTCTGC GAAACACTAA ATCTAAGACC CTCGGGAAGT TTCTGTACCG ACCGGGTAAT CCACTTGTGT TTGACCTTTG TGGACTTGGC GGTCGAGCTA GCCAGTACCT ACAAGTTGCT GAAGCCACAT TTGGACTTTC TCTTGTATCA GGTGTGCTTT CCAACAATGT GTTTGACTCA AGAGGACATT GACTGTTTCG ACAACGATCC GGTGGAATTT GTGCACAAGC AGAACAGTCC CTTGGCCGAC TTTTACGACC CGCGCATGTC CGCGGTCACT CTCGTCACCG ATCTAGTCAA ACATCGTGGA CAAGACGTAA CTCAGAATTT GTTGGGACGT ATGACGGCCA TTTTGCACAC TTACAGCCAA GCAGCCCCTG ACCAAAAGAA TCATGTGGAA AAGGACGGTG CCTTATTGGT GTTTGGCTCG TTGTCGAAGA ATTTGTTGGC AAAAGAAAAG TATGCTGCCG AGCTTGAAGG CTTATTGGTA TCCTCAGTTT TTCCGGATTT TGGGTCACCG GTCGCCTTCT TGCGATATCG TGCGTGCTGG ATGGTACAGC AATATAGCAC TGTCCAATGG TCCGACGATG GAGCTCATTT GCGAACTTTA CTCGAAATGG TTCTAAACCG CTTGAGCGAT CCCGCTCTCC CCGTACAGAT TGAGGCCTCC AAGGCCCTGC GATTTTTGGT AGAAGCTGAT GGCGCGGAAG AAACTCTTCT TCCCGTCCTA CCTCAGCTAT TGACAGAGTA TTTTCGTATC ATGAACGAAA TTGGCAACGA CGAAGTTGTG TCTGCCTTAC AGGCTTTGCT CGATAAGTTT GGCCGTCACA TTGAACCACA CGCAGTCGCT CTTGTAACAC AATTGACGAG TGCCTTTTCA CAATATTGTA CAGCCGGGGA AGACGACGAT GATGACGACG CCGCCATGGC CGCAGCGCAG TGTCTTGAGT GCGTTGCGAC GGTTCTAAAA GGCGTTTGTG GGAAAGCTTC CATGCTGAAA ACTCTCGAAC CACTACTGAT GCCGCTGGTC TTGAAAATTC TAGGGAGCGA CGGTGATTTT ATTGAATATT TGGAATGTGG ACTCGATATC TTGACTTTTT TAACTTTCTT TCAAGAACAT ATTTCGCCAG AAGTCTGGCA AGCCTTTCCT TTGATATATT TGGCTTTTGA TCAATTCGCC TACGATTATC TGAACATGAT GGTACCTTGC TTGCAGAGTT ATATTGGCAA GTCAACCAAT ATTTTTTTAA CCGGTACTGC CCAGCTCCCT GAAGGAGACA TTCCGTATAT TGATTTGATC ATCAGCATAG CCGCCAAGAC AGTCACGAAC GACCGCGCTT CTGAATCAGA ATGCCGGTAC GCGCTTAGTC TGTTCATGAC GATTCTTCAC AATTGTCCTG GCAAGGTAGA TGGATACATT CCATTTATGA ACGAGATTGC GCTCGGCAAG CTCGGACAGC AAGTCAATAC CGAGATTCCT TTGACTCGGT TTTCAATATT TCAGGTTCTC GGGTCTGCGC TCTACTACCA GCCTCAGCTT GAGTTGATGG AGCTCGAAAA GCGAAGCGTC ACACAACAAG TTTTCACGCA ATGGATAATT GATGCGGACA AAATGGAGCG ATGGCTCCCA AGAAAGTTGA CCGTGCTTGG TTTGTCTTCC ATTTTGAGCC TGCCCACGTC GACCTTGCCT GCATCAATCA TCAGCTTGCT ACCGCAACTA ATTCACATGG CGTGTAAATT GGCACTCGTC CTCAAAGCTG AGGCCGAGCA AACCGAGAAG GATGCCGACC AACTAATCGA GGAAGCACCT GAAAGGGATG ATGGCGTTGG CGACGTTGAT CTAGGATTCG ACGAAAGCCA AGATGTGACA AACGAGGTAG ACGAAGCTTA CAGAAAAGCG CTGCAAGGAG TCTCAGGCTG GGACGATGAC ATGGCAAAAT TCTTACTCGG TGGTTGGGAG GACGAAGGTG ATGACATTGA CGAAGACTAC AGCTCGCCAA TAGATAAAAT TGACGAGCTC ATTCTGCTGA ATGACACCAT CAAAATGGCT TTTCAAAGAG AACCTGAAGC CTATCAACAG ATTCAGTCCG CCCTTCCGCC GGAACCTGTT GCGGTGGTTC AGAATTTATT TGCCAGCGCC GATATCGTAC GAGCGCAA
|
Protein sequence | MDVEQLHVVL QQSFSADASI RNPAEQTIKN LKNLPGAVNL LLQVATEKQV RFEVRQAAAI QLKNICREGW AERIHYAPYA EEATKPALLA DEDKAVVRVG LLKTLLDEPE KSIRDLLAET LHTVVIHDFP EKWPQLIPTL LASIQTGVGD MGKHGLQVHN ALLALRKVCK RYEYKSKEQR GPLNEIVQSS FPLLLPLAQQ LSAENENSLE AAMMLKQILK IFWSSTQFYL PGGDGSETSS IGLARPEQLQ PWFDVVRSAL QKPLPEASTG LEPRNQPVDV DARNAWPWWK VKKWSVQIMS RLFSRYGIPS YADDQEAKDF AVFFSQNVAP QFLGPVCETL NLRPSGSFCT DRVIHLCLTF VDLAVELAST YKLLKPHLDF LLYQVCFPTM CLTQEDIDCF DNDPVEFVHK QNSPLADFYD PRMSAVTLVT DLVKHRGQDV TQNLLGRMTA ILHTYSQAAP DQKNHVEKDG ALLVFGSLSK NLLAKEKYAA ELEGLLVSSV FPDFGSPVAF LRYRACWMVQ QYSTVQWSDD GAHLRTLLEM VLNRLSDPAL PVQIEASKAL RFLVEADGAE ETLLPVLPQL LTEYFRIMNE IGNDEVVSAL QALLDKFGRH IEPHAVALVT QLTSAFSQYC TAGEDDDDDD AAMAAAQCLE CVATVLKGVC GKASMLKTLE PLLMPLVLKI LGSDGDFIEY LECGLDILTF LTFFQEHISP EVWQAFPLIY LAFDQFAYDY LNMMVPCLQS YIGKSTNIFL TGTAQLPEGD IPYIDLIISI AAKTVTNDRA SESECRYALS LFMTILHNCP GKVDGYIPFM NEIALGKLGQ QVNTEIPLTR FSIFQVLGSA LYYQPQLELM ELEKRSVTQQ VFTQWIIDAD KMERWLPRKL TVLGLSSILS LPTSTLPASI ISLLPQLIHM ACKLALVLKA EAEQTEKDAD QLIEEAPERD DGVGDVDLGF DESQDVTNEV DEAYRKALQG VSGWDDDMAK FLLGGWEDEG DDIDEDYSSP IDKIDELILL NDTIKMAFQR EPEAYQQIQS ALPPEPVAVV QNLFASADIV RAQ
|
| |