Gene Information |
Locus tag | PHATRDRAFT_42854 |
Symbol | |
ID | 7196506 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1321814 |
End bp | 1324988 |
Gene Length | 3175 bp |
Protein Length | 768 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177269 |
Protein GI | 219111035 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
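The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes that `Start bp` and `End bp` are both inclusive, and that the coding sequence ends with one stop codon. The helper names `span_length` and `coding_length` are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1321814
END_BP = 1324988
PROTEIN_LENGTH_AA = 768


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein (3 per residue, plus an optional stop codon).

    Counting one stop codon is an assumption, not something the record states.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
non_coding = gene_len - cds_len

print(f"gene span: {gene_len} bp")
print(f"coding bases (with stop): {cds_len} bp")
print(f"remaining non-coding bases in span: {non_coding} bp")
```

Under these assumptions the span length agrees with the listed gene length of 3175 bp, and the bases left over after the coding portion are consistent with the gene being marked as spliced. They may be introns, untranslated regions, or both; the record does not say which.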
Plasmid Coverage Information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.652023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTGTTCCCA CAAGGGTGCA AGTGCTTCTT CTTTACATTA GAAACTTTGT CATTGTAAAA TTTTTGCACA TGAGTTCTAC TACAACACAG AGGCGAAAAA TTGGCTCGGT CAAGTCAGGA GATTCGTTGA CGGTACAGGT AGAACTGGAG GAAAATCGCG AGTCCACCTC GGTCCCAATG GTGACAGCAA CAGTTCGTCG ACGATTCTGT GGTACCGGAC CTTTCGACAG ATATTGGTTG AACCTGGATT GTTGTGGCCT CTTTTGCGCC CTGATAACTT ATTGTCTGCA CGCCTATGGA GTGTATGCTG TCTGCTTCGT TCTGATCCCA CCCTGGATGA GCACTACGAG CGAAGATGGC ATCCGGAGTT TGAGCATAGC AGGCATCGGG AATCGCATCG GGTTTTCTCT GTTGGCAGCA TTGGCGGTTG CGGCTCATTT CAAAACAATG ACAACCGATC CCGGTACGGT GCCACCCGAT GCGCAACCGC TTCCCGAAAC CGAGGAGAAA ATAGAAACTG AGGAAGAAAA GCAATTGCAA AGTTTGATGA TTATGCCAAC TCAAAAGGGA CGTCGACTTT GTCGTCGTTG CAAAGCGTTC AAGCCGCAGC GGGCACATCA TTGTAGTGTC TGTCGCCGAT GCGTTATTAA AATGGACCAT CACTGGTACG TGTTTTGGTT GTCGATTTTT TCCCTGCGAC TGACTCTCCT TGTTTGCTAA CGCGACATTA TATCCATTTT CTTTACGACA GCCCCTGGGT CAAGTATGTT TTGTTATATC CTCCTTCATT TCTAGCGTTT GGCAGTAAAA AACAAACTAA CCCGATTTGC TTCCATTCAC AGCAACTGCG TAGGTATTGG CAACCACAAA TACTTTCTAC TATTTGTGTT CTACACCTTT TTGACCTGCA CGTATTCCAT GGTGTTTGTC ATTACTCGAT TTGCGACGTG TGTGTCACAC GATACGACGG GCGGACGTCA CAATCGCCAC CATATTGCCT GCTTGGATCA CCCTACCCAG ATGCTTACAG TTCTCGGTCT TTTGATCGAA GCTTTGCTCT TTGGAATGTT TACCTCCTGC ATGATGTATG ATCAATCCGA AGTAATCCGA TCCAAATTGA CACACATTGA CCGTCTGAAA GGTCTCGATA TTGGTGGTTC CTTGGAAGGC ATCACCGAAG TCTTCGGAAT TGGCAGCTGC AGTCGGGATG TCAACCACAC AGGATTTCGC TGCGATTGGT TGTCCCCCTT TCGTCGAGTT TGCTACCCAC CCTCAGCGGT GGACGAAGTA ATGGGATTTT GCCGACTCGC GAGAAAGGGC ACGTCCGAAA CAGAGTTGCC GGCCCGGTCG AACGGTTCCG CGTTGCGTAA GGTGGCGGAC TTGGTATGAC ACAGCAGAAG TATTGCATAC TTTGGTTTTT TTTCAAGAAT GGTTCCTGAA TACACAAAAT TAGTTCACCA ATCACCTTCT ATAACCTGAG TTCAGAGAGT GAAGACTCCG ACGTTTGTCT CTCGTCTAAT CAGAGTGTCG GCCAATAGTT TGAATGCTGA AATATGCGGA ACGCATAGTT TCTCATATCC AACTATGATA GGTTCATTCG TCGGAATCTA TTGTAGCAGC GCCGGCCAAT GCCGGTGTCG TTTTGAGACA GGAGGGCGCT CTCAAATATT GTTCAATAAC AATACTATAG TACTGACTCG CGCCAACGAC TACGCCGGCC GGGGCAGCGC TTAATTTCGC CCCGTCACAC AACCGGACCG GACACGTTAC AGGGGGCCGT TCGTGGTGGT ACGGATCTGT 
TTTTCCAGAA GAGTGAGAGA ATCAAGAATC TATTGACTAT GTGACCATGA AATTGTGCTC ACTGTCAAAT CCGCATCGTC TTCTCTTAAG ACGGACTTTA CCTATCTAGG TAACAAAACG GCCACCGCTG CCACCAAATT TTTGATTAGT AGAACGGAGC TGATCGAAAA AAATGAGTGC TCCTGGTACG TAAACCAAAC TCGCTCGTTT GGTGCGTAGG TAGTGTCCCG CTGCCGCCTC GACTCACTAC GCGCATCGCA TTGTTTTTTC TGTTATGTTG CAGTCCCAAT GAATATGGAG GAATGGCAGC GGCGAATTCG GGCGGATAAA GAAAAGGAGC GCCAGCAGAA ATCGCAGTCT GCCGAAATCC TTCAAAGCTA CCGGGGTGGC GTCAAGGACG AGGATCTGAA ACTTTCGGCC CTGAGGCAAG AAGAGCGGGA AAAGCATTTG GACGCCGAAA GGTTGATGCA TAGTTACCAA AATAATGAAC GGATCGAAGT CAGGCAGCGA CCTGTTCGGG TAGATCCGCA GCTTGCGTCA CCACCGGTAC GAGCGGAAAG CGATCGGTCG GTCAACGTGA CACCAGGATC GGTATCGGCC ATGGCGGGGA GATTTGCGCA GGTATCAGAC TCTGACAATA GTTTGTCTGT GTCACCGTCA GCACGGACAA GAAAAACAGT CGTTGTGGAG GCTCTTCCGC CGTTTGGGGT CACTTCCCTA GTGGAAGAAG GTGCGGCGAA GGAGACAAAT CTCACCAACT CGACGGCTCT ATCGCCAGTG CTGGATGCAT TCGGAACAGA AAGTCAAGAA TACGACCAAG TCGCTTTAGA GAACGCTGCA GATCTCGCAG CATTCTCATC CGCGACCGTT CCAGAATCCT TTCCGAGAAT GATTCGATTG GATGTGCTAA TTTCTTTTGG ACTGGTCACG TCTTCTGAGA ACCCTATTTT AGACGGCTAC GTTAAGGCGG CCGGCCAGAT CGTCCAGTGG CGTCTGACGG AAAATTCTGA TTTGGGGAGA AGTGTAACGT ACAATACTGA TGTTCCTGCT TTTATCAAAA AGTCGAATTG GGACGGTACG TCAAGCTAGG ATGAACGCGC TGTAGATGGT AGGCCTTTAT TCTCACCTGT TCTATAATCC TCCTTCGAAT AGACTTCTAC GTGGACTCGT CGGGTCGCTC CGATGTCCGG CGCTGCGTAG CGGTGGCGGC TATCCCACTA TTTCTGACGA ACGGATTTCC TGTGGATACC GTCAAAGATG ATATTGTTCG GAGTTTGCAA CACTCGATTC ATTCAGGGGA ATTTGTCGAG CTTGCGCAAG CTTTCCGATA GCAGCACTCT AGATTACTCT TAAAGTATGG GGAAAAATTC ACCGG
|
Protein sequence | MSSTTTQRRK IGSVKSGDSL TVQVELEENR ESTSVPMVTA TVRRRFCGTG PFDRYWLNLD CCGLFCALIT YCLHAYGVYA VCFVLIPPWM STTSEDGIRS LSIAGIGNRI GFSLLAALAV AAHFKTMTTD PGTVPPDAQP LPETEEKIET EEEKQLQSLM IMPTQKGRRL CRRCKAFKPQ RAHHCSVCRR CVIKMDHHCP WVNNCVGIGN HKYFLLFVFY TFLTCTYSMV FVITRFATCV SHDTTGGRHN RHHIACLDHP TQMLTVLGLL IEALLFGMFT SCMMYDQSEV IRSKLTHIDR LKGLDIGGSL EGITEVFGIG SCSRDVNHTG FRCDWLSPFR RVCYPPSAVD EVMGFCRLAR KGTSETELPA RSNGSALRKV ADLRLISPRH TTGPDTLQGA VRGGTDLFFQ KSNKTATAAT KFLISRTELI EKNECSWYVN QTRSFVPMNM EEWQRRIRAD KEKERQQKSQ SAEILQSYRG GVKDEDLKLS ALRQEEREKH LDAERLMHSY QNNERIEVRQ RPVRVDPQLA SPPVRAESDR SVNVTPGSVS AMAGRFAQVS DSDNSLSVSP SARTRKTVVV EALPPFGVTS LVEEGAAKET NLTNSTALSP VLDAFGTESQ EYDQVALENA ADLAAFSSAT VPESFPRMIR LDVLISFGLV TSSENPILDG YVKAAGQIVQ WRLTENSDLG RSVTYNTDVP AFIKKSNWDD FYVDSSGRSD VRRCVAVAAI PLFLTNGFPV DTVKDDIVRS LQHSIHSGEF VELAQAFR
|
| |
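The sequences above are printed in space-separated blocks of ten characters, so the spaces need to be removed before computing anything from them. The sketch below is illustrative only and is not the database's own method. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the inputs are just the first two blocks of each sequence, so the results do not represent the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group a sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence and of the protein sequence above.
gene_fragment = "TCTGTTCCCA CAAGGGTGCA"
protein_fragment = "MSSTTTQRRK IGSVKSGDSL"

print(f"fragment length: {len(clean_sequence(gene_fragment))} bases")
print(f"fragment GC fraction: {gc_fraction(gene_fragment):.2f}")
print(f"protein fragment length: {len(clean_sequence(protein_fragment))} residues")
```

Applying the same helpers to the full gene and protein sequences would give values to compare against the `GC content` and `Protein Length` fields.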