Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_769 |
Symbol | |
ID | 7202840 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 382519 |
End bp | 384318 |
Gene Length | 1800 bp |
Protein Length | 550 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181897 |
Protein GI | 219123158 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.945125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATCTCGGCG GAATGGATGA AGTTGTGAAA AATATCCGGC AGCTGGTTGA ATATCCATTG ATACGACCCG AGTTGTACAG CCACTTGGGC GTCGACCCAC CCCGTGGTGT TCTGCTGCGA GGGCCACCAG GAACCGGGAA GACTCACTTG GCAAACGCAG GTACGGTCAC CATCCTCTAC CTTGTTGCCT TTGATACAAT TTTTCTCACC TCTTCAACTG GTAATTTTCT GTTAGTCGCT GGTCAGCTCG GGGTTCCTTT TTTCCGCGTC TCAGCACCTG AATTGGTATC AGGCATGTCG GGAGAGTCGG AAGGACGCAT TCGCGATCTC TTCAGAACAG CGTCTTCTAT GGCTCCGGCC ATAATATTCC TGGATGAATT AGATGCAATC GTGCCAAAAA GGAGCGAAGC AGGATCCTCT CGAGGGATGG AGAAGCGCAT GGTTGCACAA CTTCTTACGA GCATGGACAT GCTGGCGCCC GTCAATAACA ACAAAAATTC GACTGTTATT GTGCTCGCGG CAACAAACCG GGCAGACGCC ATGGATCCGG CTCTACGACG AGCCGGAAGA TTCGATAAAG AGATCTCATT GGGGGTTCCC GACGAACAAG GCCGTGAGCG TATTTTAAGA GCAATGACGA AAGGAATGCG TCTGTCGGGG GATTTTGATT TCAAAGTGCT AGCACGAAAA ACGCCAGGGT TTGTTGGGGC TGACGTACGG AGTTTAGCTA AAGAAGCAGC CGTTCTCGCC ATCAATCGAA TTTTTAAAGA TGTTCTTAAA GATCAAGATT CCGTCTCAGA CGAGCTTGTT ACTGCGTTCG GTGAAGTAGA CAACTCGGAC ACCAAAGACC AAGCCATTGT GAGTCCTATG ACAGCCGAGC AGATGGAGCC ATTGTTTGTC ACCATGGACG ACTTTTTATG CGCAATTCCA ATGGTACAGC CCTCAAGTAA GCGAGAGGGT TTTGCGACCG TTCCAGATGT CACCTGGGAT GATATTGGTG CCCTACATTG TATCCGTGAG GAGCTAACCA TGTCTGTCTT GGAGCCCATC CGCAATCCGG AGAAATTTCA AGCCCTTGGT CTCCCCTTAC CAGCTGGTGT AATGCTGTAC GGCCCGCCCG GTTGTGGAAA AACGTTGCTG GCTAAGGCGA TCGCCCACGA AAGCGGCGCC AATTTTATCA GTGTGAAAGG ACCAGAATTA TTGGACAAGT ACGTCGGTGA AAGCGAGAAG GCCGTTCGGC TAGTCTTTGA ACGGGCCCGA AGTTCTAGTC CTTGCATTGT CTTTTTCGAC GAATTGGATT CATTGGTTCC ACGACGCGGG AGCGACGCTG GTGGTGGAGG TGTTACAGAG CGTGTGGTCA ATCAACTTTT GACGGAAATG GACGGTTTGG AAAGTCGTCG AAGTGTATTT GTAATTGCCG CTACGAACCG TCCGGAATTG ATTGACCCAG CAATGATGCG ACCTGGACGA CTCGACAAGC TCCTTTTTGT TCCACTTCCC GGTCCCGAGG ATCGCGTACT GATTCTCAAG GCCCTTTGTA CGGGCATCAA CTTGGCGGCG GACGTTGATA TGGACCATAT CGGTCGCAGC CCACGGACCG ACGGCTATAG TGGTGCAGAC TGTGCGGCGC TTCTACGCGA AGCTGGTTTA GCGGTTCTGA AAGAAGACGC AACCGCCTTT GCCGCGGGCA AACCTGACTC GGTGGAGCTC AAAATCACGT CGAAGCATTT CGATGCAGCC TTCCACTCGG TCATGCCCTC GGTGTCGAAG AACGACCAAG CACGGTACGA ACGCATTCGC
|
Protein sequence | DLGGMDEVVK NIRQLVEYPL IRPELYSHLG VDPPRGVLLR GPPGTGKTHL ANAVAGQLGV PFFRVSAPEL VSGMSGESEG RIRDLFRTAS SMAPAIIFLD ELDAIVPKRS EAGSSRGMEK RMVAQLLTSM DMLAPVNNNK NSTVIVLAAT NRADAMDPAL RRAGRFDKEI SLGVPDEQGR ERILRAMTKG MRLSGDFDFK VLARKTPGFV GADVRSLAKE AAVLAINRIF KDVLKDQDSV SDELMEPLFV TMDDFLCAIP MVQPSSKREG FATVPDVTWD DIGALHCIRE ELTMSVLEPI RNPEKFQALG LPLPAGVMLY GPPGCGKTLL AKAIAHESGA NFISVKGPEL LDKYVGESEK AVRLVFERAR SSSPCIVFFD ELDSLVPRRG SDAGGGGVTE RVVNQLLTEM DGLESRRSVF VIAATNRPEL IDPAMMRPGR LDKLLFVPLP GPEDRVLILK ALCTGINLAA DVDMDHIGRS PRTDGYSGAD CAALLREAGL AVLKEDATAF AAGKPDSVEL KITSKHFDAA FHSVMPSVSK NDQARYERIR
|
| |