Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48952 |
Symbol | |
ID | 7195369 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 127571 |
End bp | 130650 |
Gene Length | 3080 bp |
Protein Length | 791 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183682 |
Protein GI | 219126894 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCAGTCCAA TCCGAACCAA GGCCCGCTTG TGTATTGCCG AGAGAAGAAG GGTCTCGATT GAGAAGAGCA CTTCGTGGTG CATTTTGTTA TCGCCTCGCT CCATCGGCCT TTCATTCGAG CTACTGGTTC CCCAGTATTC TACATGATAT ATTTTGTAGT TTTTTACGGT AGCTTGAATT CGATTGCAGA AGTATTTTTA CGTTGAGTAT ACGTATTGTA TTAGCTCTTT GTCGATAAAA TCCTTTTATT TCCCACAACA ACATGGAAAA ACCACCCCCG ACTGTCGCAG ATCTGTAAGT CTCTGTAGCG ATCGCAGGCG TCTTCGCTCT TGTCTCTCGT TTGGCCGATT GTCTCATATT CTCTCCATTG TTCGCCAATA TTTCTTGTCC TCCGCTACAC AATTTTGCTG CTGTGCAGCG TCTTGATCGG TGGCGGTCAC GCGCACGCGC ACGTTCTGAA AATGCTCGGC ATGAAAAGCA CCGCGCATTC CTCACGCAAC ACGGTATACG GATCACCCTC ATTGCCAAGG ACATACACAC CCCGTATTCC GGTATGCTTC CCGGCTTCGT GGCGGGACAC TACGATCACG ATCAGATTCA TCTGGATCTG AATCAGTTAT GTCACGTGGC CAATGCACGA CTCGTTCACG CGGCGGCCTG CAAAATTACC TACCGCAACG GTGGCGGTGG ACTCGTCTAC CTGAACGATG GCCGTCCCCC CATACGGTAC GATTGCGTTT CCATTGATAT CGGGAGTGCC CCGGCGTATG GTGACGTTGT GTGCCAGCCC GGTGTGGTAC CAGTCAAACC GATCGCCAAT TTTTGCACAG CTTACCAAGC ACTCGTCAAC CGTTGGGAAA ATGAAACACC ATCGTCCACA GCAGCAAATG ACCAGGACAC GACGTCAACC AACGGCACAA GGGCAGAACC GCCAAGCCCC TACGTCGTGG CGGTTGTTGG TGGTGGTGCG GGAGGGTTGG AACTCGCCTT GTCAGTTCAA TATCGGTTGC GAAGCATCAA CCCAAATGCT CCGTTACGCC TCTTGGTTGT CACGCGCGGG AAGACAATAC TGGAAGGACA CAATCATCGA GTCCAGGCCA AGTTCCAACG CATTATGCAG GAAAGAAATA TCGAAATTTA CTACCTCGCC ACCGTTGTAA AGGTGGCAGA AGACTCATCC ACAATGCGCA AGCGCCTTAT TCTTTCGCCA GAAGGCGCCG CTGTGCACGG CCGAGACTCA TTCGTCGTCG ATGCTTGTTT ATGGTGCGTC ACGGCGGGTG TCGCGCCGTG GCTGAAAGCT GACACGCCAT TCGCCACGAC GAAGCAAGGA TTTCTCCGTG TCCACGATAC GTACGAATCA ATTGAACATC CTGGAGTCTT CGCCGCCGGC GATTGCTGTC ACGTAGACAA GCATCCGAGA CCCAAAGGTA TGGCTAGTCT GTCTAGATGC AAATCTTAGC ACTCATGGAT CTAGTCATTC CTAACTATTT TCACATTGTC TTACATTCAA ATCCAGCTGG AGTATTTGCT GTACGGGCCG GACCATACTT GCTTGACAAC TTGCTGCGAT ATGTTTCTTC AAAGCCCCTT ATTTCGCACA AACCCCAAAG TCATTTTCTG GGTATTTTAG CGACCGGGAA CAAGTATGGT GTAGCGTCGA AATCATGGTG GTTAGCCACG GAAGGATCTT GGATATGGAC CTGGAAAGAT TACATTGATC GTACCTGGAT GGCCAAATAC AGTACTGATC TACCCGATCT TAAAGACATG ATGGCGAATC AAAAGCACTC AACTCAGCAA AGCGGCCAAC AAACGAATGC TTTTGTTGCT TCTAAAGGGG ACGAAGTACT GCAGGCCTTT ACATCAGATC CAATGCGGTG TGGAGGGTGC GGCGCCAAAG TTGGGGCCAC GATCGTTTCA CGCGTCTTGG CAGCTGTCTA CGAACGGCAA ATTGAACGAG CTAAATTATT GGGGCTGCCC CAGCCGTCAC GGATCGATCA CGACGATGCA GCGGTTCAGA TACTTCCAAA TAAAGCAGGT GGCGCCATTG TTCAAACTAT TGACTACTTT CGAGAAATCG TAAAGGATCC ATTTACCTTC GGGAAAATTG TGGCAGTCCA TGCTCTAAGC GATATCCACG CTATGGGGGC GACTCCCCAA ACAGCCATGA CACTGGCTGT TGCACCGTTT GCTGCTGACG AAGAGGTAAC AGAATCGACT CTATTGCATT TACTTAGTGG AGTCAGTGAT ATTTTGCAGG ATGAAAATGT GCAGCTTGTT GGTGGACACA CTTGTGAAGG ATTGGAGTTA GCATGCGGAT TGAGTGTCCA AGGATATACG GATAATCCGA AACTGCTCTT ACGTAAGCAA GGTGGTGCAA TAGGAGATAA AATCGTCTTG ACCAAGCCAA TAGGTACCGG GGCACTCTTT GCAGCCGACA TGAGGGCTCG TTGTAAAGGC TCATATATGT CGGAAGCCCT CGACAGCATG ATTCACAGCA ATTGCCATGC AAGCCAAGTT GCGATGCGAG CAAAGGGCAT CAGATCGTGC ACTGACGTCA CTGGTTTCGG TCTCATTGGT CATTTGCTCG AAATGCTCAT GGCAAACGAA ACGGTTAAGG AATTGGACAG CATCGGCGCG GTAGTAAACA TTGGCGATAT CGATTTTCTA CGCGGTGGGT TAGAGGCATC TGCGAATGGC ATTTTTTCGA CACTCCAATC ACAAAACGGG CGAAATCGAC GCGCTATTGT CAATCACACT GAAGCCGCTG AAAAGTATCC GGTCAAGTAC CCACTATTGT TCGACCCTCA GACTGCTGGC GGTCTAATGT TTTTCGTCGA TGCTCTGAGT GCTAGTGAAT TTCTAGCTGA ACTACGTGCG GCTGATGTGA ATGCCCATAT AGTGGGAGAG CTAGTTTCAT ATCCTGCAGA AAGTAATGCG GCCGCCGGTT TCTCCGAGAG TGTTTGCACG ATTGGAAGTG GCGGAGCAGT GACGGGCAAA AGGATCCGTG TGCGGTAACA GCGGACGCCG GACCAATTTT TGTTCCGCGA TTCCACTGTG TTAAGAAAGC AGTCTCTACT
|
Protein sequence | MLPGFVAGHY DHDQIHLDLN QLCHVANARL VHAAACKITY RNGGGGLVYL NDGRPPIRYD CVSIDIGSAP AYGDVVCQPG VVPVKPIANF CTAYQALVNR WENETPSSTA ANDQDTTSTN GTRAEPPSPY VVAVVGGGAG GLELALSVQY RLRSINPNAP LRLLVVTRGK TILEGHNHRV QAKFQRIMQE RNIEIYYLAT VVKVAEDSST MRKRLILSPE GAAVHGRDSF VVDACLWCVT AGVAPWLKAD TPFATTKQGF LRVHDTYESI EHPGVFAAGD CCHVDKHPRP KAGVFAVRAG PYLLDNLLRY VSSKPLISHK PQSHFLGILA TGNKYGVASK SWWLATEGSW IWTWKDYIDR TWMAKYSTDL PDLKDMMANQ KHSTQQSGQQ TNAFVASKGD EVLQAFTSDP MRCGGCGAKV GATIVSRVLA AVYERQIERA KLLGLPQPSR IDHDDAAVQI LPNKAGGAIV QTIDYFREIV KDPFTFGKIV AVHALSDIHA MGATPQTAMT LAVAPFAADE EVTESTLLHL LSGVSDILQD ENVQLVGGHT CEGLELACGL SVQGYTDNPK LLLRKQGGAI GDKIVLTKPI GTGALFAADM RARCKGSYMS EALDSMIHSN CHASQVAMRA KGIRSCTDVT GFGLIGHLLE MLMANETVKE LDSIGAVVNI GDIDFLRGGL EASANGIFST LQSQNGRNRR AIVNHTEAAE KYPVKYPLLF DPQTAGGLMF FVDALSASEF LAELRAADVN AHIVGELVSY PAESNAAAGF SESVCTIGSG GAVTGKRIRV R
|
| |