Gene Information |
Locus tag | PHATRDRAFT_41055 |
Symbol | |
ID | 7198860 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 248370 |
End bp | 249803 |
Gene Length | 1434 bp |
Protein Length | 456 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185073 |
Protein GI | 219129810 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0669426 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGCGG AGAAAGTTGA GAGCGGGCGA TTCTATCAAG CCGTGTCTCT TTCGCCAAAC TACGGTGGTG TCTTTTTTTC AAGTCAGGAA GAGCAAGGCT CTGATGGCGA TCCGTTCCGT TCAAAAAGTA CTACAATCGA TAGCGAAGAA ACACCTCTGT GCCAATCACA GATTCAGTCG AGGAGAGCCG GCACGAATAT TTTCAGATTG TTTGATTGGA ACGACGCGTG GGAGGCGATT CAAAAAAAAC GTGAAGCGAG CATTCTCGTC ACGTCGACGA CATCGACCAG CCACAAAAAG CGACGACGAC ATCAATGTAT GGCACTCTTC TTTGGCCAGG CGATTGCCCT TTGTGCTTCC AGTATGAACG CGTCATCTTA CACACTCAAC TATAAGTACG GAGTTCGAAC CTTTTGTTTC CAGATGATAT GGGTCTATCT CATTTTAGCC ATGCACTTGT TTGCGATACC GAAAGACGAT TCAGCAGTAG CCACAACAAT ACTGGTCAAC AGTCCTACCG TATCCATACA AAGACAGCAG TATACACTTC CCGGCACAGC AATCCGTCTA CAGATCCCTT GGTGGATTTA CCTCGGCATG TCTTTGTTGG ATGTACTGCC CAACTTTTTG ACACTGTTAT CTTTCAACTT TACTTCATTG ACCAGCACTA CCTTGCTAGG TTCGCTCACG GTCCCGTCAA CAATGTTTTT TTCTAGACAC ATTCTCGCAA AAGTCTTCCG GCCACACCAT GTCTTTGGTG TCATGCTCTG CATTTTTGGT GGGTGCCTGA CTGTTTGGTC TGATTTGGGC GATGTCAGTA GCGCTAGCAA TCCTATGGAT GGTGATGACC CACAACTGCA ACACCCCGAG TCATCACGTT TTTATCTGGG CGATTTGCTT GCCGTCACGG CGGCTTTGGC GTATGGGTTA GGAGACACCG TAGCAGAGTA CTCCATCAAA CACATTGACC GTAACGAATA CCTCGGGATG ATTGGCGTCT TTGGTTGTGT GTTGACAACC ATAGCTTTCT TGGCACGTGA ATGGTCTGAA GTGGAAAAGG TTACGACCTT GACTGTCGAA ATTCAAGTAC AAGTGTTGGG TGTGCTGGTG TGGTACGTCA CGTCAGTTGT ACTGTACTAT ATTGCCGAGG CCCGTTTTTT GGTATCATCG GATGCTACCT TGTTAAATTT GTCGATGCAA ACCACCAATC TCTATGCCAT TATTTTTTCA ATCATGGCCT ACGGAGAGGA ACCGTTTACA TTGTTCTACG TGGCTGTTGG GTTGGTAGTG GCCGGTGTCT TTGTGTACGA AGTCGGTGGG AGCCTCAGTT CCGGCGAGGA CGGGATCCAT GTAAGTAGAG CGATACAATT TCCACCAAGC CGCACCACCA CTAAAATGAA CTTTGAATGC TAGGTGCTAG TGTCTTCGAT ATGA
|
Protein sequence | MAAEKVESGR FYQAVSLSPN YGGVFFSSQE EQGSDGDPFR SKSTTIDSEE TPLCQSQIQS RRAGTNIFRL FDWNDAWEAI QKKREASILV TSTTSTSHKK RRRHQCMALF FGQAIALCAS SMNASSYTLN YKYGVRTFCF QMIWVYLILA MHLFAIPKDD SAVATTILVN SPTVSIQRQQ YTLPGTAIRL QIPWWIYLGM SLLDVLPNFL TLLSFNFTSL TSTTLLGSLT VPSTMFFSRH ILAKVFRPHH VFGVMLCIFG GCLTVWSDLG DVSSASNPMD GDDPQLQHPE SSRFYLGDLL AVTAALAYGL GDTVAEYSIK HIDRNEYLGM IGVFGCVLTT IAFLAREWSE VEKVTTLTVE IQVQVLGVLV WYVTSVVLYY IAEARFLVSS DATLLNLSMQ TTNLYAIIFS IMAYGEEPFT LFYVAVGLVV AGVFVYEVGG SLSSGEDGIH VLVSSI
|
| |
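The listed gene length can be cross-checked against the start and end coordinates, and the GC content can be recomputed from the gene sequence once the 10-character grouping is removed. The following is a minimal sketch, not the database's own method. The helper names (`clean_sequence`, `span_length`, `gc_percent`) are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which is consistent with the listed 1434 bp length.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character groups."""
    return "".join(grouped.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Start and end coordinates taken from the record above.
print(span_length(248370, 249803))  # 1434, matching the listed gene length

# Illustrative fragment only: the first two groups of the gene sequence.
fragment = clean_sequence("ATGGCGGCGG AGAAAGTTGA")
print(len(fragment), gc_percent(fragment))
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would give a value to compare against the listed GC content of 48%.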