Gene Information |
Locus tag | PHATRDRAFT_40634 |
Symbol | |
ID | 7198563 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 22062 |
End bp | 25263 |
Gene Length | 3202 bp |
Protein Length | 1038 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184717 |
Protein GI | 219129062 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0207387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGACG CTGTGTTCTT CTCTATCGTA GCTCTCGCCA AGGCGGGCAT TGAGAGCTGT CGAAACGCTC AAATATGCCA GGATGAGGCA GGCCGCATCG GCAAGCGTCT GACGATAGTG GTCGCGCGAG CGCATGAATG GGGAGCGGTT TGTGCAAGTG CGCGCCTGAT TCATTTCCAC GAAGTCGTGG AGAATGTCTT CTTGTGCTTA CAAGCCGTTA CATCGCCTAG AAGCAAGCGG TCCTCATGGA ACAAAATGTT CAAGTCTACG CTACAATCCC AAACTTTGCT CGACAAAATC CTCGAAGCAG AGAGTCAGTT GAATACTGCT ATCAATGATC TACAGATGGA GCAATCCAAT GCCATCTTTT CACAGCTGGT TGACGTCTCA AAAGGAGTTG CAGAATTGCT CGACCAGTTT GGCACTCTTG CAATGAGCAA ATCGAATCCT TCCGTGACAG TACAGCAGCA ATTTGATAAG GTCCTGGCGG ATGCACAGAC ACAAGCCCCC GAAGTCGCCG TTTCTATCCC CAGTGACCGA ATCCACCATC CCGTACAAGA GTACAACTCT CTCGACTGTG TGGGAGATGA GGTTGCCTTA TCTCCACAGC AACAAAAGAT AGCGTTTCGT CCATCCAAAG AGGATGTGCT TGCTATCTCG CTCAAGGCAT CATTGCTGGA GTTTAGCGAT GACCAAAAGA ATCTTCTGGG CGGTGGAGGA TTTGCGGAAG TCTTTCGAGG GACCTACAAC CACCGGCCAG TTGCGGTCAA GCGCCTCAAG GCGTACCATG GAGATGTAGC GTCTCTCTCT CTCTCTCAAA TCGCCCGTGA TGTGGAACGA CTCGCCGCCG AAGCCATTTT GACGCACAAG TGCAGCAAGC ACTCCAATAT TATCCACGTT ATTGGATGCA TCACCGTATT GAGTGAAGTC GAGAGACCTC TCATTGTCAT GGAGCTAATG CATACAACAT TATTTGATGC TCTCCATGAT CGAAACCAAA AGGATGCTAT GGGATTTTCT CGTCGGCTCT TTCTGTTAAA AGGTATTGCT GGAGCCTTAG AGTTTCTTCA TCTGCAAGGT ATTGTTCACC ATGATATTAA GTCTCTGAAT ATCTTGCTGA ACAAACAATT GACAATTGCC AAGTTGGCTG ACTTTGGAGA GTCGAAAGTA AAAGGCCTCC ACACCACGAA ACTCCGTCTC AGTACAATTT TGGCCACAAC AAGCCACCAG GGCAACCAGA TAGCAGGTAC AGCTGCATAT CAAGCACCAG AAATCCTCTC GGAAGAAGTG CTTGACATAT CACGCGTTTG TGAGATGTTT TCGTTTGGGG TGACAGTATG GGAGTGCATG ACAAGCAAGA TTCCACATGG AGGGAAGAAA GAATCATCCA TAGCACTTCT GGCAGCGACA AAGAAGCACT TGCCCATGCT CGTAGTCCCA TCCAAACCAA AGGATCTCCC AGAGATAGAG ATGGTGTCCT GGAAAGCGCT CAAAATGGTT GCCACATCAT GCCTCTCTCG TGATCGCTTG GTGAGACCTA CTGCTTCTGT GGTGGTGGCA CTTTGGCATA AAGTAAAGTC TCCAGGGAAT GTTGAACCAC TGTCTTTCTC TCTCCAAAAC CCACCTTCAG CAAAAAGTGG TGGCATTGGC CAGACTTGGC TGCCAACAAG TACTCAGGGA TCAACAAGTC AAGACATTGT CTTTGATACG AAAGGCTATG AAGACGAGTC CAAAGCAGGA TATACTACTT CCTCAAAGAA ACGTTGCTAC ATTAGACTGT CTGTCATTGC AAGTATTGTG 
GTGTTGCTAG GAGTCATAGT GCTATTGGCC GTTATCCTGG TGCCCAGAAG TTCTCCTGAT CCCTCGTCAC CGTTGCCGGC TCACCTGTCT TTTCAAACCA CACAGGAGTT GTATGATGCT GTTGATGCTT ACGTTGGTAC AACCAGCCCC GTAGACTCCA CCGCAGCGAC TGTGTATGGA TATCCTATTG GATCATGGGA TGTGTCACGA ATCTCCAACT TTTCTCAAGT TTTTGATGGA TCAGCTCGGA ACAGCGCCAT TGGAATGTTT GATGAAGATC TGAGTAACTG GGATGTTTCG GCCGCGTCAA CAATGCATTC GATGTTCAAT GGTGCTTATG CGTTCAATAG CAATCTGTCA GCTTGGAATG TGAGTCGGGT AGCAGACATG AGTTTCATGT TTTGGGGCGC ATCGGCGTTT AATGGGGATC TTTCATCATG GAGGGTTGAC CGGGTTGCAA GTATGGAGTC TATGTTTGAG GGTGCAAGCT CTTTCAATGG TGATCTTGCA TTGTGGAATG TGAGTCAGGT AACAGACATG AGTTTCATGT TTTGGGGCGC ATCCTCCTTT AATGGAGACC TTTCAACATG GAGGGTAGAT CAGGTTTCAA ATATGGAGTC AATGTTCTAC AATGCAAGTG CTTTCAACAG TGATCTTGCT ATGTGGAATG TAGGACAGGT AACAGACATG AGTTTCATGT TTTGGGGTGC ATCTTCCTTC AACGGGGATC TTTCATCATG GAGGGTAGAC AACGTTGCAA ATATGGAGTC TATGTTTTAC AATACAAAGA CTTTCAATAG CGATGTCTCA GCATGGAATG TAGATCAGGT GATCAACATG TCAAGTATGT TCCAGGCTGC ATCTGTCTTC AATGCTGACC TCTCATCCTG GAATGTAATG CGGGTTACAA ACATGAGAGC TATGTTTGAG GAGGCAGGTG CCTTCAATGG CGATGTCTCA ACATGGAATG TGGGCCAGGT AACAGACATG AGTTTTATGT TTTGGCATGC ATCCTCCTTT AATGGGAACC TATTTTCATG GAGAGTAGAT CAGGTTGCAA GTATGGAGTC TATGTTTCAG TTTGCAGCTG CCTTCAACGG CGACCTGTCA AGCTGGAATG TCAGCAAGGT AACTACCATG CAAGAAATGT TTAATGGCGC ATCTTCATTT GAGGGCAACC TTTGTCCCTG GCTGGCTTGG CTTCCTTTGG ATTGTAATGT TGATGGAATG TTCCTTGCTG CACAGTCCTG CACAGACACA GCAGACCCTA TACTACCAGA TGGGCCAATG TGCAATACCT GCGCAACGTA AGGTAATTAT ATTCAAATGC ATGAACCTAA GGGAGGTGAA TTTCCACAGA ATTATGACTA TACATGACTA TATAAAAATT AGTCCCAAGT GA
|
Protein sequence | MADAVFFSIV ALAKAGIESC RNAQICQDEA GRIGKRLTIV VARAHEWGAV CASARLIHFH EVVENVFLCL QAVTSPRSKR SSWNKMFKST LQSQTLLDKI LEAESQLNTA INDLQMEQSN AIFSQLVDVS KGVAELLDQF GTLAMSKSNP SVTVQQQFDK VLADAQTQAP EVAVSIPSDR IHHPVQEYNS LDCVGDEVAL SPQQQKIAFR PSKEDVLAIS LKASLLEFSD DQKNLLGGGG FAEVFRGTYN HRPVAVKRLK AYHGDVASLS LSQIARDVER LAAEAILTHK CSKHSNIIHV IGCITVLSEV ERPLIVMELM HTTLFDALHD RNQKDAMGFS RRLFLLKGIA GALEFLHLQG IVHHDIKSLN ILLNKQLTIA KLADFGESKV KGLHTTKLRL STILATTSHQ GNQIAGTAAY QAPEILSEEV LDISRVCEMF SFGVTVWECM TSKIPHGGKK ESSIALLAAT KKHLPMLVVP SKPKDLPEIE MVSWKALKMV ATSCLSRDRL VRPTASVVVA LWHKVKSPGN VEPLSFSLQN PPSAKSGGIG QTWLPTSTQG STSQDIVFDT KGYEDESKAG YTTSSKKRCY IRLSVIASIV VLLGVIVLLA VILVPRSSPD PSSPLPAHLS FQTTQELYDA VDAYVGTTSP VDSTAATVYG YPIGSWDVSR ISNFSQVFDG SARNSAIGMF DEDLSNWDVS AASTMHSMFN GAYAFNSNLS AWNVSRVADM SFMFWGASAF NGDLSSWRVD RVASMESMFE GASSFNGDLA LWNVSQVTDM SFMFWGASSF NGDLSTWRVD QVSNMESMFY NASAFNSDLA MWNVGQVTDM SFMFWGASSF NGDLSSWRVD NVANMESMFY NTKTFNSDVS AWNVDQVINM SSMFQAASVF NADLSSWNVM RVTNMRAMFE EAGAFNGDVS TWNVGQVTDM SFMFWHASSF NGNLFSWRVD QVASMESMFQ FAAAFNGDLS SWNVSKVTTM QEMFNGASSF EGNLCPWLAW LPLDCNVDGM FLAAQSCTDT ADPILPDGPM CNTCATPK
|
| |
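The Start bp, End bp and Gene Length fields above are related by simple arithmetic, and the GC content field is a base-composition percentage. The sketch below illustrates both calculations. It assumes the coordinates are 1-based and inclusive (consistent with 25263 - 22062 + 1 = 3202) and that spaces in the sequence are only display grouping. The function names `gene_length` and `gc_percent` are hypothetical helpers for illustration, not part of any database tool, and the record does not state which span its own GC value was computed over.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string.

    Spaces, newlines and any non-ACGT characters are skipped, since the
    record groups the sequence in blocks of ten bases.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
length = gene_length(22062, 25263)

# First ten-base block of the gene sequence, used only as a small demo input.
first_block_gc = gc_percent("ATGGCGGACG")
```

Passing the full gene sequence string to `gc_percent` would give a figure that can be compared with the GC content field in the record.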