Gene Information |
Locus tag | PHATRDRAFT_40573 |
Symbol | |
ID | 7198364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 307914 |
End bp | 309941 |
Gene Length | 2028 bp |
Protein Length | 675 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184599 |
Protein GI | 219128814 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
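The length fields in the record above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the function names are illustrative and not part of the record):

```python
# Coordinates copied from the "Start bp" and "End bp" fields of the record.
start_bp = 307914
end_bp = 309941


def gene_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] with 1-based coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2028 675
```

Both values agree with the "Gene Length" and "Protein Length" fields.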
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.338329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCAA CCCCGTCCAC ATCCACGACG ACGTCCACGC CATCATCGCG CTCCAAGGGT CTCCCCAAGT CACCCAAGCG GCGTCTACAA CAACCTCCAC TCATTACTAC ATCCCACTTT CTTTTGGCTC ACGTCTTTTG CATCGCACTC TATCTTGTAC CCATTCTTAC TACCAGCAAT GCCAATACTC ACACCAACAC CAAGTACGGT ATCAACCCAC CGCATACGCT GGCACCCGTT CTCGATGAAA TACATATCGT CTCCCCGGAC AATGCCGACG TCAACGACCC CAACGCTACT CTACGGAACA TTTTCACCAA CGATTATTGG GGTCGTCCCA TGCAAGCACC CAACAGTCAC AAATCCTGGC GACCCTTGTC CATTCTCTCC TTCCGCTACC TCCAAGGTGG ACACGTCGAT CAGTGCCAGT GGTGGTGGTG GTTACGGTAC TGCACCGGGT TCTCGTTGCC GCCTCTCCTG GCGCATCGAC TCGTCAACGT TGTCACCCAC GCCTGTCTCG CCGAAATGGT CGGGATCCTC GCGGCGCAAC TCGTACCCTC GCCGGACGCA CACTTTCGTC GTCTTTTGCG ACTCGTGGCC AAAATCGCCT TTGGCCTACA CCCGACTCAC GTGGAAGTCA CGGCCAACGC TGCCAATCGA CCCCACCTAC TCGCACTCCT CGCCAGTTTG GCCGCACTCG ACGCCGGTAG CAGTGTTGCC TCTAGCACAA CTGTGTGGCC CTGGCTACGC ACTCTTCTTT TCCTCGTGGC CGGATTCTTG TCCTGCGAAA CCTTTCTCTT CCAAACCGTA CCCATCGTCG TCACCTACAC CGTGCTGGTC TACGTGCAGC TATACCACAA CGCTCCCACG TCCGGGTCGA GTCGTCGCGT CTACCGTCAA CGCAACGGGT GGTGGTACCG GCAACTATGG AGTCTCGTAC CGATCGTGAG ACTCCGCGTC GCTCTAGTTG TGGCCAGTGG AATCCTCTAC TACACGGCCC GATCCGCCCT CGACACCTTG TCCATTCCCG ACGGACTCAT CCGACCGGCC GAAAATCCTT TTTTTGCCCT CCAAGGTTGG CATCGCGTCC GCAACTACCT CTACATTGTT GCCGTACACG TCGCCAAGGC TTGGGATTTG GACGTGCTCG GATTCTCACA CGAGTACGGG TACAATTGCG TGCCGGAAAT TAACGAATGG ACCGATCGAC GCTTGCTGCT ACCACTCACA ATTGCCGTAC TCTACCTGGC CACGGCCGTC TTCTTTCTCT TGCAACACGC CCGTCGTCGC CAAGTCTGGT CGATTCCCTT CCTCCTCTTT GTCGTGCACG TTTCCTGGAT GGTCACGCTC TTTCCCGTCG CCGGGATTGT CAAAGTCGGC ACCTTTGTGG CGGACCGCAT CGTGGTGGCG AGCTCCGTCT CGACCAGTAT CGTGCTCGCC TACGTGGCGA CCCGCTGGAT GACGGCGCCC CGGTCCCGCA CGGCCGTCAC ACGCCGCGTG ACCCTCCTCG CCCTCACGGT CGGTGTCTTT CAAACCCACC GTGTTTACGT CCGTACCACG CAATGGATGG ATTCCTACCC TCTCCTCACG TCTAGTCTCG TTACCTGTCC ACGGTTCGCC AAGGGACATT TGGAACTGTC CAAAATATAT TCCGGACTCT ATCCGGAACG CTTCAATTTA ACCACGGCAC GGTGGCACTT GGCGCGGGTC GAAGATATTG ATCCCACTTT TTGTGACGTG CACCAGCAAG TAGCCCACGT GGCCATTCAG GAACGGCGGT ACGAAGAATT CGAAGAGCGC 
CTGGTCCAAG CCTTGCTCTG TCCGTTTACC CTGGGCGGTG CCACGGATCT GTGGCAACGA TACTGGAAAA TTACGTTAAA TTCGCAACAG AATCCGTCCG ACGTTGTTGC CGCGGCCGAA CAACGCTACC AGACCTACAT GAAACGCATC CAGGTGGCCA TTCAACAAGA ACAAGAAAAC GAGCCGGTCC CCGTATCTAC CTCACCAATC GTGGGATGGC AAAAATGA
|
Protein sequence | MSSTPSTSTT TSTPSSRSKG LPKSPKRRLQ QPPLITTSHF LLAHVFCIAL YLVPILTTSN ANTHTNTKYG INPPHTLAPV LDEIHIVSPD NADVNDPNAT LRNIFTNDYW GRPMQAPNSH KSWRPLSILS FRYLQGGHVD QCQWWWWLRY CTGFSLPPLL AHRLVNVVTH ACLAEMVGIL AAQLVPSPDA HFRRLLRLVA KIAFGLHPTH VEVTANAANR PHLLALLASL AALDAGSSVA SSTTVWPWLR TLLFLVAGFL SCETFLFQTV PIVVTYTVLV YVQLYHNAPT SGSSRRVYRQ RNGWWYRQLW SLVPIVRLRV ALVVASGILY YTARSALDTL SIPDGLIRPA ENPFFALQGW HRVRNYLYIV AVHVAKAWDL DVLGFSHEYG YNCVPEINEW TDRRLLLPLT IAVLYLATAV FFLLQHARRR QVWSIPFLLF VVHVSWMVTL FPVAGIVKVG TFVADRIVVA SSVSTSIVLA YVATRWMTAP RSRTAVTRRV TLLALTVGVF QTHRVYVRTT QWMDSYPLLT SSLVTCPRFA KGHLELSKIY SGLYPERFNL TTARWHLARV EDIDPTFCDV HQQVAHVAIQ ERRYEEFEER LVQALLCPFT LGGATDLWQR YWKITLNSQQ NPSDVVAAAE QRYQTYMKRI QVAIQQEQEN EPVPVSTSPI VGWQK
|
| |
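The sequences above are printed in space-separated blocks of ten characters. A small sketch of how such a block-formatted nucleotide string might be cleaned up and its GC percentage computed; the helper name `gc_percent` is hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full gene:

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    # Remove the spaces and line breaks used for block formatting.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence, used only as a small demo.
demo = "ATGTCTTCAA CCCCGTCCAC"
print(gc_percent(demo))
```

Passing the full gene sequence to the same function gives a value to compare against the "GC content" field.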