Gene Information |
Locus tag | PHATRDRAFT_47538 |
Symbol | |
ID | 7202762 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 49938 |
End bp | 55136 |
Gene Length | 5199 bp |
Protein Length | 731 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181834 |
Protein GI | 219123027 |
COG category | |
COG ID | |
TIGRFAM ID | |
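The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that the coordinates are 1-based and inclusive, and that the coding sequence is the protein length plus one stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue, plus an optional stop codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
span = gene_length(49938, 55136)
coding = coding_length(731)

print(span)           # 5199, matching the Gene Length field
print(coding)         # 2196
print(span - coding)  # 3003 bp of the span not accounted for by the coding sequence
```

Under these assumptions the gene span exceeds the coding length by 3003 bp, which is consistent with the "Is gene spliced | Yes" field: part of the span would be intronic rather than coding.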
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTGTA CGTATAGCTT GATGCTATTG ATAGTTTGCA GATAGCTTGG TACGTGAAGA TATGGTCCTA CCGGAAGTAG AAACCACACC GCTGCTGCAG ATTCGTTTTC TACCCCAAGC GTCGGACGAG CTAATTCACC TGGTCCAATC TCGTTTAGCG GAAAACGGCA TTGTCGTCTT GTCGAGTCAC AAACTCATGG GAAAATCCAC GATTTTGAGA ATAACGGCCA AGATGGAAAC GTTGGAGATA CAGGCTGAAA AGATTCACCT GATGAAGGAA ACGGTTGGCA GTCGTCGCCG GGTCGTCGAT TACTTTCGCC GAGAGCACCG CAGTCGGTTT TGCGATCTCA CCAAACCGCC CCAACGTGAT GCACAAGGTT TGTTTACGGC TGCTGAGTAT GCACTGTTAA TAAGGCATTT GTTGGATCGC GTTCATGTAT TGAAAACAGG GCAAGTTTCA TCTCCCTTGT CCCAATTGTT TGACACTAAT TATCGCGTAA AATATTTGGT GGACTTTGAC GACTCAAAAG AGGCTTCGAG AGGGAGCTTG GCATTTTTAA CGGCTTCGCT CCGCCGCAAA CTTCACGAAC ATGGAATTCA GAGTGCCTGT CTTATGCATG TTCTGATCAC GTACAATTTG GTCGATGCGG TTGTTCCAGT GCCGGTTCCT GCCGTAAATC GTGAAATTTT TCGAAAAACG TGGTGGCCCT GGTCTCGTTT GGATCTACCT ATTGAATTAA TCCAAGACTA CTACGGTTGG GAAATTGGTT TCTATTTTGC CTGGATGGAA TTTTTGACTC GCTGGTTATT CTTCCCTGGT ATCCTCGGTC TGCTGGTATA TATCATTCGC TGTTACCGGG GAGATACCAT TGACACGGAT GAATACACTC CTTTTTACGG TCTCGTCACC TTTCTCTGGG CCGTATTGTT TTTAAGATTT TGGGAACGCC ACGAACACAG GCTCGCCTAT CAGTGGGGTA CTTTCTCGCT GTCGCAATAC GAGCGTCAGA AGTTCTTTGC CGTCCGGCCG GAGTTTCGGG GCTACCTACG GAAGTCCCCC GTAACGGGGG AAGTTGAGAC ATATTATGAG CCGTTACAAA GAAGAATCAA GTATATTGGA AGTGCGCTTG TAACTTCGGT TATGCTGGCG GTTGCCTTTT CTGTCATGAT ATTGTCTCTC AATTTACAAG GCTACATTCG CCCGAAGTCA AACCCCACTC GTTGGACCAA AAACAGTCCA CATCCTTTCT TCATTGCTGA TTTGGCATTC GTGTCCGAAC CGGGCCAAGT CTTCGACGCC TTGTCTCTAC GCGGCTATAT TCCGGTAGTC GGGCACGTGA TATGTATTTT CTCTTTGAAC TTGCTGTACC GTCGAATAGC CGAACGATTG ACAAGCTGGG AAAACCACGA AACGGAGAGT AGTCACCGTA ACTCACTCAT TCTCAAGCGA TTCCTGTTTG AGGCCTTCGA CTGCTACGTC GCACTCTTCT ATTTGGCGTT CTATGAGCGG GACGTTGAGC GCCTCCGTCT AGAGCTGATT GCTGTTTTCC AGATTGATAC AATCCGGCGC GTTTTGCTAG AGTGCGTTAT CCCCATACTG ATCCAGCGAT TCAATGCGGC GCATCACTTG AAACGAAAGC TGAATCCTAT GCAGTCCCTC CTGGTGATTC CCACCCACGA CATATTAATG GACGAACTCG ATAAGGATAC CTACGATCAG TTTGATGATT ATATGGAGAT TGTAATTCAG CTCGGATACG TCACCTTGTT TGCGTCAGCC TATCCGTTGG CATCCTTAAT 
TAGCATCGCA GCTAATTGGG TGGAGATTCG TTCCGATTGT TTCAAGCTAA CCCAGGGTTG TCAACGACCA GCTGTCTTTC GATCTTCTGG TTCGGGTATG TGGAAAACCT TGGCATCTTG TATCATTTGG ACGAGCGCCT TGACGAATTG TCTCATCGCG GGTTTCACAT CTGACCAGTT GGTGCACTAC CTGCCTTCAT TTTACGTTCA TGTGCAGGAG GGCTATACAG ATATGGGTCA CGAAAAGGGT TGGTTGCTTG TGTTTTTGAT TTTTGGACTC GAGCGGATCC TGGTTTTAAC CGGCTTGCTC GTGTATGCGA TTGTGCCAGC CGTACCAGAA GATGTAGTCG ATGAGCTAGA GCGGCGTCAG TACATTCGAT CGCAGCAGGA AGCGTGGGAG CATTCACCCG AGAACAAAAA GAACGACTAA GAAATTTGAA GTAATCTTTT TATTCTCTTA AACAACGAGA TCGATTGGCT GTGAATTGCA CGTTTGATAA TATATTAGAG CGGGCGGAGG TGTCTAAAGT TTACCAGGAA TCCGTATCGC CGTAGTTGAT GACGATGCAG TGTCTTCTGC AGTCGTTGAT ACTGCTCCTA CCGCAGGTTG TTGTACTAGC CCGTACAAGA GCACGGAACC AGCTTCTCCC CGTTCAATAG CAATGGATCC GACAGCCAAC GTGAAATTAC TGGCCATGGC AATTGCGACA CCGAAATCGT CACCAGGCCT TTCTCCGGTC CAAGATACAG AATGCACTTG CCATAATGAA GAAACATCAG TGGTGTTTAT TGTGCTGGTG CTGGGTGGGT ATCGGTACAA GCGAACCTCC CCAACTTTGC TGTTGGCACC GGGAACTCCC ACTGCCAACA AACCTTGACT TAAAGACACA GCAGAACCAG CTTCATCGTT GGCCGTGGGC GTTTCCTGGA CAATAGGATC ACCTAAAGCG TTCCAACCAT TGGAGGCTTG GTTATACTCG TACACTACCG TCATGCCAGC ATTACGTTGA CCGTTGACGG TCTTCCACGG AATGCCCACG GCAACACGGA ACGTCGCTGT ATCTCCGCCG ATGTCTCCGA CGGTGGTAAG GGCGATGGAA GCGCCGTAGC GATTGTCGGA AGAAGAGGGT ACAAAATCGC TGTTGACCAT GTCGAGGGGG CCGCCAAGTA CTGTCCAGGT TTGCGTGGAG GTGTCCCATT GCCAGGCCCG GACATAGCCA GTGCTACCGG TGTTCCGAGG GGCACTAGCA ATCACTACGC TTCCATCCGG CGAAAGATCA ACAGCAGAGC CCAACCAATC CAGATCTCCT GTCCCGGTCA AAGGCGCACC GCTTCCCATC ATTTGCCATA CATTGCCGGA CGGGCTGTAT TCGTACACCC CGACGTGTCC GGCGAACCGA AATTCCGGTG TAGAAAAGTA AGGTGCACCA ATTGCCACAC GAAGACCGTT TTGCGACAAA GCTACACCCA CACCAGCGTA AGACATGGAA GAGAGTCCTA TGATGCGGTT TCCCCTTCGC GCGTACTGGC CCGTCGCTTC GTTATAAAGG TATATTTGGA TACCGCCTGC TTGCTCGCCA CCCGAGTCGT CGTTCGGTTC CGAAACAGCC AAAATGGATC CATCCGCATT GAGCGCCACG GCTGAACCCA GGGCATCCTG GGCCGCGTCT CCCCGAAGAG CGCTACCTCT TGGTATCCAC TGGTCCTGAA CTCGTTGGTA TACCTGTACA AATCCAGCAC GAGCCAACAC GCCGGCCGCA CTGTCACTGT TGGCCACGTC CGGAGCGCCA ACAGCCACAA CTTTACCGTC 
ACGCGACAAA GCCAATGCTT GTCCAAACAT ACCGTTCGGT GCCGGACCCG TCAGTCTACC CAAGGGTGAC AATTGCAAAT TTTCTAAAGC TTGCGTAGGC GTTGCGGTGG GAGCAGACGT CGTGGAAAGC GCGGAGGGTG CGATGGAGGG CTGTGCCGTA GACGAGACAG TCGTGGGACG AACAGTCGGC CTGTTGGTTG GGGCGCCAGT ATCCGACGCT GCTGTAGGAG TATCGTCTGT ACCGGCAGCA ACGTCGATCA CAGAAGGGGC TTGGGTCCCT CGCACGTACC GAGTTGGCGC TGCGGTGGTG CCAGCCGCCA ACGCAATCGC TGGAGACGCC GTGGGTTCCG TTGTCTGTAT ATTGGACCAA GAGTCGCCAC TAAGATTCGT TTGCGACGCA TCATCTCCCG GGTCATTACT GTTTCTGGTG AATCCCGTCG TGACCGAAAC AATCATGGTC GCGATGGCCA GGAGGAGTAT CAAGAGCAAG GCGTACGCTA ACTTTCCTTG TTTAGAGCGA AGGTTGAAGA AACCGTAACA GCACGCGTAT GAGTTGGTGC TCTGTCGGTG CGACGTCCCG AAGCCTTCGG CCGCGCTGTC GTCGCTCCCG CGGGCAGTGG GGCTAACTTC GTGAATTGAG CCCAATGCAC CGAAATTGTA GTTGGAAGTG GCCTCGTCGC TGTCGTAAAT GGTAGCGTGA TAGGGTGGAT GTGGGTGCGA TCGCATTTTG CGACTATTGT TGCTAATGAC GGGAGTATGC TCGGAAGCGG CGTTGGAAAC CGTGGAAGCC CCATCGTCTA GAGTCGGTAA AGTGTCGTTG TCGTCGTCGG TGTCATCGCC ATTGCAGGGT GCCGATTCGA CTTCGAAAGA CAGTTTGTGC GTATCGGGTA TGACGGCTTG GGCCGCCGCA TCCCCGTAGG CATCGACATC CACGGGTCCC GTGATGCCAC CATCCGCCTC AGAGCTGTTG AGGACAGCGT TAATGATAAA CTGCTCGTGT GGACTGGTGC CAACGTCGGC GGTATGGTCG TACTCGGGAC TGTGTATACC GACATCTCGT TCGTAGGGTA TCCGCTCGCG ACCTTCCTGC CCGTGACGCT GTTGTTGCTG ACGATAGCGA GCCAGCCACC AAGCGGAGGC GATTCGTTCG TGTTGTTGCG ATTGCCGCGG TGACGTGGAG AAGTGTGCTG TTGGTATCGG TCATGCTGCG GATGCGGCTA GTCGGCGAAG CGTTACCCGT GTCCGCGGAC GACTCCACGC TGGTGTTTCC GTGCGGAGCG TGTCGGTCAC CATCCCTTTC GTCGTCGTCG TCTCCGCTAC TGCCGCTCTC CACGCGCTGT CGGTAGAGAT CTCCGGGATG ACCTCCGGTG CGGAAGAAGG GATTGGACGG TGTAAAGGAG GAAGTGGTGG TGTTGGCGGC GGCGGCGGTT GTTGGAGGCG TCGCGGTAGC CAGAGGAAGA GGTGCAGGAG TCGTATCCGC GGTCGCGGCA ACTCCGCGAT CTTCCTGTGG ACGGGCACGG CGCGAACGCA GGATCGAGGG ACGGCGAGGT GGTGCCATGG TGCCAAAATG TTGCAATGGG ATACAGCAA
|
Protein sequence | MDYSLVREDM VLPEVETTPL LQIRFLPQAS DELIHLVQSR LAENGIVVLS SHKLMGKSTI LRITAKMETL EIQAEKIHLM KETVGSRRRV VDYFRREHRS RFCDLTKPPQ RDAQGLFTAA EYALLIRHLL DRVHVLKTGQ VSSPLSQLFD TNYRVKYLVD FDDSKEASRG SLAFLTASLR RKLHEHGIQS ACLMHVLITY NLVDAVVPVP VPAVNREIFR KTWWPWSRLD LPIELIQDYY GWEIGFYFAW MEFLTRWLFF PGILGLLVYI IRCYRGDTID TDEYTPFYGL VTFLWAVLFL RFWERHEHRL AYQWGTFSLS QYERQKFFAV RPEFRGYLRK SPVTGEVETY YEPLQRRIKY IGSALVTSVM LAVAFSVMIL SLNLQGYIRP KSNPTRWTKN SPHPFFIADL AFVSEPGQVF DALSLRGYIP VVGHVICIFS LNLLYRRIAE RLTSWENHET ESSHRNSLIL KRFLFEAFDC YVALFYLAFY ERDVERLRLE LIAVFQIDTI RRVLLECVIP ILIQRFNAAH HLKRKLNPMQ SLLVIPTHDI LMDELDKDTY DQFDDYMEIV IQLGYVTLFA SAYPLASLIS IAANWVEIRS DCFKLTQGCQ RPAVFRSSGS GMWKTLASCI IWTSALTNCL IAGFTSDQLV HYLPSFYVHV QEGYTDMGHE KGWLLVFLIF GLERILVLTG LLVYAIVPAV PEDVVDELER RQYIRSQQEA WEHSPENKKN D