Gene Information |
Locus tag | PHATRDRAFT_19329 |
Symbol | |
ID | 7199762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 305757 |
End bp | 309958 |
Gene Length | 4202 bp |
Protein Length | 597 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178974 |
Protein GI | 219116358 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0319191 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACTTTTTGAT ACTGCATCGA TCTGCATCCA CTTCTCAAGT GCTGTCTCAA GAAAGCGATG CAGCGTTCTA TGCTTTATCG TACCTTCGTG CCGCCCGTGT TCCGTCAACG GACGTCTCCA ACACCGGCCT GTGCGTCTTT TCGTCGTCCC GGCTTCTTCT CCACCGCGAC TGGCTTTTTG GAAAGAAATA TTACGTACGA TTCACCCAAA ACTTTCAATA TCCTTCCCGT CTTGAAGGGA TGTGACCTTG ATACTCGTAG TGATGTCGTT CGATGCGCCT TGTCCCGCAC GCAACACGAC GTTGAGCTTT TGAACGAACG TCTCGCTCAC GTCCGTCTCG GTGGGGGGCC ACAGGCCGTG CAGCGTCACG TCTCTCGTCA GAAACTCTTA CCAAGGGATC GAATCGAATG TATTATCGAC CCCGGGACCC CGTTGCTCGA ATTGTCCGCA CTTGCCGGCT TCCAGGACGA CATCCCATCC GGTGGTATTG TAACAGCGAT AGGGATCGTC GCTTCACAGC CCGTCATGTT TATCGCCAAC GATGCGACCG TCAAAGGTGG CACCTACTTT CCCATAACTG TCAAGAAACA TCTGAGGGCT CAAGAGATTG CTTTGCAGAA CGACCTACCC TGTATCTATT TGGTCGATTC TGGTGGAGCC TATTTGCCCC AGCAAGACCA AGTATTTCCC GATCGCGATC ATTTTGGGCG CATCTTTTAT CACCAAGCGA CCTTGTCGAG CCGAAATATT CCACAAATTG CTGTCGTCTG TGGATCATGT ACTGCCGGTG GGGCCTACGT CCCTAGTATG TCCGATGAAA CAATTATTGT GCAAGGCAAT GGCACAATCT TTCTTGGTGG ACCACCGTTG GTACAAGCAG CGACGGGAGA AGTCGTATCG GCTGAGGAAT TAGGTGGCGC TCGAGTACAT ACCACCGTCT CGGGGGTGGC CGACCATTTG GCCGTAAACG AGCTCGATGC TATGCGATTG GCGCGCGACG TCGTAGCAAG TTTGGGACGG GCACCTTCCG GTATTACAAG CACCAAGACA ACTTCGGAAC ACTTGCCACA GCACGAAGAA CCCTTGTACG ACCCCAAGGA ACTCGGCGGG ATCGTTCCCG TTGATTCCAA ACAACCCATG GACGTCCGTC TGATTCTGGC GCGTATCTTG GACGGTAGTC GATTTCACGA ATTCAAAAAG AACTACGGGA CGACCTTGGT GACCGGCTTT GGAAAACTTA TGGGTCAAGA CGTCGGTGTG ATCGCCAACA ACGGAATTCT TTTCCCGGAA AGTGCCATGA AAGGTGCACA CTTTGTTGAG CTGTGTTGCC AACGCGGGAT TCCTATTTTA TTTATCCAAA ATATTACAGG CTTTATGATC GGGCAAAAAT ACGAACACGA AGGTATCGCG AAACACGGGG CTAAACTGGT GACGGCTGTG GCGACGGCAG CCGTCCCCAA GATAACGCTA GTGGTTGGTG GCTCGTACGG CGCCGGAAAT TACGGTATGT GTGGGCGTGC CTTCAGTCCT CGGTTTCTCT ATATGTGGCC CAACGCCAAG ATTGGTGTCA TGGGCGGCGA GCAAGCTGCC AACGTACTGG CCACTATCCA ACGAAACAAC ATTCACGCAC GGGGTGACCA ATGGAACGCG GAAGAAGAAG CAAGCTTCAA GCAACCCATT TTGGACAAAT TTGAGAAAGA AAGCTCAGCG TACTACTCGA CCTCGCATCT GTGGGATGAT GGCATAATTG ATCCGGCTGA TACACGAATG GTACTCGGTA GCTCGTTGGC AATAGCACGA 
CGAGTTGGTT GCAACCCAAA CGCTACCAAA TTTGGTGTCT TCCGAATGTA ACCCACACGT GTATGCGCTA ACAGTAAAGC AGCAGGGGAA GCGAAAGGAC TGTGAATTCA TTAGTAACAA ATATGTAAAG TTAAAACGCA TCATTCAGCA CTGCCTCGTA TAACAAAGCT CCCTTTACGA ATAATCTTAG AGTCCTTCGA CATTCTTGAT TGGAGCAACT AGGGAAGTCT TGTAAAGGAT ACCGATGATA TATAAGACCT TGTAGACGGC TCCAATCAAC AACAGAATGC CCAAGTCAAA TACGGTACTG TTTTCGGTTG ATAAGACAGG AAAGAACCGT TCGAGATCTC CGAGGACTTC GATTCCAGAC GTTGAGTTCA CGCATACAGC GGACACTCCT TGATTTGTGC AGGCTTCAAA GGTCGAGGCG GAGAATGTTT CGTAGATGTT ACTGCGTACG TAGTAGCTGT ACGGCATTAT GTAGTAGAAG AGTTCAAAAG GCCAGTACAA GTCATCCAAC GGAATTAGGA AGCCACCAAA CAAGAAAGAG GCAAACCAAA AATTCATAAA CTGAAGCATG CCTAGGATAG GATCTTCAAA CCAAACCGAA AGCATCTCCG CCAAGGATTC AAAGACGAAC ATGAGACAGG CGAACAACAA TACGCAGATA CCAAACGACT CTGCGGGAAC ATCTTGAACC GCGTAGGATG GAATACCCAG TGAAAAAATG GCAAAGACGA AAAAGAGCGG TAGAACAAGA ATGAATTTGG CAAGTACATA GCTGATCCCT GTCACCATTC CATTTTTGGT CTCCCGGAGA ATCGACTTGA ATTCATCATT GAGGGCGTAG ACAGCAACGA CTCCCATGTT TGATGGAACA CCAACGAACC AGATGCTGAC CCACATTTTA TTGATGGCTT GATCCTGAGT GAAATCACGA GCATTCCAGT AGACAATGGC AAAGATCAGA CAAGATACGA AGAAGATGAG ACATCGACCG ATGTATCTTA CAAAGTACAA AGAAGGAGGA AACAAACAAA TTAGTCAACC CCTAAAACCT TGCAAGCAAC TTGTACCATC AACATTGTGA GTTTCTTACA AAATAGGATC GCGCACAATC AGGGCTCCAT GGCGACGGAA CATAATCATA AGTTCCTTTC TCAAAGGGGC ACGCTTCATG TCTACAACCC CTTGCTGTTC TTCGTCGTCA AATCCTTTCT TGTGGTGCGA AGAGTTTCCT CCGCCGGGTC GCTTTTCTTG CCACGTGTTG AGAATGCCTG TAACAACGGA CTCATCGGAG AAATCCGAGT TGACAAGATC CAAAAAAAAC TCCGCAGGAT TTGTTGCCGG AGGACACGGA TGGCCGATGC TTTCGAAGTA AGGGATCGCT TCGTTCACGT CTCCACTAAA GGCCTCGCGG CCTCGCGACA TGAGCATGAG TTCATCAAAA CCTTGGTATA CTTTAGTGGA AGGCTGATGG ATGGTGCAGA TAATGATAAG ACGTTCTTCT TTGGCCACCC GGACAATCTC TCTCATAATG TTAGAAGCTG ACGCCGCATC CAAACCAGAC GTTGGCTCGT CCAGGAAAAG CATTGTGGGC TGCTTCAGTA GAGCGATTCC GATGGAAAGG CGACGACGCT GTCCACCGGA CAAACTTCCG TTGCGGGTAT TTTCGCAAAC TTTCAGGCCC ATCTTGTCGA GAATCTCGTC CACTACCAAA GCCATTTCGG CACGATTGAC AGCAACATCG TAGAGTTCGG CTGCGAACAT AAGTGTTTCA CGACAGGTCA AGTACGGCCA ATGTTTATCG 
TGTTGCTTGA CAACATAACA GTGTGATTTG AAAATCTTGT CAGTCAACGG AACGCCGTTG AGGGTAACCT TTCCAGTCGC TTCTCCATAC AATCCTTCCA AGGTCAAGGC GGAAACGAGA GTCGTCTTAC CAGCACCGGA CGGTCCCATA ATGGCGAGGA CATCTGTGGG GCACGCATTG ATGGTTTGCG AACGGTACGT CGCGAAACGG GTGAGAATCT AGAGATTGTA CGAGGAGGAG AAGTACTCCA CAGATAGACA CTTACGACCC CATTTCACTG TTCCAGACAC ATCCGTGAGA ATACTCTTTT GCTTATCTTT CTTGCCCACC ACAAAATTGA CGTTTTTGAA GCGGAGCACA GAACTTTCCA TTCCCTGACT GTGTGTGGAG CGATGATTAG AAGTACCCAT CTTGCTGGTC GAGGGGTTCT CGATGACGAC ACCGGTGCCA TCTTCTTCAG CGACAGGGTC ATCGTCGTGT TTTTGTTCCT CCTCGGCCGA CATGTTGGTT ATAAACTGCG CTGGAATATT GTGTTTGTGG TATCTTTGTT TGGTCTGCTT CGATACTTAC AGTCAGGACT TGTCTCAAGC TT
|
Protein sequence | MQRSMLYRTF VPPVFRQRTS PTPACASFRR PGFFSTATGF LERNITYDSP KTFNILPVLK GCDLDTRSDV VRCALSRTQH DVELLNERLA HVRLGGGPQA VQRHVSRQKL LPRDRIECII DPGTPLLELS ALAGFQDDIP SGGIVTAIGI VASQPVMFIA NDATVKGGTY FPITVKKHLR AQEIALQNDL PCIYLVDSGG AYLPQQDQVF PDRDHFGRIF YHQATLSSRN IPQIAVVCGS CTAGGAYVPS MSDETIIVQG NGTIFLGGPP LVQAATGEVV SAEELGGARV HTTVSGVADH LAVNELDAMR LARDVVASLG RAPSGITSTK TTSEHLPQHE EPLYDPKELG GIVPVDSKQP MDVRLILARI LDGSRFHEFK KNYGTTLVTG FGKLMGQDVG VIANNGILFP ESAMKGAHFV ELCCQRGIPI LFIQNITGFM IGQKYEHEGI AKHGAKLVTA VATAAVPKIT LVVGGSYGAG NYGMCGRAFS PRFLYMWPNA KIGVMGGEQA ANVLATIQRN NIHARGDQWN AEEEASFKQP ILDKFEKESS AYYSTSHLWD DGIIDPADTR MVLGSSLAIA RRVGCNPNAT KFGVFRM
|
| |