Gene Information |
Locus tag | PHATRDRAFT_49992 |
Symbol | |
ID | 7198776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 65138 |
End bp | 66879 |
Gene Length | 1742 bp |
Protein Length | 411 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184821 |
Protein GI | 219129282 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000883094 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGACCGCTT GGCCGCCGCC GACGCACACG GCAACCCAGT CAGTGGTGCA GACCCTACTC TTCCGTAGGC TGTAAACGTC CACGATAGCC TAACTCAGCG GAGGTTCGGC AATGGACGCG AAGCTACGAC GGACGAGAGG CACAATGCCC AAGCCTCCGG GCCGACAAAC GTATCATTCT TCCGTCGTTT GGGGAATGCT GTCCATTCTG GGCCTCGCTT TTTGCTGGGT CAATGCCTTG TATCTTCTTC GTTTGGTAGA ACAAACCTCG ACTGACCTGC AAGCGTCGGA AGTGATCGTC CACGCAGCAA GGGCGAACGC TAGCGTGTTG TTCGAACCCA ACCAACTACC GCACGACCAT ACATTGATAC ACCCCGACAA GGCCCCAATC CTGAATCTGT TGCAAGAAGC TGGTCTGGAC GTGCGCACCC TGTCGTCCGA AATCGTGGCA TCCATCCCGT CCTGGTCACA AGTCACGGCA CTGTACGGGC ACCAACCCCG CATCTACGGG ATGGATCAAG GCGTCTGCGA CGCCTTTCAA CAATCTTCCA ACCCCGCCGA ACATTTCTTG GGCGTTGCGG GAACATTCAA CACGGGGACA AATCTCTTGG CCGCGCTCTT GATTCAAAAT TGCCACCTTC CCGCCCGCAT AAAAGTTCAC GGCCCTGGAT CCCCCGGCAT ACGGTGGCAG GTGCCTTGGG GTAAGCACAC GCCGGTGGAC GATGAAGATT TTCGACAACG GCACAAAGCG GCTCACGACC AAGACCTCGA GGCGGACAAT GTCTTGGCTG CCGTCGCGAT TCGGGATCCC GCCGTCTGGA TGGCTTCCAT GTGTCGACAT CCGTACGCCA TGCGATGGCA GCTAACTGCA AGGAAGGATA CCAACGGTAC CATACACTGT CCACATTTCG TGATGGAGGA AACGGACGGA ACCAACCACT CGGTCCCGGT TCATGTGCGG TACTCCAATT TCACTCGGCA TTACGAAAGC ATGGTCCATC ATTGGAACGA ATGGTACGGT GCGTACCTCG ACGTGTCCTG GCCGAGATTG CTCCTACGAT TCGAAGACTT GATTTTCCAT CCCAGGCAAG TGACCCAGAA GGTCTGTGAA TGCGCCGGTG GCAAACTTAA TAGTGGACCA TTCCGCTACA TGGTGGAGTC GGCCAAAAAG GGCGCAGTTC ACGGAACGAA AAAGACTAAC TATGTCGACG CCATTGTACG CTACGGAACC TCGAACCATC ACTGGAAAGG CATGAGCAGA GCCGACCTTG CTTACGTCAA ACAGCATCTT GATCCTAGGC TAATGCAAAT CTTTGGCTAC TCGTATCCAA CCCAACCAGA TATGTGACTT CTATTGCTTC ACCCCTTTTG GTTGTTCAAA CGATTCAACG TGGAGCCATG GGAATGGCAA CCACAAAGAA GGAGGAGTGA TTGGTATTCT TCCGTTTTTC AATACTTTTT CTACATGAGG TCCAACGGCA GCCCACGCAT CAAGCCATGT CGAGTCATCT ATGTCCCACA GCGGGGTAGC CCCACGGTAC ATTACAAGTT GAACCTGCTA CCAAGGAGTG TTGCCGGCCA AGAATCTTTC AGCAGTTTCC ATTTACGGCT CTGGATCCAC ATCTAGTAAA TGGAGCAAAG TAGGGTGTTG CATCGGTGTC AGGTGTCTCG AAATAAGGTG TCCGTAGTTT TCCATACTGC TACCGGCTGT ATGCGAGTGC AAATCTCTCC AAAACAGCCA CT
|
Protein sequence | MDAKLRRTRG TMPKPPGRQT YHSSVVWGML SILGLAFCWV NALYLLRLVE QTSTDLQASE VIVHAARANA SVLFEPNQLP HDHTLIHPDK APILNLLQEA GLDVRTLSSE IVASIPSWSQ VTALYGHQPR IYGMDQGVCD AFQQSSNPAE HFLGVAGTFN TGTNLLAALL IQNCHLPARI KVHGPGSPGI RWQVPWGKHT PVDDEDFRQR HKAAHDQDLE ADNVLAAVAI RDPAVWMASM CRHPYAMRWQ LTARKDTNGT IHCPHFVMEE TDGTNHSVPV HVRYSNFTRH YESMVHHWNE WYGAYLDVSW PRLLLRFEDL IFHPRQVTQK VCECAGGKLN SGPFRYMVES AKKGAVHGTK KTNYVDAIVR YGTSNHHWKG MSRADLAYVK QHLDPRLMQI FGYSYPTQPD M
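
The length and GC fields above can be cross-checked against the coordinates and sequence text in this record. The following is a minimal sketch, not part of the source database: the helper names (`clean_sequence`, `span_length`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the reported Gene Length of 1742 bp.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character groups."""
    return "".join(grouped.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# Start bp and End bp as listed in the record.
print(span_length(65138, 66879))  # 1742

# First three groups of the gene sequence, used here only as a short sample.
sample = clean_sequence("CGGACCGCTT GGCCGCCGCC GACGCACACG")
print(len(sample))  # 30
print(gc_percent(sample))
```

Pasting the full Gene sequence or Protein sequence field into `clean_sequence` and taking `len` of the result gives a value to compare with the Gene Length and Protein Length rows.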