Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47324 |
Symbol | |
ID | 7202491 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 300130 |
End bp | 301927 |
Gene Length | 1798 bp |
Protein Length | 519 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181696 |
Protein GI | 219122736 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.37145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCTCCT TCAGGTTTGC AATGCTCGTT CTGCCGCTCT TCCTTGTTGG CTTGAATATG TGGGAGGTCG TGAGGGACGA AGGTTTTGTG GAGCGCGACT CCTCCTTGAT GCATTATTCG GACAGCTCAC GGATCCTTGA CTACGCAGAC CCGTATATCT CAGATGATTC CAGCAAAATT GTAAGTTTGG TTGCATTCGG AAAATCGTAA GCTTGAAAGA CAACCAGGTT TACAGCTAGG ACTTCAAGAT GTGTCCTTGA TCACTCAGAG CGGAAGTGGA AACACGCCAA ACTCACCAAC TCATCCTCTT ATAAGATTGA GAATGCCCAT TTGTGGGTTC AAAATAATTC AAGGATATGG GTTCCAGCCA CTGAAATCGG ATTGACTGTG AAACTCACAT CCGAGGCATC GGACCCGGGA AAGCGTTTGC TCCGCTTTCC AACAATAGAC CAACGAGTTC AATTTTATAT GTCCTCTTGG TACGAGCCAC CCTGCGATGA GACTGAGCTG TTACATGTTG TCAAGTATGC TGCGTGTAAC GAAAAAAGTA CAACAATGAC GAAGAATGCC AAAGAGGACA ACATATTCAA GAATGAGAGT AGTCAGCCAA ATTGCTTTCC TTCCTTTGTA CTCCAGCGTC AAAACGGACC GGAGGTCGAC TCGCTCTCGG TGGTTTTGAA AGCATCTGCA AAGGCTACTC ACGACCTTAT TTTTGCGTTG GACAAAGATT CGCTCGATTC GTGCCAACTC AACCCCCAAG CCGATCAGTT ACAAGTTGTC CATAGTCGCT ATTGTCCAGA GCTTCGTGAT AAACTGCTAA TGGTGTACCA GAATCAGACC AACAAGACGG ACAACTTGAC CAGTTTTGAG CCCGCAATTG CGTTTGTTCA AATCGGTGAT TCGCGGAGTT CGAGATCTTT GAATGACGTC GGACAGCCGA GAAGACAGTT TCCAAAGCCT GCCGTACCTC ACTTTACCAA AATACGACCT GTTTGGGATG ATGCAAGTAC AAGCGATTTT CTTATGAAGG CATCGTCTAC GGCTTGCGCC ACTTTGCCAA CCCGACGGAC AAATAGAGGA AACCTGGAAT CGATCATTTG GAAAATGGGA GCGTGGCGCC ATTACAAAGA AATAGAGCTA CTTCCAACTT TGGACGTCCC TTGGGAAAAC AAACGGAACA CCGCCGTTTT TCGTGGAGTA ACCAGCGGAT ACTTCAATAC AAGCATATCA CCTCACGAGC GATGCTTCCA AAATCTACGC TGTCGGCTTG TCTTAGACCA CCACAATTCG ACCAAAGTGG ACGCCAGGTT CAGCAGGGTT TTGCCTGATA CCGTACCGAC GGAAATTGAA AACTTTTCTA TCGTTAGCAG GAATCTTGCG CGGGACGAAC TTTTAAAGTA CAAAATGTTA GTCTTCATAG AAGGAAATGA CGTTGCCTCA GGATTGAAAT GGGGACTGTA CTCCAACTCG GTGGTCATGA TTCCCAAGCC CACTGTATCG TCGTGGGCCA TGGAAGAGCT TCTAGAGCCA TACGTGCACT ACATTCCGTT AAAGGACGAT CTCTCTGACA TGGATACGCA GATTGAGTGG ATTCTCTCAC ACGATCGTGA GGCCCAGAAG ATTGCGTCAA GGGGTCAGCT TTGGATTCGT GACCTCCTCT TTGACGATCA GTCCGACAGC GACAATCGTG CAATCAATGG AGAAATGTTG CGTCGGTATG AAGCTCACTT TCGTCCAGGA ATGGTAGTGA AGCGAGGCCC CCTCTTTCAG CAAGAATTCT CATAGAAGTA GGTAGTAA
|
Protein sequence | MCSFRFAMLV LPLFLVGLNM WEVVRDEGFV ERDSSLMHYS DSSRILDYAD PYISDDSSKI ASDPGKRLLR FPTIDQRVQF YMSSWYEPPC DETELLHVVK YAACNEKSTT MTKNAKEDNI FKNESSQPNC FPSFVLQRQN GPEVDSLSVV LKASAKATHD LIFALDKDSL DSCQLNPQAD QLQVVHSRYC PELRDKLLMV YQNQTNKTDN LTSFEPAIAF VQIGDSRSSR SLNDVGQPRR QFPKPAVPHF TKIRPVWDDA STSDFLMKAS STACATLPTR RTNRGNLESI IWKMGAWRHY KEIELLPTLD VPWENKRNTA VFRGVTSGYF NTSISPHERC FQNLRCRLVL DHHNSTKVDA RFSRVLPDTV PTEIENFSIV SRNLARDELL KYKMLVFIEG NDVASGLKWG LYSNSVVMIP KPTVSSWAME ELLEPYVHYI PLKDDLSDMD TQIEWILSHD REAQKIASRG QLWIRDLLFD DQSDSDNRAI NGEMLRRYEA HFRPGMVVKR GPLFQQEFS
|
| |