Gene Information |
Locus tag | PHATRDRAFT_44750 |
Symbol | |
ID | 7199724 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 127159 |
End bp | 129395 |
Gene Length | 2237 bp |
Protein Length | 379 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178709 |
Protein GI | 219115828 |
COG category | |
COG ID | |
TIGRFAM ID | |
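The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported Gene Length of 2237 bp. The function names `gene_length` and `coding_bases` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed 1-based)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode a protein: three per residue plus one stop codon."""
    return (protein_length_aa + 1) * 3


# Values taken from the Gene Information table.
length = gene_length(127159, 129395)
coding = coding_bases(379)

print(length)           # 2237
print(coding)           # 1140
print(length - coding)  # 1097 bases of the gene span that do not encode protein
```

The difference between the two figures fits the record's "Is gene spliced: Yes" entry: a 379 aa protein accounts for only 1140 of the 2237 bases, so the remainder must be non-coding sequence within the gene span.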
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.112014 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGACACGG ACTTGACGTC GACTTATCCC GTTTCTATTC GAAATCCTTC GAGGTTCAAC AGATGGCTGG TCGCGACTTA CCGGTCAAGT TGGAAAGATC CAGACGGCAA CGTTGACACC TCTGTTTATA GTCCGACGAC AAAGTCTTCC CGTTTCGAGT GTTTGTTTCT CACTCCGTGT ATTCCTCACG GCGCGCAGGC TGCGCCCCCC GACGGAAAAA GGAGAAGATA TCGTTCCACA TCGGCGCAAA AGTCTCAGCC TCCCAAAGGG GAACACCGAT CACCCGTTCG CGCACAAAAC TTCCTCTTTC GAGTCGACGG TTCGTGTCCT CGCTCTCCCC CCACTCATCT GTTGCCCAAA GCACTGCTCA TTCCTCTAGT CCAACAATAC CTATCTATAC TCCCAACGTC TTGATCCTAT ACCTTCTTTC CGTATTCGTT TATTCCCGCT TCGATCCGTC ATTGATCCAA AGACCACGAC TCTTTTTTTC TCCTCGAGAA GGTATCTTTT TCGTGGTCTC ACCTCCCACG GAGTTGTGTC ACTCACCTCT ACGCTCGTGC AACGGCATTT CGCATTACCA ACCCAACCAA CCGTAACGAC TCTTCCACCA CACTCTCATG AACGCGTATT CAACACTACA AGTCTCTTCG TTGCAGACTT GGAACGTACC GCAACTGAAT CAGAGTGACG GTTGTGGACC CACAACCATA CGATATCCGA ACCCCGTTCC GAGTCCACAC TCATCCACAG CTGTCGCATC GGTCACCATT CCAGCATCGG ATGCGGCTCC TTACGCCATT TCTCCCGAAA CCTCACCGAG AGACTTGACT CTGAGACACA ACGAAATTCC CAACCTTGAT GACATCCCGG ACATTGCCAA GGAAGCCTTG CGGATCACCA AGGAGAGTTT GGGATCAGAA TGTCAGTTGC TCCATCAAGG TATGTACCCT TCCATCGAAA ACACCCTGGC GCGCTTGCGG GACGTACCGG AGACTGCCAG CGCTTGGAAT ATCTCTTTCG CAGGCGTCAC TGTCTGTTTT CCTGGCGCTC ACTTCATTCC TTTCTTTCCA TATGTCGAAT TTTGGTACCG TCCTAACCAA TCACTCTAGA TACCACCAAT ACCAGAAAAT TACCAACGAT TGCCACAGGT AGTCGGCTCA ATAAATTTGT GCGCCGCCTG CACGATATGC TCAAGCAAGA ACAAGCTTCG GGTGTCGTCG AGTGGCGCAA AGGGCTTCTC GTTTTACACT CGACCAACTC CTTTGCCAAA CAAATTCTTC CCCAGTACTT CAACACGCGA AACTTCAAGA CCTTTAGACG TCAATTAAAC TACTACGGTT TTGTACACGT TCGCTCCTTT ACTACTGCCG GATCGGCCAC GACTGCGCTC TGGGTCAACC AACACTTGGC CGAAAATGGA TCGGACGACA TTTCCTCCGT TTTGCGGCTG AAACGCGTCG AGCCGTGTGA TGCGGCCAAA ACAGCCGAAA GCCGGAGGGA ACGCAAAGAA CTGGCGATTC ATACCGTCGA AGAAGATCTC GGAGTCAGCG CCCGCACCCT ACAAGTGGAA CAAATTCGGT CCATGGCCCT CCGCGGTAGC CGAGAACAAG TACAAGCCGA AATTCTGCGC GGACTCGAAA GAACACCGGC TATCATTGTC AACGCTGCCA CTACTGAGTC GGCGCTGAAA GTGGTTAAGC AAGTCACGAA ATCCGTCAAG CCACCCGTGC CGCGTGAAAT TCACTGCACG TCCATGGCCG TCGGTGGCGA CCTCCAACAA AGCAGAGACT CCTCCTCCAA 
TCAGGGCTCG ACGGAAAGCA TAAGTAGCTA CGGATCCGGC GTAAATCACG ACGATCCGAC GCAAATCTCT TTAGACGAGG ACAGCGGAGC AGCAAATCTT TTGCTCTTCC TTTCCAAATC GTCGTAGAAG ATTAGAAAGC GAGTACTGAC TATGAAAATC CCACTGGATT CCTGCTGTTT TCAGCTACGG GGTATACACC TATTCCAAAT CTCATACTCG AATACTGTCG GCTTGGTCAA CATTTCCCGT CTATTGTCCC AATATACGAC ACTTTCACCT TCGCTAAGTT TACCCGTAAC TCCTCTTCTT TTGTGATCCA AGTTCCTATG CGCAACGCGG AATACGTTCT CACACATCGA CCTCACCTTT CCGTGCAAGG AACGTCATCG CAAGACTCCC AATGGACTTT CTCGTAGCGG TTCTTCATAA TTTGTTT
|
Protein sequence | MNAYSTLQVS SLQTWNVPQL NQSDGCGPTT IRYPNPVPSP HSSTAVASVT IPASDAAPYA ISPETSPRDL TLRHNEIPNL DDIPDIAKEA LRITKESLGS ECQLLHQDTT NTRKLPTIAT GSRLNKFVRR LHDMLKQEQA SGVVEWRKGL LVLHSTNSFA KQILPQYFNT RNFKTFRRQL NYYGFVHVRS FTTAGSATTA LWVNQHLAEN GSDDISSVLR LKRVEPCDAA KTAESRRERK ELAIHTVEED LGVSARTLQV EQIRSMALRG SREQVQAEIL RGLERTPAII VNAATTESAL KVVKQVTKSV KPPVPREIHC TSMAVGGDLQ QSRDSSSNQG STESISSYGS GVNHDDPTQI SLDEDSGAAN LLLFLSKSS
|
| |