Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47218 |
Symbol | |
ID | 7202198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 843287 |
End bp | 845146 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181273 |
Protein GI | 219121854 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.134275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAGAG GCAACGGCAC CACCACCCAT AGTGTTCGGG AAAAGGATAA ACTTTTGACG CCCGTCATTT GGTGCATTCT CGTAACGGAA ACGGGCGAGC GCTTCGCGTA CTTTGGCTTT CGGGCAATTC TGGTGCTGTA TTTCGTCTCT CTGGAATATT CCGAATCTCA AGCGATTGCC TTTTTCGCGT ATACTACCTG TTTGGCTTAT TTGTCACCAA TTGCTGGCGC GCTATTGGCC GACGGACACT TGGGACGTTA CCAAACAATC CTGTGGTTCG GCCTCGTTTA CGTGATTGGC TTGTCCATTC TGACCTTCGC AGCAGCGGCA TCCGAAGATG TAGATCTCGC GTATCGCCGA ACGTTAACCT TCGTGGGTCT CTTTCTGGTG TGCCTAGGAA CAGGAGGCAT CAAACCGTGT GTGTCGGCTT TTGGTGCCGA TCAAATTTCC ATGCGGCCGG AGAATCACGA CGGTGATGAC ACATTGGAGC GTTCTGTTAA TAACGCCGGA CCTGTCGCAA TGACAAGTAC CAAGTTGTAT CGGGATAATC ACAAAGCGAT CACGTTGGCC GATACCGGAC AAGGACCCAG TGAAAGGGAT GGTCTGTTTC GAGAACCACC AGTCGATCCT CGCGAAACTG TCGTGGCACC CGACGGGGTC ACATCTTCGA AGAAGAACGA GCAAGTACGA GCATTTTTTG CTTATTTTTA TTTTTGCATC AACGTGGGGG CCGTCACATC GATTGCACTC GTACCTATAC TGCGAGGACG GTACGGCTTT AGTGCCGCCT TTCTGCTGCC CACATGTTTT ATGATTACGG CTATTCTACT CTTTCTGTCC AAACGAAACG AGTACATTCA TCACCAGCCC GGTAAGGACG GATCTTCACT CAGCACAACT TTTCGTTTGT GCTGGTGGCT TATACGGGAA AATCTATGGT CGATTCCGTG GGTGCAACGC GCACTTCCTT GGGCCAAACC CGAACCACTG CAAAATCATG CTCCCGGACA ACACACGCTG GTGCCAAACG AGGAAGACGA CTACAACACT GACATGGACG CAGGTCTTAA CGATAACACC AGTTCCGTTG ATGACGACAC AGTAGTTGAG AACGACACAA GGGCTTCACC CGACGCCGTC TTTCATCAAC AACTCGATGA CGCGGCACAA GCCGTCAACG TTCTGCCCAT AATGGCCATG TTCCCCATTT TTTGGTGTCT GTACGACCAG CAAGGGTCGG TATGGACGCT TCAGGCTACA CGCATGGCCT TGCCTGATGG AATGTTACCC GAACAACTAC AAGTCGTGAA TCCGCTGCAA ATTATGCTCT TTATCCCGCT TTTCGATAGA TACATTTATC CCGTGATGCA AGCGAAAGGA TGGAATATTG CTCCTCTGCG ACGCATGTCG TGGGGCATGA TGCTGACAGC CATTTCATTC TTTCTAAGCG GCCTCGTGGA ATGGTGTATA CAAAGCCACG AACGAAACAG CGAGGCGATG ATAAGCGTCT TCTGGCAACT TCCCCAAATC ACTGTTTTAG CGATTGGCGA AATATTTATC TCTGTCACCG GTCTCGAGTT TGCCTACTCC ACTTCCCCGG AAAGACTGAA AGCCTTTCTC ATGGCTTTGT TTCTATTGAC GACGGCCTTT GGAGATTTAC TGAGTGGAAT CTTGTATTCC ACCGTGTTTG CGAATATGAA TCGAGCGAAA ATCATGCATA CCTGTGCCTT GCTTATGCTG TGTAACTTGG GATTATTTGC GCTCGTGGTT CGGTGGTGGG AACGTCGCGA AGTGCACGAT TTAAGGCGTT TACAGTCCCT CCAGGGGCTG GAACTACGAG AAGAGCGAAG AATGATTTGA
|
Protein sequence | MPRGNGTTTH SVREKDKLLT PVIWCILVTE TGERFAYFGF RAILVLYFVS LEYSESQAIA FFAYTTCLAY LSPIAGALLA DGHLGRYQTI LWFGLVYVIG LSILTFAAAA SEDVDLAYRR TLTFVGLFLV CLGTGGIKPC VSAFGADQIS MRPENHDGDD TLERSVNNAG PVAMTSTKLY RDNHKAITLA DTGQGPSERD GLFREPPVDP RETVVAPDGV TSSKKNEQVR AFFAYFYFCI NVGAVTSIAL VPILRGRYGF SAAFLLPTCF MITAILLFLS KRNEYIHHQP GKDGSSLSTT FRLCWWLIRE NLWSIPWVQR ALPWAKPEPL QNHAPGQHTL VPNEEDDYNT DMDAGLNDNT SSVDDDTVVE NDTRASPDAV FHQQLDDAAQ AVNVLPIMAM FPIFWCLYDQ QGSVWTLQAT RMALPDGMLP EQLQVVNPLQ IMLFIPLFDR YIYPVMQAKG WNIAPLRRMS WGMMLTAISF FLSGLVEWCI QSHERNSEAM ISVFWQLPQI TVLAIGEIFI SVTGLEFAYS TSPERLKAFL MALFLLTTAF GDLLSGILYS TVFANMNRAK IMHTCALLML CNLGLFALVV RWWERREVHD LRRLQSLQGL ELREERRMI
|
| |