Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45453 |
Symbol | |
ID | 7200557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 198279 |
End bp | 199849 |
Gene Length | 1571 bp |
Protein Length | 458 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179601 |
Protein GI | 219117618 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00124507 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGAACACT CATTCCCGTG ACCCCCTTGT GTACACAAAC ATCTTTTTTT CGACTCTTTC ACAAAACAAA CCAGTAACCA CTCTTGGGTA TCTCCTTGGT TTCCTCATTT GGGCATTCAT GACGACCCGT AGCATGGGAC GTCGACCATT GTGGTACGGT TACTTACTGT TGGTGTTGAC GATTCCGGAA CCGAGTTGTG CGCGCGTACC GTGGGGGCAC GCCACACCGT TGGTCTACTT CAAGGACGCG TCGTCGTCGT CCTTCTCTCG CTCCCTGTCG GCAGCATCGA CGTCCTTTCC GGCGCCGTTA CCGGGCAGTA GCAGCAGTAG CAGCAACGGC GGTAGTTCCG CGTCGCTGTC TCGGGGTGGA TCGACCAAGA CGATCGTTAG CGTCGACGAT GGCGTGGATA CTCTCGATCC GGAAGCCGCA ACACTGTCAC CATCGGCACC GTTACTGCCG GAGGAATCGA CAACAGCGTC GTCATCCTCA ATAAAATCCT CGGAAGAGAC CACGTCGACG GCACGATCCG TCACGGGAGG CGCCTTCAAC AGCAGTGTGG TGACTAACAA CAGTCAGAGT GCCGCCGAAA CGACACCGCC TGGTACGAAA GCGAGCACGT CGGCAGCCGT TGGTCCCAAG AAGGTGACAA CCGAATACGA ACAATTGCCT CGGCGCGGCT TGTGGAAACT CATTCCACGA TCCAACAAGA ACGATCACAA ACGGATTGCC AAAACACTCA AGAATCGCAA CCACACCAAC AAGCGGCGTA AATTTATGCA CGCGTCCTTT GGTCTCTTGT TCGCCACGCT TAATCACGTC ATTCCCCGGT CCAAGTTTGT CCCCGGCATG GCTGCTTTAT CCAGTGCCAC GTTACTTATG GAACTGCTCC GTTATCGCAA CGAGTTTGGC TGGATGAACG ACGTCCTACA TTTCGTGCTC GGAAAGTCAC TCCGCAAACA CGAAATGGAA GGGAAATTCA CCGGTTCCTT TTATTTCTTT ACCGGTGTCA CCCTCACGGC CTACCTCTTC CCTCCCACCG CCGCTACGCT CGGAATTTGC CAACTGGCCA TTGCCGATCC TACCGCGTCC TACTTTGGAA GGCAAACCCG ACACGTCTAC TGGAGTCGGA TTGAAAACGG TCTGGGTGGA TTCGGTCGTA ACAAGGGTAT ACTGGGATTT CTCGGCGGCG CCGCCTGCTG CGTACCCTTC AATTACCGAG TTTTGAAACT AGCTAAATTC GGCGCCGTCC CCGTATCCAA CACGGCCGTG TTGGCGGCGT CCGTGGCCCT GGGTCTGGCT GGCGCCTTGG CCGATTTGGC CGTCCCCACT CCCGCCCTCG TTCTACCGAA AAAGGTCCTC GGCGTACGGG TACCACCCTT TCACTTGGAC GACAACTTCG TCGTCCCCGT AATGTCGGGA TGGGCCTGCG TCCGCATTTT TGACGCCCTG GGTTGGTCGC ACACCCTCGC GCTCGCACCC TTGCTCGTAC TGTAAAATAC GTACCATCGC GTTTTCGTGT TTTGGCAAAT GAAATGGGAA ACACCCCTCT AGCTTGCTGT TTTTATACTC T
|
Protein sequence | MTTRSMGRRP LWYGYLLLVL TIPEPSCARV PWGHATPLVY FKDASSSSFS RSLSAASTSF PAPLPGSSSS SSNGGSSASL SRGGSTKTIV SVDDGVDTLD PEAATLSPSA PLLPEESTTA SSSSIKSSEE TTSTARSVTG GAFNSSVVTN NSQSAAETTP PGTKASTSAA VGPKKVTTEY EQLPRRGLWK LIPRSNKNDH KRIAKTLKNR NHTNKRRKFM HASFGLLFAT LNHVIPRSKF VPGMAALSSA TLLMELLRYR NEFGWMNDVL HFVLGKSLRK HEMEGKFTGS FYFFTGVTLT AYLFPPTAAT LGICQLAIAD PTASYFGRQT RHVYWSRIEN GLGGFGRNKG ILGFLGGAAC CVPFNYRVLK LAKFGAVPVS NTAVLAASVA LGLAGALADL AVPTPALVLP KKVLGVRVPP FHLDDNFVVP VMSGWACVRI FDALGWSHTL ALAPLLVL
|
| |