Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49751 |
Symbol | |
ID | 7198340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 164121 |
End bp | 165699 |
Gene Length | 1579 bp |
Protein Length | 451 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184576 |
Protein GI | 219128766 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTTGT TTACATCCCG GAGACTGAGC CTACGCTCAG CGCAGCAACT TCCAGGGATG CTGGCGATGG CAAATCCCGG GTACCCTAGC AACAATACCT CCAATTCCCA GGATGACAGG GCCATCGAAA CTTGTCATCC GGGAACGCTT CGTCCGTCGC GACGCACGAG GATCGAATCT TGGACAACGG TTCGAAGGAA CTACCACGTC TCGGCAAGTC GAAGCTTTCT GCTTCCCTTG TTAGCGCTCG TTCCCGTTGC GGTGCCCGCC GTCTATTTTG GTCGAAAGTA CTTGCGAGCG CAACAGTTAC GACGAGAAAG GGTTGAGGAT CAGCAATGGC ACGTCTTGCA AGAGCAGCGC AAAGCAGACC AGACAAGCGG AAAGGAGTAC AACAACATCG TGCTGGACTT GGGAACAACC GACTTGAAAA CTGTCGCATT CAATAACCCT ACCGTGCAGA GTGATTTCCA GCCGGACGCA TTGTGGGCTC TTCAAAGCAG CACCACCACC AACGCCACCA CCAATAGCCT TAATTCGAAG GCACTCTTTC GTCTCGTACG GCATTACTTG CAAAAGTCCA TGCAGCAACC AGTCGACTCA CGCCGAGACC ACGTTCGTGT GGTGCTGACG GTACCACCCA AGCCAGAAAC TGCTGAGCGT TATCGTCACG TATTGCAGGA TTCACTCCCG TCGGATTCCT CCGTGGTATG GCTTCCGGAA CCGGTAGCGG CCATTTGGGG TGCGCAAGTC CTCCATCTCT TGCCCATCCC ACATGATCCT CGAACGCTTG TGATTGACAT TGGGGGACGA ACGTCAACCG TGTCGCTTGT GGAAAAAGAT CACGTCGTAG CAGCCACAGT CTTGCCCAAT ATTGGTGGAC AGGTCTTGAT TGACGCCTAC CAACGGAACG AGCCGCATAG TACGTGGAAA GAGGCACAGC AGGCTGTCTT GGATTGCGAG GTAGCAGCCC ATCTATACAG CGATGTGGAA GCTACTGTTC TACAGGAAGC GCTTCCGAAA TTGTACCGAG GAGTTCATTT GTCGGACCAG CTGCCCAAAC CAACCACTCT GGAAACTCTC TGGGAGTCTG CTGTTGCCCA TTTGCTGGAA ACTTTAGAAA GCTCCTCATC ATTGCGCTTC ACTACCGTCG TGGTTGTAGG TGGGGCCTCC AACAATGAGA CGTATCAACA TTCCCTCCAA GCAGCCGTCA CGAAGACAGT AGCTCCGCAA AGTGTCGACT GGATATTGCC TTCCAAAACG GATCGATCTC AACTCGTTAC TCTCGGGGCC AGGTCCATGA TGGCCTCCTG TGACTATTCA TTTGACAAGG GTCTGTGCCC AAAATCCAAT GTGTGAGAAG AAACAACTCA CCTGTCAGCA ACGGGACTAT AGTAAGAAAG GTTGCGTTAT CAAACAGCGC TGTAGTCTCC GGACAGACCC AGAAGAGCAG CTTCGTTCGG CTCGAGCGGG CAACAAATGG GATCTGGGAG CTCCTGTCAC GACATTTGTG CCAGCTCTTT ATTGAGGATT TGAGATAGTC ATTAACAGCA TGTTACCAAG GAAAGCGATT TATGCTGAA
|
Protein sequence | MLLFTSRRLS LRSAQQLPGM LAMANPGYPS NNTSNSQDDR AIETCHPGTL RPSRRTRIES WTTVRRNYHV SASRSFLLPL LALVPVAVPA VYFGRKYLRA QQLRRERVED QQWHVLQEQR KADQTSGKEY NNIVLDLGTT DLKTVAFNNP TVQSDFQPDA LWALQSSTTT NATTNSLNSK ALFRLVRHYL QKSMQQPVDS RRDHVRVVLT VPPKPETAER YRHVLQDSLP SDSSVVWLPE PVAAIWGAQV LHLLPIPHDP RTLVIDIGGR TSTVSLVEKD HVVAATVLPN IGGQVLIDAY QRNEPHSTWK EAQQAVLDCE VAAHLYSDVE ATVLQEALPK LYRGVHLSDQ LPKPTTLETL WESAVAHLLE TLESSSSLRF TTVVVVGGAS NNETYQHSLQ AAVTKTVAPQ SVDWILPSKT DRSQLVTLGA RSMMASCDYS FDKGLCPKSN V
|
| |