Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48747 |
Symbol | |
ID | 7195005 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 133228 |
End bp | 135692 |
Gene Length | 2465 bp |
Protein Length | 667 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183428 |
Protein GI | 219126362 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTAAAAAAAT TGTGGAAAAA CTACACACAC TCCCTTTTTC CTTCGACTAG TTTGGCCTAT TTACATTAGT ACTCAAACTA CTTAGCAGGC AAAATACCCT TCCTGGATTC TACGACACTT TCATTAATCC CAATGAGTGC AGCCTTTAGT CACCGTGCCA ACAAACGCCG CCGCAAGCGG ACGCATCCGT TCCACCACGC CACGGCGGCG GAGGAACGCT TTCTCCAACA GGCCATCCAA AATTCCAAGC TCGATCAGGG ACGGGACGGA ACGCTCGAAG TGCCCTGGGC ACCCACCTTT TATCCCACCG TACAAGATTT CGAAGGCAAC ATGATTCATT TCGTCGAAAA GATTCGGCCT GTGGCGGAAC GCTACGGGAT CTGCAAGATC GTGCCTCCGG ACGGATGGAA TCCTCCCTGT CGTAAGTAAT CAATCCCGTT TGGAGCGTGT CCAGGTTTGG CTGGGAACGG CATACTCGAT CGACTGGAAC GTGACCATGG ACGGTACTCG CGCTTGCCTG AACACTTATT TACAATATGT ACACACACAC ACACACACAC TTGACCTTGA TACCTGTTTT GATTTTGGTC CCGCGAATTA CGGCTAATGC CTCGATAGGA ACTAGTTTAC TGCTAAACCG CGAAATTAGC GGGTAGCTCC CAACGGATTT TTTTTTCTCA CGGCCTTTTT TTCTTTGCGT ACAGAGGTGG ATCGCAATAC GAGAAAGAAA TTTCAAACCA AGCGACAGTT GCTACATCGT CTCCAAGAAG GAATCAGTTT CGACGATGGC GTCGACTATA CACCAAAGGA GTACCAACGC ATGGCCAGTG AACGGACCCA GGAATGGAAG GCTCTCAACT ACCCTGATCA CGATCTACTC TCCCGGCACG CGGACGTAGT CCAGGAGGAT GCCCAGCGCG CCAAGCTCTT TCGTCCGGAA AACCTGGAGC GGGATTACTG GGACATTGTG GAAACCCATA CACGCCCGGT TACGGTGGAC TACGGGAACG ATGTCGACAC GGAAGAGTTC GGGTCGGGCT TTCCTCTTTC GCAGCGCGGA CGGTCCGTGT ACGGCACCAA GAAACTGGAA AAGATGGATC TACCGGAACC TACATTTGGT AGCGAAGATT ACTACAAAGA AACGTGGTGG AATCTCAATA ACATTCCCTG CGCACCGGAT AGCGTGCTAC GCCACGTCAA GGTTGGTATC AACGGAATCA ATGTTCCCTG GATGTATTAC GGATCACTAT TTACCACGTT TTGCTGGCAC AACGAGGACA ATTACCTCTA CAGCATAAAT TACAATCACC GTGGCGCCCC CAAACTGTGG TACGGCGTAC CTGGACAGAG TAAACAAACT GCGGATGGTT TGGAGAAAGT GTTCAAGAGC TTTTTGTCCA TGAAGATGCG TGATGTACCG GATTTGCTCC ACCACATCAC TACCATGTTC AGTCCTAGAC TCTTGCAGAA TGCGCTGGTC CCCGTCTACA AGCTTCTACA GCACGAAGGT GAATTCATCA TCACCTTTCC CCGGGCTTTT CATGGGGGGT TTAGCCTAGG TCCGAACGTG GGCGAAGCGG TCAACTTTGC CACTCACGAC TGGATTGCCT ACGGTTCGGA TGCGAACGAG CGGTACCGTT CCTTCGCTCG TCCGGCCGTC TTTTCACACG ATCGCCTGAC CTTTACTATG GCCAATCATC TACAAGAACA AAAAGCATAT TCCACTTGCA AGCTGCTCTT GATTGAACTG AAACGTGTGG TCGAGGAAGA GTTGCGTTTG CGGGCCAAGC TACTGGGGGA GGGTGTCCGG GATGTGTCCA AGATTATATC TTTGCCGAAG AATCGTCTCG ATCAGTTGGA CGAAAATAGC GCCAACTACG ACGACAAACG TTTGTGTCAC GGCTGCAAAC ATGTATGCTT CTTCTCGGCC GTTGCCTGCG AGTGCAGTCA ATCAAAAGTG AGCTGTCTGC GACACAGTCA CTACATGTGT CGGTGTGCGA CGGAGCGCAA ATACTTCATG ATTTGGAGCG ATGACGAGGA GCTCAAATCG ACGATGGAGC GGGTACGCAA TCACTGCGAG GTACTCAAGA TCAAGGAAGG ATGCACTGAC GAAGCGTTAG CGCAGTGCAA AGATCTTTCC GCCAGTCAAG AACCTCTTCC CACAATGGCC CCTGGCGCTG AGCGGGACTT GGCGATTCAC AAAAACCATG AAATTTCAAC TGCGGAGTTC TTGACCGAGA CGTACCGTTT CAACCCTCCA ATGAGTGCCA GCTTCAAGGA AGAATCAAGG TCGGTGGCGA CGACTGTTGA TTCCGATGCA TCTTCAGGTT GCATGATTGA TGAAGTGGCC TTTGCCGAAG CGGACGAAAA CGAGATTGAA GTTGTGGGCG TTCGCGGGGG CGTAGGTCCG ACTGTCTAGT TGTCTCGATC AGTAAAAAAT AGGTCGTGTA TATCACGCTG GGGAC
|
Protein sequence | MSAAFSHRAN KRRRKRTHPF HHATAAEERF LQQAIQNSKL DQGRDGTLEV PWAPTFYPTV QDFEGNMIHF VEKIRPVAER YGICKIVPPD GWNPPCQVDR NTRKKFQTKR QLLHRLQEGI SFDDGVDYTP KEYQRMASER TQEWKALNYP DHDLLSRHAD VVQEDAQRAK LFRPENLERD YWDIVETHTR PVTVDYGNDV DTEEFGSGFP LSQRGRSVYG TKKLEKMDLP EPTFGSEDYY KETWWNLNNI PCAPDSVLRH VKVGINGINV PWMYYGSLFT TFCWHNEDNY LYSINYNHRG APKLWYGVPG QSKQTADGLE KVFKSFLSMK MRDVPDLLHH ITTMFSPRLL QNALVPVYKL LQHEGEFIIT FPRAFHGGFS LGPNVGEAVN FATHDWIAYG SDANERYRSF ARPAVFSHDR LTFTMANHLQ EQKAYSTCKL LLIELKRVVE EELRLRAKLL GEGVRDVSKI ISLPKNRLDQ LDENSANYDD KRLCHGCKHV CFFSAVACEC SQSKVSCLRH SHYMCRCATE RKYFMIWSDD EELKSTMERV RNHCEVLKIK EGCTDEALAQ CKDLSASQEP LPTMAPGAER DLAIHKNHEI STAEFLTETY RFNPPMSASF KEESRSVATT VDSDASSGCM IDEVAFAEAD ENEIEVVGVR GGVGPTV
|
| |