Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47560 |
Symbol | |
ID | 7202791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 114205 |
End bp | 115990 |
Gene Length | 1786 bp |
Protein Length | 390 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182001 |
Protein GI | 219123375 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCAGT GTAAGCCCAC TCTGTTCGGG CTTCGTCAAA AGACTGCGGC GACTGACGAT GTCAACTCAG CAAACAAAAA CCTTGCCGCG CAAATTCACC AGGCGATTCG CGACGGCGAA TACGCGTCGC TGTCACGGCT CGTTAAAGAG AAGGAACACC AAGTCTTATG GGAGTCGAAT TCTTCTGGTT GGACGGCTAT CCATTACGCC GCAAGTCACT TTCTTCCGGC AGAATGGTGG GTTTGGATTC TCTCACGCGC TGCAGTCACC TCTGGCGATG AGTTCTTTGA CTCGAAAACC GCACTCGGAG AAACATGCTT TGACATTTTT CTTCGAAGCT ACATTGATCC ACTACCGTGG CAATCGATTC AAGTAAAAAA CTCTTCCAAA CACTTACTAG AAGCAATTGA TTTCGTCTGT CAAGACGATC GACTTATTGC CCAGACAAGA AAAGCCATCA AAATGCAAGA ACGGAATCAG TGCCATAGTG TGCCTCGTTC TTTGGTGACT GGAAATAGGC AAGTTCTCCG GTGTGTTCGA TTCTGGAGCC GCTTGGATAT TCTTTGCCGT GCTGCGGCAG ATCGAGACTT GGAGTATCCT CGCAATGAAA CTCTCGTATC TGTGTTAGTC CGGTGCGGCA CGTGTCCTGA ACCCATAGCA CGGCTTTTAC TAGTACTATA CCCAGAATAT GCTCGAATCA GAACACCAAA GAACTCGCTT CCATTGCATA CTTGGACCGC ACACTCAAAG AGCGATTTCA CAAGTTTAGA CACTACCGGG ATGCTGTACT ATTTGATATC GGCCTATCCA CAGGCTGTAA CGTCGTCCGA TGAACAAAGG CGGCTTCCAT TACAAACGGC GCTTCTTTCA GGAAATCCTT GGTGTTCACT AAGGCCATTG TTTGAAGCAG CACCAACGGT TCTTGAGCAA CGAGACCCTG TTACCTATCT ACCATGTATC TGTTTATCAG TTTTGGCACC CCAAAAAAAT GTCGAAGTAC GAGCACGACA AAATATTGCC GGGAATGGAG GCTTAGACGT TATGTGGAAG ATGTTAACAA AAGAAAAGCA AAAGGCCAGT AGAGAGCAAG CAAGAGCGCA GCTGGAGCTG GAGCGTCTCG ATCTGATCTA CAACGTGCTG CTTGCTTTTC CGGGGGCTTT ATGGACATTT TGAACGAAAG ATATTCTACT TCTCTATCTT TGAGCTACCT CGACTTGACA GTGAGTACTG TCTTCCATTC AAAATTTAAA GGGCTGTCCA TGGGGATTAG TACATGCTTG TGGTATGCCT CCTCTTGCTA TTGTGGATAG TGAAGTGCTG TCTTGCTCCG ATAGTATGCA CTACCTATGA TCCCTTCACC GTAAATGCAT GTCTCTCTTT CCCGATCATC GTCGTTACGA CAGACATTTG AAGACCTCTG CGATGATTTT CTTGAGCAGT TCCCTTTCCT TTTCCAATTT AAAAGATTCT AACAGTCGAC TAAAGGCCAA CCGCTACAGC GCATTTGAAA CAAGGCGGCT CTTTTGATCA CGGATATCGA CTTTGTCGCA CAGCTTACAT ATTGAACGCG ACTCTACAAT CCAATACTTT TGCTGGATCT TTCAGTGAGA GACCTCATTC ATGTATTTTA TTTTACTGGG AGCGCTATTC GCTTCTGGGG TTGCAGCAGA CGTTTGTCTG TCCAGTTTAA TCCAAGGACG CAACAACGTG AAGGAGAGAG GCGCGACCTC ATTCAGCGAA GATACTACTG TTACCAAAGC GGTAATTCTG TCTATTCCTT ACAAGG
|
Protein sequence | MEQCKPTLFG LRQKTAATDD VNSANKNLAA QIHQAIRDGE YASLSRLVKE KEHQVLWESN SSGWTAIHYA ASHFLPAEWW VWILSRAAVT SGDEFFDSKT ALGETCFDIF LRSYIDPLPW QSIQVKNSSK HLLEAIDFVC QDDRLIAQTR KAIKMQERNQ CHSVPRSLVT GNRQVLRCVR FWSRLDILCR AAADRDLEYP RNETLVSVLV RCGTCPEPIA RLLLVLYPEY ARIRTPKNSL PLHTWTAHSK SDFTSLDTTG MLYYLISAYP QAVTSSDEQR RLPLQTALLS GNPWCSLRPL FEAAPTVLEQ RDPVTYLPCI CLSVLAPQKN VEVRARQNIA GNGGLDVMWK MLTKEKQKAS REQARAQLEL ERLDLIYNVL LAFPGALWTF
|
| |