Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40901 |
Symbol | |
ID | 7198736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 231174 |
End bp | 232391 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184922 |
Protein GI | 219129493 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGTCA ATGCATTTGA AGTCAGCGGC AAGATTGATT ACACCAAGCT GGTTGACAAA TTTGGATCCA ACCTCATTTC AGACTCTCTT ATGGATAAGC TGGAAGCGTT AACGGTTGGA AAAGGCCGAG TTCCCCGGAT GCACCGCTTT TTACGCCGGG GAATGTTCTT CAGCCATAGA GATCTCGATA CCTTGCTGCG TCAGGTAGAG GCCGGTGCTC CAATGTACCT TTACACTGGG CGAGGGCCTA GTTCCCAATC GATGCATCTA GGGCATCTTA TACCCTTCCT TTTTACCAAA TGGTTGCAAG ATGCCTTGGA CGTCCCGTTG GTCATCCAAA TGACGGATGA CGAAAAGTTC TTATTTAAAG GACATTACGA TGACCAAACC GGCGACAATT TATTAGACTT TCAAAGTTTG ACCATGGAAA ACGCCAGGGA TATTATTGCG TGCGGCTTTG ACTACAACAA GACCTTTTTG TTTTCAGACT TGGATTATGT CGGTAGCATG TATCCAAACA TTGTTCGCAT CTGGAAGGCG GTCACGACCA ATACGGTAAA CGGAATTTTC GGTTTCGATG GATCTTCAAA TATTGGCAAG ATTGCTTTTC CCGCCATTCA AGCCGCGCCG TCTTTTGCCA GTAGTTTTCC AGTCGTCTTG GAAGCTGACC GTAATTCCAA TCATTTGTGT CTGATCCCCT GCGCGATTGA CCAAGATCCT TACTTCCGCA TGACGCGGGA TGTTGCGCAC AAACTAGTTC ATAAGCAACA TGGTCTCGGT GGGAAACCGG CACTGATTCA CTCTAAATTT TTTCCTCCGT TGCAAGGCGC CGAAGGCAAA ATGTCGAGCT CCAACACGAA CTCGGCTATA TTTTTGACGG ATTCGCCGGA TGACATTGAG CGGAAAATTA AACAACACGC CTTTTCTGGT GGACGAGAAA CCAAAAAGGA ACAGCAAGAG CTCGGAGCTG ACTTGGAGGT AGATGTGTCC TACCAATGGA TGCGGTTTTT CTTGGAAGAC GACGACGAAT TGGAAAAGAT TGGCCAAGAT TACGGTAGCG GATCCGGCGA ATATTGGAAC ACTGGCAAGG TGAAGGGGCG CCTGATCGAA ATTCTAAAGG AATTGGTAGC GGAGCATCAA GAACGACGGG CAACAATTAC CGACGAAGAA GTTCGCAAAT GGATGGCTGA GCGTAGCATC GTTAAGAACA GCACTTGA
|
Protein sequence | MVVNAFEVSG KIDYTKLVDK FGSNLISDSL MDKLEALTVG KGRVPRMHRF LRRGMFFSHR DLDTLLRQVE AGAPMYLYTG RGPSSQSMHL GHLIPFLFTK WLQDALDVPL VIQMTDDEKF LFKGHYDDQT GDNLLDFQSL TMENARDIIA CGFDYNKTFL FSDLDYVGSM YPNIVRIWKA VTTNTVNGIF GFDGSSNIGK IAFPAIQAAP SFASSFPVVL EADRNSNHLC LIPCAIDQDP YFRMTRDVAH KLVHKQHGLG GKPALIHSKF FPPLQGAEGK MSSSNTNSAI FLTDSPDDIE RKIKQHAFSG GRETKKEQQE LGADLEVDVS YQWMRFFLED DDELEKIGQD YGSGSGEYWN TGKVKGRLIE ILKELVAEHQ ERRATITDEE VRKWMAERSI VKNST
|
| |