Gene Information |
Locus tag | PHATRDRAFT_16987 |
Symbol | |
ID | 7199293 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 291673 |
End bp | 292986 |
Gene Length | 1314 bp |
Protein Length | 404 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185464 |
Protein GI | 219130630 |
COG category | |
COG ID | |
TIGRFAM ID | |
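The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which is consistent with the listed gene length but is not stated in the record. The constant and function names are illustrative only. The difference between the gene span and three bases per residue is simply the part of the span not accounted for by codons of the 404-residue protein; the record marks the gene as spliced, but it does not say how that remainder is divided between intron sequence and anything else.

```python
# Values copied from the record above.
START_BP = 291673
END_BP = 292986
PROTEIN_LENGTH_AA = 404


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


gene_length = span_length(START_BP, END_BP)   # bases covered by the gene
coding_bp = 3 * PROTEIN_LENGTH_AA             # bases needed to encode the protein
non_coding_bp = gene_length - coding_bp       # remainder within the gene span

print(gene_length, coding_bp, non_coding_bp)  # prints: 1314 1212 102
```

The computed span of 1314 bp agrees with the Gene Length field, and the 102 bp remainder is compatible with the "Is gene spliced: Yes" flag.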
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGACT CTCTGACGTT TCGAGACTTG GGTCTGAGTA CCGCCGCCTT GCGAGCGGTC AAGTCGCATC CTGATTGGAC TGCTCCGACG CTAGTGCAAC AACTGGTCAT TCCAAAGCTC CTGGAGGACA TCGGTTCCCC ACGGAAGCGT TCCATCTGGT GCGAAGCTCC GACGGGTTCT GGCAAAACGG CAGCGTACGG ACTGCCACTC TTGCAAAATA CACAAACAGC GTGCTTTCGG GAACCAAACG CACTGATCCA AGGTGGGATT TCCTCCATAA TTATCCTTCC GACTCGAGAG TTGGCAGTGC AAGTGGGCGT GGTCTTGTCG GAGCTTGCTC AGAATATGTC TCGGGGGGGA TTCAATATTA TGGTTTTGTA CGGAGGAACT CCATTGCAAT CGCAAGTTGA TCGAATGGAT GAGTACGCTC GTAGTGGAGA GACCATTCAT GCAGTGGTGG CTACACCTGG CAGGTTTCTA GACGTGATGG CCCGTGTCGA ACACCCCACT TTACTGGACA ACCTACGCTA CCTTGTTTTG GACGAAGCCG ACAAGCTAAT GGGTAACGGG TTCGCAAAGG AGCTGGACGG TGTCTTGAAT CTGCTACCCC GCAAAGTCCC GACGTGGCTG TTTTCGGCGA CCTTTTCCAA GAGTATGGTT CCTCAGGTGG CAGATGTAAT GAAGCGGCTC GAAATTGTGG AACCGCCCCT GCGGATCACC TGTGCCAACT CGGATCGCCG GGCACCAGAT GAAACAGCCA GCTTACAGAA GCGTTTGAAG CGTTTCGCTC AAGGTGAGGA AATGGAATTG GTTGGACCCG CTTCAACTAT TGATCTCCGG ACGATTCGTT TGCACCAGCG AGACCGCACG CAAGTTTTGC GGAGCCTGTT GGAAGCGAAT AAGGAATGGG ACCGAGTCTT GGTCTTTGTC GCTACGCGAT ACGCGTGCGA GCATGTTTCT CGAAAGCTGC GCCGTCTCGG TATTCCGAGT AGCGATTTGC ACGGTAAGCA GGATCAAGAC ATTCGTTCGC AACAGCTTGA AAGTTTCCGC AGGGGCCATA CACGAGTTCT CCTGGCAACC GACTTGGCCT CCCGCGGCTT GGATGTGACT GCTTTGTCAG CGGTCGTTAA CTACGATCTC CCCAGATCCT CTGCAGATTT TATACATCGG GTGGGACGAA CCGGGCGAGC AGGGTGTAAA GGAGTAGCAG TGACCTTTCT TACAGCCGAT TCGGAGGCGC ATTTGAACTT GATTGAAAGT CGTCATCTGG CGGAGCCCGT TGCACGAGAA ATCTACCCGG GTTTTGAGGT TGAC
|
Protein sequence | MADSLTFRDL GLSTAALRAV KSHPDWTAPT LVQQLVIPKL LEDIGSPRKR SIWCEAPTGS GKTAAYGLPL LQNTQTACFR EPNALIQGGI SSIIILPTRE LAVQVGVVLS ELAQNMSRGG FNIMVLYGGT PLQSQVDRMD EYARSGETIH AVVATPGRFL DVMARVEHPT LLDNLRYLVL DEADKLMGNG FAKELDGVLN LLPRKVPTWL FSATFSKSMV PQVADVMKRL EIEMELVGPA STIDLRTIRL HQRDRTQVLR SLLEANKEWD RVLVFVATRY ACEHVSRKLR RLGIPSSDLH GKQDQDIRSQ QLESFRRGHT RVLLATDLAS RGLDVTALSA VVNYDLPRSS ADFIHRVGRT GRAGCKGVAV TFLTADSEAH LNLIESRHLA EPVAREIYPG FEVD
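The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string could be cleaned and its GC content computed follows; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any tool behind this record, and the rounding used to produce the GC content field (54%) is not documented here, so the sketch returns an unrounded percentage.

```python
def clean_sequence(display: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(display.split()).upper()


def gc_percent(display: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (whitespace ignored)."""
    seq = clean_sequence(display)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence above, used only as a short demo input.
prefix = "ATGGCCGACT CTCTGACGTT TCGAGACTTG"
print(len(clean_sequence(prefix)))
print(gc_percent(prefix))
```

Pasting the full Gene sequence field into `gc_percent` gives the value to compare against the GC content field, and `len(clean_sequence(...))` gives the lengths to compare against the Gene Length and Protein Length fields.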
|
| |