Gene Information |
Locus tag | PHATRDRAFT_48834 |
Symbol | |
ID | 7195133 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 399322 |
End bp | 401198 |
Gene Length | 1877 bp |
Protein Length | 597 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183481 |
Protein GI | 219126473 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
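The length fields in the table above can be cross-checked with simple arithmetic. The sketch below is only a consistency check, not part of the record. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon (the listed gene sequence begins with ATG and ends with TAG). The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 399322
end_bp = 401198
protein_length_aa = 597

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the span covers the start codon through the stop codon,
# so the coding portion is one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Whatever remains is not translated, consistent with "Is gene spliced | Yes".
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)
```

Under these assumptions the computed span matches the listed Gene Length of 1877 bp, and the untranslated remainder is what the spliced-out portion of the gene would account for.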
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.935241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTGCC TTGCTCATCA CCACTGGGGC GGAAGGCTAG TTTCAAGGAT TGCTATGATC GGTTTGGTGC TCATTCTTGT CGGTTTGAAT TTGTGGAATG ACTCCGCTCC AATCAACCCA ACAATTCGAT CTTCAGATAA CTCTGGCACG CAGTCCTCGA GTCTTTCCCG AAGCTTTCCT CTCCCCTACC GCAATGATAC TCACGAAGGA AACAAAGGTG TAAGTCAGGA TCAGTGGGTG CAATGCAAAA GAATTGAAAA GTTTTCGGTC TTATTCGTCA AAATTCTTTT GTGGTGGTGC AGAATGGCAA CGAAAACTTC ACTTCTCCCA GCAATCTAAG TACCACTCCT AATAACGCAA ATTCCTCGTC AAAAAGGACA ATTTCCCCTG CACTACAACC GCGATCAGAT GCATCAAAAG ACTCCATATT CACCGTTCAG GAGGTAAAAT CGGCTGAGGC TTTATCTAAA ATGGAAAGCG AGAGCCTGTC GAAAAATCTC TCTTCTGCGC TGCTCTCGAA GCATTTTCCA ACAGCGGATC AACGAGTCCG ATTTTACATG TCATCCTGGT ACGAGCCACC ATGCGATCAG GACGAATTGC TCGAAATTGT CAAGTATGTT GGTAGCAGGG GCAATGAAGG CAGCACAGCG GCCAACGAAA CCGGAGAGGA GCAAAAAGAT GACAGTATGA CACAATTCAT TCCTTCCTTC TCCCTCCACC GGCACGTACA ATCCCCACAA ATATCCAGCT CCATTATATT CAAGGGAGCA GCTAGGGCCG ATACCGACGT CGTATTTGCT TTGGACAAAT CTGCTCTCGA TTCGTGCGAA TTTAATCCCC GACCAGACCA AAAGAAAATC TTGGAGAGCA TCTATTGTCC AGAACTTCGA GACACATTAT TACTGCCTTA TCAAAACCAA ATTGGCCTAA CCAACACTTC AAAAAACAAC GAGCTCTCGG TTGTGTTACT TGCGCAAGTC GGCGACGCCC TGTCATCCAG AGCTATGGAC GATTTTGGAA AGCCACATGG ATACTCTGCG CAACCTACTG TACCTCATTT TACTAAAGTA CGGCTTGCTT GGGATAACGT GACTGCAAGA GTTTCCCTAA TGAAAGCATC ACCTAGGTCC TGCTCAACCT TACAAACACG ACGAATTAAC CGCGGGAAAC TGGAACCAAT TATTTGGAAA ATGGGAATCA AGCGGCACTA CAAAGGAGTG GAGGACGTCC CCAGTGAGGA CGTACCTTGG GAAGAGAAGC GGGATGTTGC TGTGTTTCGG GGTACCTCTA CTGGGGATTT TGACCAAAAA ATGCCGGCTC GCGAGCGTTG TCGTCAAAAT CAGCGCTGTC GGCTTGTGTT GGACTACCAC AATTCGTCTC TCGTGGATGC AAAGTTTACC AACATACTCA GCAGGAGCAA CCTGCCATTA GCGATCGACG GTATCCCTAT AAATGGCAGT CATCTCCAGA GATATGAGCA GCTAAGGTAC AAGGCGTTGA TATTCATGGA AGGCAACGAT GTCTCTACCG GATTGAAGTG GGGATTGTAT TCCAACTCGG TTGTTATGAT CACAAAGCCA TCAATTTCGT CATGGGCCAT GGAAGAGCTC TTGGAACCGT ACGTACACTA TGTGCCTTTG AGGGACGATC TATCGGACGT GGAAACGCAG ATGAAATGGA TCGTGGAGCA CGACAGGGAG GCGAAGGAGA TTGCGTTGCG GGGGCAGCTT TGGATGCATG ACCTGCTGTA CGCCGAGGAG TCCGAGAGGG ACAATGCGGC AATCAATGAA GAGATTTTGC 
GGCGATATCA GACTCATTTC CGACCCGGCA TTGCGGTCAA GGAAGAGCTT CTATTCTATC CGAAGCCGTT GAAGTAG
|
Protein sequence | MTCLAHHHWG GRLVSRIAMI GLVLILVGLN LWNDSAPINP TIRSSDNSGT QSSSLSRSFP LPYRNDTHEG NKGNGNENFT SPSNLSTTPN NANSSSKRTI SPALQPRSDA SKDSIFTVQE VKSAEALSKM ESESLSKNLS SALLSKHFPT ADQRVRFYMS SWYEPPCDQD ELLEIVKYVG SRGNEGSTAA NETGEEQKDD SMTQFIPSFS LHRHVQSPQI SSSIIFKGAA RADTDVVFAL DKSALDSCEF NPRPDQKKIL ESIYCPELRD TLLLPYQNQI GLTNTSKNNE LSVVLLAQVG DALSSRAMDD FGKPHGYSAQ PTVPHFTKVR LAWDNVTARV SLMKASPRSC STLQTRRINR GKLEPIIWKM GIKRHYKGVE DVPSEDVPWE EKRDVAVFRG TSTGDFDQKM PARERCRQNQ RCRLVLDYHN SSLVDAKFTN ILSRSNLPLA IDGIPINGSH LQRYEQLRYK ALIFMEGNDV STGLKWGLYS NSVVMITKPS ISSWAMEELL EPYVHYVPLR DDLSDVETQM KWIVEHDREA KEIALRGQLW MHDLLYAEES ERDNAAINEE ILRRYQTHFR PGIAVKEELL FYPKPLK
|
| |
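The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first three blocks of the gene sequence are used here as a demo, so the result is not the record's GC content value, which is presumably computed over the full gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by removing all whitespace, then upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record, as a short demo.
fragment = "ATGACTTGCC TTGCTCATCA CCACTGGGGC"

seq = clean_sequence(fragment)
gc_count = sum(base in "GC" for base in seq)

print(len(seq), gc_count, round(gc_fraction(seq), 3))
```

Pasting the full gene sequence into `fragment` in place of the demo string would let the same functions be compared against the Gene Length and GC content fields.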