Gene Information |
Locus tag | PHATRDRAFT_49725 |
Symbol | |
ID | 7198405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 70051 |
End bp | 71360 |
Gene Length | 1310 bp |
Protein Length | 401 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184482 |
Protein GI | 219128570 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00466519 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATGTACGGG AGGAAACAAC AAAAGAAAAG GTTGGAAGGA GCATTGATTG TATCGCTCTG TATTGTAACA GTTAAGACAA TCCCAAGAAA GGTGACTCAA CGCGATGCTT GATGACTTCG GAAACGCTAG CGAAGTCGCC ATCTTTTGGG ACTACGAGAA CGTTCCCGTA CCATCTTGGT GTAAAGTGGC TTCAGAAGCT TCCAAGGCAA TTGTTAACTC GGTTTCGGAA CAAGGTCGAA TAGTGGATCG GCGGCTGTAT TTTGATTTTT CAAATCAAGA GCAAGGCAAC AGATGGAGCG GGCTAGATTC CAGTGGGTTT GATCTCGTGA ATACGCCGCA GCGAAACCAG AAGGAGACGC TCGATAAGAA AATGATTGCT GATGTCTTGC TCTTTTGCTG GGATAGTGCG ACGCGAAACC AGGGCACGAA CAAAGGATCA TGCGTTGTTC TGGTCACGAG CGATGGAGAT TATGCTTATA CATTGAATAA GCTTCGTGAT CGAGGGGTTT CCAGCGTAGT GATCTACGGC CACGGTAATG TGGCTGATAT TTTGATCAGC AGCGCAGATG TTGCTCTTAG CCTAAAGCAT GATGTTCTCA AGCATCTTAG ACCACCGTTG GCGCTACCAA ATGGTCGCCC CAAAAATGTC CAAGACAAAA CCGTTTCAAG AAATCCGCAT TTGAAGATTG TGAACGGAAA GCCTGCTTAC CATCTAGACT CCAGGGTTGT GCCAAGAAAA CCAACGCAGC AATCTGAAAG CCAGGCATCA AAGCGAACCT TCGATTGCAA CGACAAATCC ACCACACATT GCTCTGGATC GAACTTGAAA ACAGGCATGA CATCAGCAAA CGATGCAGGA GCAGCAAATA AGCATCCAAA CATCAAATAT GTGGCCAGCC CGGCAAAATT TACAATGCAG CGGGACGCTA TACTTTTGTG CCAGTCATTG GCTGATTTGG CAGTGAGCAC AGCCATAGCT CCAGAGCAAC AATGGGTATC TGAAAGCCAA GCTGGACAGA GCTTCCAAAA AATTCTAAAG ATCTGCAAGA AAGAAAAGGA TTCCGATGGT GATGTGACGC AATGTTTTCA AAACGCATAT TCCCTGGCCA TCTTGCTTGG CTGGATCGAG GAGGGACGTC GTGGTTTAGG AGGCCAAAAT GGCTATATAG CAGCTACATC ATCACCTAGT AGAGAGAAAG AAACGACTGA AACATTTCTA CGATTGACTC CTTATGGCAT AAAGGTTGCA AGCAAGCGTG CATTAATTCT TCCCGTGAAG ACAACAACGC TTGTACCCTC TAAAGCCTAA
|
Protein sequence | MLDDFGNASE VAIFWDYENV PVPSWCKVAS EASKAIVNSV SEQGRIVDRR LYFDFSNQEQ GNRWSGLDSS GFDLVNTPQR NQKETLDKKM IADVLLFCWD SATRNQGTNK GSCVVLVTSD GDYAYTLNKL RDRGVSSVVI YGHGNVADIL ISSADVALSL KHDVLKHLRP PLALPNGRPK NVQDKTVSRN PHLKIVNGKP AYHLDSRVVP RKPTQQSESQ ASKRTFDCND KSTTHCSGSN LKTGMTSAND AGAANKHPNI KYVASPAKFT MQRDAILLCQ SLADLAVSTA IAPEQQWVSE SQAGQSFQKI LKICKKEKDS DGDVTQCFQN AYSLAILLGW IEEGRRGLGG QNGYIAATSS PSREKETTET FLRLTPYGIK VASKRALILP VKTTTLVPSK A
|