Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42685 |
Symbol | |
ID | 7196335 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 786302 |
End bp | 789485 |
Gene Length | 3184 bp |
Protein Length | 847 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177162 |
Protein GI | 219110821 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0260166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTGG CCGACTTTTC ATCACCCTTT CTGGTGCTGG TCAGCCATGC AATTGCGATG GGCAAAGGGG TGATGGCACG AAGCGCCACT GATTCACTGT ATTCGACAAA TGGTAACTCG ACACACGTTG GTGCAGTCTC CACGAAAAAA GCAAAACGGC GATTGGACAC CGACGGATTT GCACCTTTAT CATGCAACGC CAACTTGGAT ACGACCTCAT GTGCGTCATT CACGTCGATT TTCGGAAACA AGGCTGTATA TTCATCTCGA CTCATCGTTC CCTGTGGCTC ATGTATAACT ATGGACCATC CTGGACCGAC ATTGACCTTC GGCGATGGGC TTGATATTCA TGGAAAGCTC GTCTTTCCCG ATAGGTACGA ACTCATTGTC AGGTCGACCA TGATTCTGGT ACAAGGCGAG CTTCACATTA CCGCTGTAAA ACCGGTCGAT GGAAATCCGC AGGTCAAATT TGTCATGCAT GGTGACAAGG ATGAGTTCTT CGAGCCTGCT GACAGCAATT CCAACATATG TGGCGATGGT CTTTGCGAAG CTGGGAGTCG GTCTATCACT ATCGCGGGCG GAAAGCTTAG CCGTAAGTAA AGTACTCTGC TAGCAATTCC GTCAGAGAAA CTGATGATGC TTTTTTCTAA TCTACTTCTA TCGTTCTCCT GTTCACGGTG CAGTCCGTGG TGTGCCCGCC GAGACTCCAA CTTTCATGAA TTTGTACGAT ATGGACGGAG ACTCAACCAT AATCCTCCCG GACTCAGTAT TGGAGAAATG GGAACCAGGG GCTCGAATCG TCATCACTTC AAACACTCAG GCTTGGTGGG CAGACCAACA ACGAACCATC GAAAGAATAT CCGTTGCCAA GCCTGGTTTT GTCAACATAG AACTAAATTC TCCTATTACC CGGCCAACTA CTGTAAAAGA CGATGTTGGG TTTTCTGTCG AGGTCGCTCT GCTCTCTCGT AACATCATGT TCGAAAGTGA GGACGGCGGC GGACACTTCT GGGTCATGCA GACCCCGCTC GTCCAGCAAC TGATCGAAGG TGTCGAGATC TCTAACTTCG GACAGCAGGG GAGGCTGGGC AGGTATCCGT TGCACTTTCA TATGTGCGGA GACGTGCGAG GTTCTATTAT CGCCAAGAAC ACCGTCCGAA ACTCCAATCA GCGCTGCTTT GTCGTACACG GAACCAATAA CCTCCGTCTT GAAGACAATG TTGCCTTCGA CACCAAGGGA CATTGCTACA TGCTTGAAGA TGGTATCGAA ACCGGCAACG AGTTTGTTCG TAACATCGGT ATTCGCACCG GAGCGCCCAG AACGATCATT CCGGACATGG GTTCGAACGG CATCGAAAGT GATGGACTAC CGGCGACGTT CTGGATGACC AACCCCCATA ACACCTGGAT CGACAATGTT GTCGCGGGGT CCGAGCATAC AGGCTTTTGG TTTGAGCTGT TAAAGCGCGG TGACCGAAAA GTAGACTTTC CTAATCTTGA CCCGAAGACA GATTCCATCA TTAAGTTTGA TGGCTTTGTT GCTCACAGTA CCTCAGCAGT AGGGTTCACG TACTATCTGT CTGGCTACGA GCCAACTTCG CTTCAATTGT TTGACAATAT GCGCTTCTAC AGAAATCACA ACAAGGCCAT TAGAATCCAC CGGACACGCA ATGTTGTAGT CCAAAATGGA ATCTTTATGG ATAATCCAAT CAACGTTGAG GTCGATCGTT CCGAGGAAGT ACACCTATTG AATACTTCAA TCGTGGGGTA CTCTGATGGA TACAAGGATA CTGTTAGGCA AGGGGGTTAC GGGTTCGCAT TAGCGCCTTG CGGCAATCGA AAAGATGAGT TGCTGGGGTT GACCTTTCAG TCAACAAATC CTTGGATTTA CAAGATGGAA TCAGAGTTTA ACGGTGTCAT CATGGACAAT GTGAGATTTT CCGGTTTCAG TAGGAGTGGT TGTGGGTCTT ATTCTTCCAT TGACCTTGCC TCTCGTCTGG ATGGCTTCAA AACCTTCGAG ATGTTCTCTT CTTTTACAGA TGTGACCATC GACAACCCCG ACTCGATTGA TTTTTGCAGT GGCGCATCTA CCAATGCAGA TGATGTGTAC GTCTCCGATG TCCACGGCAC CCTTTTTGGA GATAGGGGTC CGTCAACTCT CTTACGCAAT TCGGCTATCT TGCTTCATTT TGTCGACGAG AGCAAGTGCA CTGACAATAA TGAAAAGTGC TACAGCTACT GTGAGGACAC TTGCTTCCGC ACTGTATTTT ATCATGTCTC ACAGAGCCAA AGAAATGTAT ATGCACTGAA AGTGTGTGAT CGAAACAGTA CTTCGAATTG TGTCGAGGTC TCAGGTACGG TATCTCGACC AGCCTGGCCT ACGCGCTTCG CCGTCCATGT TCCAAGTGGA CGCCAATACA ACGCTTTTTT CGTAGACGCC AATGAGAAGA GAGTCTATCC TGACGGCGTA CAAATTACTT TTAAAGACAA GTTGTGTCCA AGTGCCCCAA ATGAAGTCGA CATTTCCCTC CTCGGCTCGG GTGGCGAAAT TGCTCCCCCA ATAGCGAACC CAACAACCTT TCCCTCAAGG CCTCCGACAC TTCGACCTAG CTCGGGTGGA GTCGAATCGA CGACGAAATT GTCAACGAGC GCTCCCGAGG GAGCGCAATT GGCCCCTGAC CCATCCCTAG CTGAGGATCG GAGTGTTCCC TCGCTGAGCG TGGGACGCAG CAAATGGTGG AAGTGGTTCC CAGGCTGGAG CTGGTAAAGT TGAGCCATTT GTTTCCTAAA GATGAAGCCG TCTGATCCCA CTACATCCTG AACTTCTACT AATATATGAC TGTCTGTATC TTCGTTGCCT TCGGTCAACT GGATCAGAAA TTCCATTCCA CATACATCGG ATATTTGTCA TGAGTTAGTA GACAAACGCA TGACTTGTAT CGAGGTAAAG AGCCGATGGC CATCCTGCAA TGACCAATCG CTAAAACTAA TCCTTGATTT CGCAATGCAG TCGCCAATGC AATCGAGTAT TGGAGTCGTT ACCTATAGTT AGCCTTGGAT GAAGCGTGGC CAATCTCCTA CATGATTACA TTTTCCCATG TCTTCAACAT ACCAGCATTG AAGTTTTCGC TTACACACAA TGAAATATAC TTTTACTGAT TGATAATTAG GATTTATTAG GTTA
|
Protein sequence | MKLADFSSPF LVLVSHAIAM GKGVMARSAT DSLYSTNGNS THVGAVSTKK AKRRLDTDGF APLSCNANLD TTSCASFTSI FGNKAVYSSR LIVPCGSCIT MDHPGPTLTF GDGLDIHGKL VFPDRYELIV RSTMILVQGE LHITAVKPVD GNPQVKFVMH GDKDEFFEPA DSNSNICGDG LCEAGSRSIT IAGGKLSLRG VPAETPTFMN LYDMDGDSTI ILPDSVLEKW EPGARIVITS NTQAWWADQQ RTIERISVAK PGFVNIELNS PITRPTTVKD DVGFSVEVAL LSRNIMFESE DGGGHFWVMQ TPLVQQLIEG VEISNFGQQG RLGRYPLHFH MCGDVRGSII AKNTVRNSNQ RCFVVHGTNN LRLEDNVAFD TKGHCYMLED GIETGNEFVR NIGIRTGAPR TIIPDMGSNG IESDGLPATF WMTNPHNTWI DNVVAGSEHT GFWFELLKRG DRKVDFPNLD PKTDSIIKFD GFVAHIQNGI FMDNPINVEV DRSEEVHLLN TSIVGYSDGY KDTVRQGGYG FALAPCGNRK DELLGLTFQS TNPWIYKMES EFNGVIMDNV RFSGFSRSGC GSYSSIDLAS RLDGFKTFEM FSSFTDVTID NPDSIDFCSG ASTNADDVYV SDVHGTLFGD RGPSTLLRNS AILLHFVDES KCTDNNEKCY SYCEDTCFRT VFYHVSQSQR NVYALKVCDR NSTSNCVEVS GTVSRPAWPT RFAVHVPSGR QYNAFFVDAN EKRVYPDGVQ ITFKDKLCPS APNEVDISLL GSGGEIAPPI ANPTTFPSRP PTLRPSSGGV ESTTKLSTSA PEGAQLAPDP SLAEDRSVPS LSVGRSKWWK WFPGWSW
|
| |