Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41659 |
Symbol | |
ID | 7195985 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 619397 |
End bp | 621529 |
Gene Length | 2133 bp |
Protein Length | 645 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177124 |
Protein GI | 219110745 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.666462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAAG GCGAACAAGA GCTCTTCGAA AAGAAACTCT CCAAGGAAGA GAAGAAAGCT CGTGCCAAGG CACTTCGCGA AGCGAAAAAG AAAGCCAAGG AACCAAAAGA AGGGAAAAAA GACAAGAAGG AAGATGCTGA AGAAAAGAAG GAAGAGGCTC CCGCTCTGAA CTTAGACGCT CTCGACTTGG ATGCTCACAA GGATGCCAAG CGCGAAGCTG CCCTCGACAA GCTTTCCGAC GATGACATCA TTGTAACCTA CGAAAGTAAG AAAGGAGTGT TGCATGCGAA TACTCGAGAC ATTAACGTGT CGGGAGTAAC GGTCACCTTT CACGGAAAGC CTTTGATCGA GGAAACAGAA ATTACCATCA ACTACGGAAA CCGTTACGGA TTCATTGGAC CCAATGGCTC CGGAAAATCC ACGATCATGA AGGCAATTGC TGCGCGTGCT ATCCCTATTC CGGATTCTTT GGATATTTAC TTTTTGGATT GTGAATACCC AGCACGCGAT GACATTACGG CTCTGGAGGC AGTCATGGAA AGTAATGACG AAGTCGGCAT CCTGGAAAAG CAAGCAGATG CTCTCAACAT GGCTATGGGA GAAGCCGATG AAGAGCAACA GACATCCATC CAAATGACAC TCGAAACAGT TTACGCCCGT CTTGATCAGT TGGACGCGAG CTCGGCCGAA GCTCGCGCCA CAACTATTCT GCACGGTTTA GGATTCACCA AGACCATGCA ACATATGAAG ACTCGGGAAT TCAGTGGAGG ATGGCGCATG CGCGTTGCCT TGGCCCGTGC TCTCTTTCTT CAGCCCGAAT TCTTGCTCCT CGATGAGCCG ACCAACCATT TAGATATGGT ACGTTTTTCA ACGTTTTTGG GCTGATTGAA AGCCAACAAT CTCTGAAATG TTCACCTTCT CATTCTATTT CCTCCAGGAT GCTGTGTTAT GGTTGGAAGA ATATCTGTCG AACTGGGACA AAATTCTGTT TTTCGTCTGC CACAGTCAAG ATTTCATGAA TAGTGTCTGC ACAAACATCG TTCGCCTCGA TATGACGTAC AAAAAGCTGC GGTACTATAG TGGAAATTAC GACACATACG TGCAGACGCG TCGGGATCAA GATATGGTGC AAATTCGTCA ATACGAAGCT GAGCAACGTG ATATCGCTGA AATCAAAGAT TTTATTGCTA GATTCGGTCA CGGTACCGTC AAGATGGTTC GGCAAGCACA GGCGCGCGAA AAATTGCTTC AGAAAAAGCT GGAAGCCGGT TTGACTACGC TGCCCGAAAT GGATCCAGAA TGGGATTGGA CATTTCCTGA TGCGGGAGAG CTCCCCGTCC CGGTTTTGTC GATCGAGAAT GTCAGTTTCA ACTACCCCAA TAGTGTCGAG CTCTACAGCA AGGTAGATTT TGGGGTAGAT TTGCAGACGC GCGTTGCCTT GGTGGGGCCC AACGGTGCGG GAAAGACAAC GTTGGTCAAA CTAATGACGG GTGAACTTAA TCCGACTAAG GGGGCAGTGA AGCGCAATAC GCACCTTAAG ATTTCTCGCT TCACTCAGCA TTTTGAAGAA AAGCTTGATT TGACGATGAC TCCACTCGAC TTTTTCAAGC AAAAAGTCAT GCCGGAACAG CCCATTGAAA AAATCCGTCC GCTTTTGGGA CGTTACGGGT GTTCGGGGGA CCAGCAATCG CAGGTGATGA ACCAGTTGTC AGCTGGCCAA AAGGCACGAA TCGTCTTTGC AATTATTGCC CATGAAAAGC CGCACTTGTT GCTGCTAGAC GAACCGACAA ACCCATTGGA TATGGAAAGC ATTGATGCGC TGGCACGATG TTTGAACAAG TTCAAGGGTG GTGTTTTGAT GATCAGGTAC GTAGAAGATT ATCGTTTACC AATGATTTGG ATTGTCTGCT TGCATATTTG CTTTCTTTGC GATGCGCCGG CCTGCACTAA CCAAATGTTC TCCTATTCCT TTTGTCCCCT AGTCACGATA TGCGCTTGAT ATCGCAATGT GCCGAGCAGA TATATGTTTG CGATCACAAG AAGGTTGTCA AGTATACCGG AGATATTATG GATTTCAAAA TGCACACTCG CAAGGAAAAC AACAAGAAGC TGGCTCAGCA TTTGAATGGA TAA
|
Protein sequence | MGEGEQELFE KKLSKEEKKA RAKALREAKK KAKEPKEGKK DKKEDAEEKK EEAPALNLDA LDLDAHKDAK REAALDKLSD DDIIVTYESK KGVLHANTRD INVSGVTVTF HGKPLIEETE ITINYGNRYG FIGPNGSGKS TIMKAIAARA IPIPDSLDIY FLDCEYPARD DITALEAVME SNDEVGILEK QADALNMAMG EADEEQQTSI QMTLETVYAR LDQLDASSAE ARATTILHGL GFTKTMQHMK TREFSGGWRM RVALARALFL QPEFLLLDEP TNHLDMDAVL WLEEYLSNWD KILFFVCHSQ DFMNSVCTNI VRLDMTYKKL RYYSGNYDTY VQTRRDQDMV QIRQYEAEQR DIAEIKDFIA RFGHGTVKMV RQAQAREKLL QKKLEAGLTT LPEMDPEWDW TFPDAGELPV PVLSIENVSF NYPNSVELYS KVDFGVDLQT RVALVGPNGA GKTTLVKLMT GELNPTKGAV KRNTHLKISR FTQHFEEKLD LTMTPLDFFK QKVMPEQPIE KIRPLLGRYG CSGDQQSQVM NQLSAGQKAR IVFAIIAHEK PHLLLLDEPT NPLDMESIDA LARCLNKFKG GVLMISHDMR LISQCAEQIY VCDHKKVVKY TGDIMDFKMH TRKENNKKLA QHLNG
|
| |