Gene Information |
Locus tag | PHATR_43871 |
Symbol | |
ID | 7204288 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 277541 |
End bp | 279605 |
Gene Length | 2065 bp |
Protein Length | 673 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186032 |
Protein GI | 219112897 |
COG category | |
COG ID | |
TIGRFAM ID | |
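The Start bp, End bp, and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive. A minimal Python sketch of that check, assuming this coordinate convention (the function name `feature_length` is illustrative and not part of the source database):

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = feature_length(277541, 279605)
print(gene_length)  # 2065
```

Because the gene is marked as spliced, the 2065 bp span is longer than the 3 × (673 + 1) = 2022 bp needed to encode a 673 aa protein plus a stop codon. The 43 bp difference would be consistent with intronic sequence, though the record does not list exon boundaries.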
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGTGGACC AAACAGCACT CCCAAAGACG TCTGGCGCCT CCCGCCACTT AGATAATGAT GCTCCAAGGG ATGAGGAAGG CCTGTTGCTT GAATACGATG CGATCATCTG CGGTACAGGC TTGACTCAGT CGATTTTGGC CTCCGCCTTG GCTCGCCATG GAAAGTCAGT CTTGCACTGC GATACTTCCG ACTACTACGG CGAATTGGAT GCCGTTTGGA CACTACCGTA TTTGCAAGAA AAGTTAGGAA AGGGACACGA ATTAAAAGGA AAGATGGTGA CTGTCAGAAC GGATGATTCA ACCTGCATTC CTCTGTCTCC AAAGGGTGGT TTGCAAAGCT TTCAAATACA TTCTCTCGAG CGAAGGAACG ACTTTACTGT AGAACAAGGA ACAGCAGTAG TTACGCCTTA TGGGAATGGC ACAATTATAT CGATTTCACC AAGGGGGCTC TTTCTTGATT TATTAGTTTC ACTCGACACA TGGAAGCTAG CGAATGGAAA AAACCCATCT GTTCATATCT CCATTCCGCA ATCCGCAGCG AGTACTGGCG CAGCTTTTCA GAAATATCTG AAGGAAAAAC TCGGAATAGT TTCCATCGAT GAGCGTGATG CTCATTTGAT TTTGGACGAG CGTAGTCGGG GATTCGCGGT AGACGCGACA CCAGGATTGC TCTATGCCTC TGGCGCATCG GTCGAAGGGT TGCTAAAAAG TGGCGTCGCC GACTACCTCG AGTTCAAGTC TCTCGAAGGC CTCCTTTGGT TGGAAAACTC CGACAGTTGC TTGGAACCCG TGCCTTGTAG TAAAAATGAT GTTTTTGCAT CAAAACTTTT GTCACCCATG GACAAGCGTC GACTTATGAA GTTTCTGCAA CTTGCGTTGG ACTACGGCAC AGCCCATGCG TTGGCCGAGG AAGAAGCTTC TGCCGAAAAC AACAAACAAG TCTTATCTCT GAACGAGCGG TATTTGAATC AAGGGAGATC TCTGGCACGT CCTCAAAATA AGGCAGTACT AGCAGACGAT ATTCGAGCTC TGCAGGAATT CATACAGGCT GACATGAAGT TTCATGAGTA CCTCGAAAGC AAACAGCGGT TGTCTCCCAA ACTGTGCCGC ATAGTACGCC ATGCACTAGC GCTTGAATCT GGCAATGGTG ATTGGTCGTT GAGGCAGGGT ATGACTTCTT TGTGTCAACA CATGCAGGCT TTAGGAAGGT TTGGAACAAC AGCTTTCTTG GTACCTATGT ATGGATCGGG TGAGCTTCCA CAGGCCTTTT GTCGATCGGC AGCCGTTTAT GGGGCTACCT ATTTGCTACG ACGTGCTCCG CTTGCAATTG TTACCGATAA AGAAAGAAAC AACAGTGAAA GCTGCCGCCA TTGCCACAGA CGGTGAAGCT AGCGCGGCAA TGGTAAAAGA AGTCATGTGC CGTCACATGG TTGTTCCGCA GGATTCTGTG GCTACACTTG AATCAAAGTC TCGCCGCGTA TGGCGCTATC TTTGTATTTT CCGGGGGAAA GTTGTTGGCA GTAGTCGCTC TCCTCAACGG CACGCAATCA TTGTTCCTCC TGGAGCGTTT GGTCCTGACG CTGTTCGTGG GGTCTTACTG GACGAGGGAG TAAATGTGAC TCCACACGTA CCCTGTGGCT GTACCATATT GCATCTTACC ATGGCGGTAG AGAACGATAC AGGAATCGAC CCGCAAGACA TATTGCGACG AGTATCGCTT TCGGTACTGT CCAGCAAGTA TGAAGGTAGC AATGCTATTC GCATTTTTCA AGTAACATTC AGCTACGCTC TACCAGAGGC 
ATCCAATCAA ACGAAGAATG TTCAAAATTT GCATTGTGTT CAGCGCTCAG AACCAGGCCT TTCCGCCGAT TCCTCGTTTG AACAAGCCAG AGAAATCTTT GCAGCAATTT GCCCTGACGG GAATTTCTTG GCCATATCTG AACAGGTTGA TACGGTTGTG AAAGAACGGC TTGGTGAGCA GATTGAGGAG GACGCTGACG GATTGGTTCT GGATAGTGCA ATGGATATCA TTGAACCTAA AACAAGAGAG AGTAGTGCTA CGTAG
Protein sequence | MVDQTALPKT SGASRHLDND APRDEEGLLL EYDAIICGTG LTQSILASAL ARHGKSVLHC DTSDYYGELD AVWTLPYLQE KLGKGHELKG KMVTVRTDDS TCIPLSPKGG LQSFQIHSLE RRNDFTVEQG TAVVTPYGNG TIISISPRGL FLDLLVSLDT WKLANGKNPS VHISIPQSAA STGAAFQKYL KEKLGIVSID ERDAHLILDE RSRGFAVDAT PGLLYASGAS VEGLLKSGVA DYLEFKSLEG LLWLENSDSC LEPVPCSKND VFASKLLSPM DKRRLMKFLQ LALDYGTAHA LAEEEASAEN NKQVLSLNER YLNQGRSLAR PQNKAVLADD IRALQEFIQA DMKFHEYLES KQRLSPKLCR IVRHALALES GNGDWSLRQG MTSLCQHMQA LGRFGTTAFL VPMYGSGELP QAFCRSAAVY GATYLLRLKA AAIATDGEAS AAMVKEVMCR HMVVPQDSVA TLESKSRRVW RYLCIFRGKV VGSSRSPQRH AIIVPPGAFG PDAVRGVLLD EGVNVTPHVP CGCTILHLTM AVENDTGIDP QDILRRVSLS VLSSKYEGSN AIRIFQVTFS YALPEASNQT KNVQNLHCVQ RSEPGLSADS SFEQAREIFA AICPDGNFLA ISEQVDTVVK ERLGEQIEED ADGLVLDSAM DIIEPKTRES SAT
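The sequences in this record are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before lengths or GC content can be compared with the Gene Length, Protein Length, and GC content fields. A minimal Python sketch, assuming plain whitespace is the only formatting present (the helper names `strip_blocks` and `gc_fraction` are illustrative, not part of the source database):

```python
def strip_blocks(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace is ignored)."""
    cleaned = strip_blocks(dna)
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First two blocks of the gene sequence in the record, used only as a demo.
prefix = "ATGGTGGACC AAACAGCACT"
print(len(strip_blocks(prefix)))  # 20
print(gc_fraction(prefix))        # 0.5
```

Applied to the full gene and protein strings, `len(strip_blocks(...))` can be compared against the listed 2065 bp and 673 aa, and `gc_fraction` against the listed 48%.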