Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_25974 |
Symbol | |
ID | 7197993 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 69685 |
End bp | 71928 |
Gene Length | 2244 bp |
Protein Length | 733 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178422 |
Protein GI | 219115253 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAACTTTCTT CGGGTTGTCT CACAGTTAGT CAGCAAAGCA CTCTCATGCG TACCGACGCT GTTTCCGAGA CAGAGTCTGC TGCGGATCTG AGACAGCGCG GCAATCACGA GTTTCAACAA CACAATTACG AGCACGCCGT CGTGCTCTAC ACGGAGGCCT TGCGGGTCGC CGCGACGTCT CCGGACACTC TGGACACGCC CGAACTCGTA CTCAATCATT TGAATCGTGC CGCCGCGTAC GGACAACAGC AGAACTACCC GGCAGCACTG GAAGATGCCA AACAAGCGTG GGACTTGTCC CAGCACACTT CCGTTAAGGC TGCCTACCGA TTGGCCAAGA CGTACCGACA ATTGCACCAA TACGCCGACG CCAAAGCAAT CATCCAAGCC GGACTGAAAG CCTTGCAACA ACAAAGTCCG GTGCCCGAAG CATCGGGTGG TGACGAAGAT CCCAATGACG TTACCTTTCA GCAACAGCAC CAGGCGCTCC AGGATCTGTG GACCCAAGTA CTGCAGGCTG CCCTAGAGCA AAGCAGCGAT TCCAAAGAGC CCAAAGATAC TACCCCGGAA ACGTCCATTG AAGACGTCAC CCGCGGAGTT TCCATTCGCG AGTTCCGCGT GCACCAAGAA CTTGGCTACG GCAACTTTTC GCAAATTCAG GTGGTCACGC ATCGGATAAC GAACGAACGC TTCGCTCTCA AACGCATCGA AAAAAAGCGC TGCGAGGAAC TCGCCAAACG ACAACATCCC AACGTCTACA ACGAAGTTAA TATGGAGCGT CGCGTACTGC TCCAGCGCCT CCCCCGGCAT CCCAACGTGG TCAAAATGTT TCATGCCTTT CAAGACTACA CCACGTTGTA CTATCTGATG GAATTGCACG ATGCCTGGTC GGACTTGTGG AGTGAGTTAC GTGATCTTCC CGGCAACGAA GGATGGTCCT CACCGTCTCC AAAATCCCGC ATGGTTGGCG CACACCGTTC CTGTGCACAA ATCTGGATGT ACCAGCTCCT TGCGGCCGTC GAACACTGTC ATAAGCATGG CGTTGTCCAC CGTGATCTCA AACCCGAAAA CATTCTCTTG AACGGGCGCG GACACGTCAT TCTGATTGAT TTTGGAACGG CCAAGGATTT GCTCGAGACC GACTTGAACG GTCCCGAGTT CGTCGGGACT CCAGATTTCA TGAGTCCCGA AGCCGTCAAA GGCACGTCCA GTATGGCAGA AACGGAGGCC GCACGCCAGG ATCATGGTGA CGTTGTTGGT GCCGGTCCAG CCGCTGACTT GTGGGCGTTG GGCTGTTTGC TGTATATTTT GCATACGGGC ATGACCGTCT TCTGGACAAC TTCGCCGTAT TTGGCCTTTC TACGGATTGC GCGGGGACTC GTCACACGGC CGGTCGGTAT CGTCGACAAC GACGCCTGGG ATCTTATCCA AGCATGGCTG CGATTAGATC CACAATCGCG GTTGGGAGCT GATGTCTACA CCGTCCAAAC ACCCGACGGG ATGGACAAGC CTACCATGCG ATCGTTACCG GGCGGGTATG ACTGTATACG CGAGCATGCA TATTTTGCCG AGTACCACCA CGCGGATGAC ACAGTGAAGG CGCAAACCCC GATACCAACC CTGCGTGATT TGTGTGTACG GTCCGTGGCT GAGTTGGCGC ACCGAGATGC CCACCATTTA GACATTAGTG ACCGACACCC TCCCGGCGAC GGATCTTCTC ACGACATGCT GCGCTTGAGT TCACGAGATC GTGACATGGT GCTGCACGTT TTGGATCGAC AGCAGCGGCT ACGGGATCCC CGCGTATACG CGAGGTTCTT TGCGGATCCT TTGATGGCTC CGCTAAATCG GATTCGACCG GCAACCCGGG ACGTGACGGG CTGGACGCAA ATGAACGACG ACCAGGGTAA AGCTCCGCAC GCGTTGCAGA ACCCCGACCC GCACGCCACA CCCGTCCCTA TCGACCCGAT TCGCCTTGTT TACATCTCCA ATCCTCTCTT TGGATCCCAG ATCCGGGACG GCGCGATGGA TGACGATGAG CGTAAACTGT TCTTGAAGCA ATTGAAACGA TGCGTGGCCA CCATCAACCG ACAGCGGCCT AAACTTGTAA TTGTAACCGG AACGATCGAT GCCAAATGCC GCAAAGTCTT GGCTCGGATT CGTGATTCCA TCCCTATCCT GCTCAACGAT GGCACCGCCT TTTGTTCCTT TTGGATACTT GGTGTTCAGT GCCTTGCACT ATGC
|
Protein sequence | MRTDAVSETE SAADLRQRGN HEFQQHNYEH AVVLYTEALR VAATSPDTLD TPELVLNHLN RAAAYGQQQN YPAALEDAKQ AWDLSQHTSV KAAYRLAKTY RQLHQYADAK AIIQAGLKAL QQQSPVPEAS GGDEDPNDVT FQQQHQALQD LWTQVLQAAL EQSSDSKEPK DTTPETSIED VTRGVSIREF RVHQELGYGN FSQIQVVTHR ITNERFALKR IEKKRCEELA KRQHPNVYNE VNMERRVLLQ RLPRHPNVVK MFHAFQDYTT LYYLMELHDA WSDLWSELRD LPGNEGWSSP SPKSRMVGAH RSCAQIWMYQ LLAAVEHCHK HGVVHRDLKP ENILLNGRGH VILIDFGTAK DLLETDLNGP EFVGTPDFMS PEAVKGTSSM AETEAARQDH GDVVGAGPAA DLWALGCLLY ILHTGMTVFW TTSPYLAFLR IARGLVTRPV GIVDNDAWDL IQAWLRLDPQ SRLGADVYTV QTPDGMDKPT MRSLPGGYDC IREHAYFAEY HHADDTVKAQ TPIPTLRDLC VRSVAELAHR DAHHLDISDR HPPGDGSSHD MLRLSSRDRD MVLHVLDRQQ RLRDPRVYAR FFADPLMAPL NRIRPATRDV TGWTQMNDDQ GKAPHALQNP DPHATPVPID PIRLVYISNP LFGSQIRDGA MDDDERKLFL KQLKRCVATI NRQRPKLVIV TGTIDAKCRK VLARIRDSIP ILLNDGTAFC SFWILGVQCL ALC
|
| |