Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_26061 |
Symbol | |
ID | 7197853 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 393640 |
End bp | 395512 |
Gene Length | 1873 bp |
Protein Length | 552 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178499 |
Protein GI | 219115407 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCGTTTGCG AATTAGAGTC ACAAGCGTCC ACTTCCACTA TCACCATGGC TTCTCTTTCT CTCCCCCCTC CCCGCAATAT GTCGGAAGCG CCTGTTGTGT CCCGCAAAGC CACCACAGAA CAAAGTCCGG AAGCCCCGAA GGCTGATCAC TCGCGCCAAC CGGATTCTGC CGCTACCACT ACCAGCAAAC CATCTCCTCC CTCCTACGCC GAACGATGCC GGGCCGCGCT CGCGCTCCAT ACCATGCCAC CCGAAGAACG ACGTAAAAAC AAGCTGTTTG TCCCGCGTTC ACTGGACGAT TTCGGCGACG GCGGAGCGTA TCCGGAAATT CACGTGGCGC AGTACCCGCG TCACATGGGC AACCCCCATC GGCACTCGAA GCACGCGAAC GCCTCGGGCG GCACGTCTCG CGGTGCACAG GCAATATCGA AGGCACTGGT AAATGTAGAA ATTGATAAAG ATGGTAAGGT ATCGTACGAT GCGATCGTCA AGGGGGGGAC AAACTCGGAT AAGATTGTGT ATTCGCGACA CGCGGATTTG CGAGGCGGAT CAGCCAAAGC GGAGGACATT GCGTTGCCAA CAGAAGAAGA GGAGCAATCC GAAGCAGCTC GGACACAAGC GGCACTCGAT GCAATATTGG GTAAGAAAAC CGCGTTGGAC AATCCTTCTG GCAGCGCAAT CGTCAACGCC CAAACGTCCC AGAATGTGGA AGCGAAAACA TCCTTTATTA AATACACCCC ACGACCCGAC GCTCCCGGCT ACAACCCTGC CGCTTCACAG CGAGTTATTC AAATGGTGCC AGCCAAGGTT GATCCCATGA TGCCTCCCAA GCACAAGCAC ATTAAAGCTC CGGCTGGACC GGCCGAAGAT CCGGTTCCGG TCTTGCACGC ACCACCTAGC AAATTGAGCA AGGAAGAACG CGAGGCATGG AACGTACCCG CGTGTATTTC CAACTGGAAG AATACCCGAG GCTATACGAT TCCGCTCGAC AAACGATTGG CGGCGGATGG GCGAGGCCTG CGTGAGCATA CGATCAATAC CAATTTTGCG ACGCTTTCCG AATCCCTGTA CGTGGCGGAA CGCCAAGCTA GGCAGGAGGT ACGCATACGG GCGCAAGTGC ACAAGAAATT GGCTTTGCAA GAAAAAGACA AGCGGGAAGA TGAGCTTCGG CAGCTGGCGA ACCAAGCGCG TCTGGAACGG GGTGGCGGAG GCGGAATGCC TGCGGCGGCC CAGCCATCAC GCGACCGAGG GCACATCTCC GATGCGTCAT CGGATGATGC AGAAAGTATC GACCATCCTC CCCCGGCAGC TGCGCAAGGA GATACGGAGG ATGATGTGGC CGCGCGTCAG CGAGAAAAAC TTCGCTTGGA ACGAAAACGG GAAAGAGAAC GAGAAATGCG TATGGAAAAC AATATGGAAC TCAAGAAGCA AAAGTTGGAG CAGGAACGTG ACGTGTCGGA AAAAATTGCT TTGGGGGTAC ACACGGGTAC AGGTGGCTTG GGAGGCGATG TGGATTCACG TCTTTACAAC CAATCGGCGG GTATGGATTC AGGGTTTGGC GCGGACGACG AATACAATGC GTATTCCAAG CCTTTGTTTG CACGCCAAGC CGCGGCGTCG TCGGCATCCA TTTACCGTCC GACTCGGGGC GACACGGCCT ATAATGCGGA TGAACAATAC AGCAAGTTAC AGCAAGGGGC TACCTCCAAG TTTCAACCAG ACAAGGGTTT TTCTGGGGCC GAAGGTGGTG TCTCTGGGGC TGGAACCACT CGCACAGCTC CTGTTCAGTT CGAGAAAGGC GATCAAAAAT AGTTCACGAA ATTTTATCAT CTTTTGTTTA CTAGTTTTTA AAAGGATAGA TTGAGGAGGG GCT
|
Protein sequence | MASLSLPPPR NMSEAPVVSR KATTEQSPEA PKADHSRQPD SAATTTSKPS PPSYAERCRA ALALHTMPPE ERRKNKLFVP RSLDDFGDGG AYPEIHVAQY PRHMGNPHRH SKHANASGGT SRGAQAISKA LVNVEIDKDG KVSYDAIVKG GTNSDKIVYS RHADLRGGSA KAEDIALPTE EEEQSEAART QAALDAILGK KTALDNPSGS AIVNAQTSQN VEAKTSFIKY TPRPDAPGYN PAASQRVIQM VPAKVDPMMP PKHKHIKAPA GPAEDPVPVL HAPPSKLSKE EREAWNVPAC ISNWKNTRGY TIPLDKRLAA DGRGLREHTI NTNFATLSES LYVAERQARQ EVRIRAQVHK KLALQEKDKR EDELRQLANQ ARLERAAQGD TEDDVAARQR EKLRLERKRE REREMRMENN MELKKQKLEQ ERDVSEKIAL GVHTGTGGLG GDVDSRLYNQ SAGMDSGFGA DDEYNAYSKP LFARQAAASS ASIYRPTRGD TAYNADEQYS KLQQGATSKF QPDKGFSGAE GGVSGAGTTR TAPVQFEKGD QK
|
| |