Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45819 |
Symbol | |
ID | 7200824 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 372945 |
End bp | 374145 |
Gene Length | 1201 bp |
Protein Length | 258 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180028 |
Protein GI | 219118515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.178302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCATT CCTTCATGCA GGCTTCACTG CTTTTCCTCC TTATTTTGCG TGTCCTAGTT GCTCCGGTAT GCAGTTTTTC TCGAGCCACA CTATAATTCT AATTCACCCA GTGCCGCACG TTCACAGTAA AAAGGATGGA TACGCCCTTC TTGATACAAA ATACAGTCAG TCAAAACAAG CGCATGTTTG AGCATGCACG CTCATACTAT GCATAGTCGA ATGCTTTCGT CGTGCGATGA ACCAACAATG AGTACCTACC GAGATGCGAC GCATTCCTTC CCTCGACAGA TACATGCAGA AGAAGGGATC CATAAGGAGG ACAATCGCAT GTTTCTACTA GATTCCTTCG ATAGTTCCGC AGTTCACCAC CGTTCTGTTA GCTCGCATTG TCACGATATC TCGCGGCTGG ACAGAGAATT GCCACTAGGC TTGGAATTTA AACCTGCGGA ATGGGACGTT GTAAGTGCAT GGTAGTGGGC GGCATTTTGG ATTGTGTGGT GGTATTGTTT GATCTCACAA ATATATTACT CCAATGGAGC AGATTTGTGG GAGGGGGAAA GCTAGTTTCG ACCATGTCGG AAACAGGCGA CTGCGGCTCC TTGTTGCCAA TAGCATGCAT TCATATGTAG AGGCAAAGTC TAGAGTTGAT AAAACTGCCA TAGTCCAAAC TATTGTTGAG CAGATACGCG AAGCCAGTCC TAATGGAGGT TTTGTGAGAA AAGACGACTT CGGCGAGTGG TACGAGATCG GAACAAAAGC TGCAAGGGAG AAGGTTGGAC ACGCAATCCG CGACTGCTTG ACAGAACCTT TGAGGGGCAG ATCATTGAGT ACCTACCAAG AACGACTGGA AAGCCTACAG GAAGTACAGG ACGAAGTGTT TCGTTCTCTG AAGATTGCTG GTATTCAAGA AAGAGAAGGG CAAGCAAACA GTCCATACAA ACAAAATGGG TCCGCCTAAT GTAAATCGAC GAAGAAAAGT TACTCTATCG AAGATTTTCA AGGAGAAGGT CGAGGTTTGT AACTGCTCAT GATCCATGAA TTCCCTCCGA CGCCACCAGA TTCAGTAAAT GCCTGTCCTT GTCCTTGAGT TTAAAGAAAG GAAAGGAGCA TAAATGCGTC CTCTCTTTAA CTCTTCTCCT ACTAGCATGA TCTAGATAGA CCGTGCTCTT TAAAAGAGAT AGTAGTTAAT ATTTTGAGTT G
|
Protein sequence | MPHSFMQASL LFLLILRVLV APSVKTSACL SMHAHTMHSR MLSSCDEPTM STYRDATHSF PRQIHAEEGI HKEDNRMFLL DSFDSSAVHH RSVSSHCHDI SRLDRELPLG LEFKPAEWDV ICGRGKASFD HVGNRRLRLL VANSMHSYVE AKSRVDKTAI VQTIVEQIRE ASPNGGFVRK DDFGEWYEIG TKAAREKVGH AIRDCLTEPL RGRSLSTYQE RLESLQEVQD EVFRSLKIAG IQEREGQANS PYKQNGSA
|
| |