Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43626 |
Symbol | |
ID | 7197495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1037642 |
End bp | 1039014 |
Gene Length | 1373 bp |
Protein Length | 341 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177741 |
Protein GI | 219111979 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000259713 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTACCATTC ACAGCAAAAG ATCTTTTACA TCAATTGCCG ACCAGCTCTA CTTATCGGTC CTCACTGTCC ATCTAGGTCC AGCAAGAGTA GACCTCGCCA TGGTACTGCG GCAACGCAGG ATTGCAACGC TATTCCTCTT ATTATCTGAA GCTCGGTCGG TGCATTCGCT GCAAGGCGTG ACGACACCAA TACGAAAACG AAATTTCTGT GCGTACGCGT TAGGAGAAGC AGATGAGCTG GCAAGGAGTG TCGGACAACG AAGGGAAAAG AAGAACAAGT ATATCGAATT CTCTAAGGTT ACGAGTGGAA AAGATCCATT TGAAAGCCTT TTGGAAGAGT CCTTTTCCAA ACGAAAAGCG CTAGATGATG ATATCGCTCG CAAAGACGAG CTTAGCAGTT CACCACCAAA AATTGGTAAT TCGGGAAACT TATCGTTTCC TGACAATAAG AATATTGATC CGTACGATCC GACAACTTTT GGGTACATAG AAGTTGGGAT TGTACGTGGC GCACATGGCG TACACGGATG GGTCAAGGTC AAACCAACCA CTAGCTTTCC TATAGATCGG CTCTGCGTAG CTGGCATAAA GCACTTGAAA CCACCCAAGA AACGCGCCCC ACGGAAAGTT CTGTTATTGG AGGGAAAACG TCGTAATGCG GAGGAATATT TGATTAAACT TGAACAAATT AATGACAGGG ATGAAGCTTT GAAGCTCCGA GGATCGCTTT TATACGTGCG AGAAGAAGAA AAAATGGCGA CCGACCAGGA AGAGTACATG GTATCCGATT TGGTAGGCTC TGAGGTGTTT CTGGACACGC TAGATACAGT TGAAAATCCG CTGTTTCTTG GCGTTGTGAA CGGTGTTGTT TTTGCAGATG AAATGTGCTC TATTCCAGGA CTCGGCCACG ACATGTTAGA GGTGGTCCTT CGAAAGGGTC AAGACGGTAT GGCCTCACTC CGCGACGAGC TTGTTCTTAT TCCGATGGTT CCTCAAATTG TTACGCATGT CAGTGCTGCG AAAGGGGTTA TTCACATTAA TCCACCATCA GGATTACTCG ATTTGACGTA CTTACGAGAG GAGCGAGCCA AATTAAAAGG ATTTCTACCA CCGGGAAAGG GGTGAATGCG CCTGAGCTTC AATTCCAGTT GGTGTTCTAC CTTGAATACA GTGCATAGTA TGTCATTTAC AACAGTCACG GGAGATCGAA ACAACAAAGG CAAGCAGAGC TTCTGTGAGG ATGGTTGGAC CAATTGTACT TATATAGGTG CATCATGGCT TACTGTTGAC CAACGTTGGC CCTTGTCTTC CGTACGCTGA GGCATCGTTC CTACAACGTC TTGCTTGCAG TACATTGTAC AAATTAAGGA ACC
|
Protein sequence | MVLRQRRIAT LFLLLSEARS VHSLQGVTTP IRKRNFCAYA LGEADELARS VGQRREKKNK YIEFSKVTSG KDPFESLLEE SFSKRKALDD DIARKDELSS SPPKIGNSGN LSFPDNKNID PYDPTTFGYI EVGIVRGAHG VHGWVKVKPT TSFPIDRLCV AGIKHLKPPK KRAPRKVLLL EGKRRNAEEY LIKLEQINDR DEALKLRGSL LYVREEEKMA TDQEEYMVSD LVGSEVFLDT LDTVENPLFL GVVNGVVFAD EMCSIPGLGH DMLEVVLRKG QDGMASLRDE LVLIPMVPQI VTHVSAAKGV IHINPPSGLL DLTYLREERA KLKGFLPPGK G
|
| |