Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47198 |
Symbol | |
ID | 7202187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 789198 |
End bp | 790676 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181262 |
Protein GI | 219121831 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.459102 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAAA GAACAATTGC CCCACCTCTT CCCAAAGTCG TTGTCGTTGT AGGTACCTAC GAAGGCGTCT TGGCAGGCTG GGAGTTATCG AAACACAATA GCTTTCAAAT ATCGTTCGCG ACGCCGGTAC ACGGAGGCAG CGTTCGCAGT CTGTGTATTG CCAGCCGCGG CAGTGCCTCT ACTTCTGGGA ATAGCGACAA AAATCAGAGC CTACCAGGGT CTTTGCTCTC CTGCGGATAC GATGAATACT TGAAAACTCA CGATTTTGCC AAAAAATTGA CATCATCGGG AGAAGTCCGG ACCCCCTCCG AGTTCGGTAC GCCCTTGTGC TCGTCCTTTG CTCCGCCAGC GTCGTCATCC GGCCTGCCGA GTACGCACTG TTTGTTGGGC TTTGCGGGTG GGAAGCTTGT TATCTACAAA AAGCGCGATT GGAGCGTCCA GCATGTACTG GCGGGACACG AAGGCGGCGT ATCAGCGATG GCTGTACATC CTTCGGGGAA AATGGTTTTG ACTGGAGGTG AATCGGACGG CAAGCTCAAG CTTTGGGATT TGACCAAGGG TCGACTAGCG TACGTGAGCA AAATCCAACC CGCGCGCACG AACATTCAAG GTCGAACCCA CTACGATGCG GTTGTCAGTC TCGTTTGGAG CCCCGTAAAT GGTGACGCTT ACGCCTTCGC CTATGGATCG CATTTGACAG TTCGAGATGT TGCGACAGGA AAGGATCTGC TGGATACTGA ACTTCCCTCT CGGGTCAACC AGATTTGTCT ATTAGACGTA TCAGAAGGCT TGTTTGTCGC AGCGGCATGT AACGATGGAT CGCTGCCGGT TTTGGCTGTC CAGAGTGTAG ATAATACAGA AGGGGAGCGC CGAGGCATGA TGGCGATCGA ACCAGTCGAA GGGCCAGTGG CGCGAGAAGA GCGATTTAAA TGTATACATG CGGTCGGGGG TTATCACGTT GTAACTGCAA ACAGTGCCGG TGTTGTAAGT CTCATGGACT TGCAAGGGGC CATCAACATG ATTATGAGCG ACGACAAGAA CGACGACGGA GTTGATGCAG GTAATCCAGT GGATCCGAGC AGTGACACGG ACGACGAGAG TGTCGATCAC GAAAGTGACA AAGGTACAAG TGAAGATGAA GAAACTGGCG AGGAAGAGCT GGCGGTCGAC ATGATCGACA GTATTCAGTT AGGAACCGGA GCGCGGATTA CTTGTTTGGC GGTCTATTCT TGTGAACGAG ACGACGATTT ATCGGATCCT CCATCCGATG CGTCTGTGGA TAATGAGGAA GTAGAAACAA TACCAAGAGA GAATGCGCCG GAAGAAGACC GCGAAAACTT TCAAAGAGTG AAGCGGAAAT GGGAAAAGGA AGTCGTCATG GATCCGGAAG CCGTAGAAAG GGCAAGGGCC CTGGTCACAG AGGCGAAAAA GATTCAAAAA CGAAAGGAGA AGAAATCAAA GAAGCACAAG ACTAGATAG
|
Protein sequence | MGERTIAPPL PKVVVVVGTY EGVLAGWELS KHNSFQISFA TPVHGGSVRS LCIASRGSAS TSGNSDKNQS LPGSLLSCGY DEYLKTHDFA KKLTSSGEVR TPSEFGTPLC SSFAPPASSS GLPSTHCLLG FAGGKLVIYK KRDWSVQHVL AGHEGGVSAM AVHPSGKMVL TGGESDGKLK LWDLTKGRLA YVSKIQPART NIQGRTHYDA VVSLVWSPVN GDAYAFAYGS HLTVRDVATG KDLLDTELPS RVNQICLLDV SEGLFVAAAC NDGSLPVLAV QSVDNTEGER RGMMAIEPVE GPVAREERFK CIHAVGGYHV VTANSAGVVS LMDLQGAINM IMSDDKNDDG VDAGNPVDPS SDTDDESVDH ESDKGTSEDE ETGEEELAVD MIDSIQLGTG ARITCLAVYS CERDDDLSDP PSDASVDNEE VETIPRENAP EEDRENFQRV KRKWEKEVVM DPEAVERARA LVTEAKKIQK RKEKKSKKHK TR
|
| |