Gene Information |
Locus tag | PHATRDRAFT_21543 |
Symbol | |
ID | 7202514 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 393904 |
End bp | 395061 |
Gene Length | 1158 bp |
Protein Length | 366 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181548 |
Protein GI | 219122430 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCATCGTGTG TTTTCTGAAG GCACGCCATG ATGAAGGCTC GGAGCCGGCT GACGGCTTCC GTCTCGGTCA CTGCAGCTGT TAGTTTCGTC TCTTTTCAAC AAACAAGGGC AATGGTCAGT CTACACGGCA ATCAAGCCTT GACGACATCG TTTCGTATGC GCCATACGGA TCGTCTACAC GCCGTCACGG CTCGGCCACC CGATCCGAAT TCTCTACTTT CCGCAACGGA CCAACAAGAG TGTATTGATA CGCTGCTCGC CTGGTTTGCC GGTAAATCGC AAATACTGTG TCTGACGGGA GCGGGCTTGT CTACGGAGTC TGGAATTCCA GACTATCGGG GTAACAATGG AAGCTATCAT CGTGGACATA AACCGATGGT GCACGATCAA TTTATGAAAT CGGAATGTCA ACGCAAGCGA TACTGGGGTC GCGGTATGGT GGGGTGGAAA TCTTTCGATG AGACGGCGCC AAATGCCGGG CACGTGGCGC TAACCGAGTT GGAACGTCTG GGACGAATCG GCGTCGCTTT CGAAGACAAT CGCGCGTTTT ACGAGGGTCA CGACGATGAC TTGGAATGGA CGTTTCGTTC TGGTCACCGA AAATTATCAC TCATTACCCA AAACGTGGAT ACCCTGCACC GCCGAGCTGG AACCAAGCAC TTGATTGAGC TCCATGGACG CACCGATCAG CTGGAATGCA TGCAATGTGG TACAAAAAGA GACCGCAATA GCTTTCATGC CGAATTAGAA GGTCTTAATA CGGATTGGCT AAATAGGGCA TTGGCTACAA CGGACAATGA CGATATGCGA CCAGATGGAG ATGCAGCTGT AGGGATGGAA GATTTCGAGT CCGTACAGGT TCCGCCCTGT CAGTCTTGCG GAGGCTTCAT GAAGCCCAGC GTGGTTTTCT TTGGTGACAC AGTACCAAGG AATCGTGTCG CACAATGCCA AACCGCAGTG GAAAAAGCAG ATGGATTGCT CGTCGTTGGA TCATCTTTGG CGGTCCATTC GGCTTTTCGC CATGTAAGAG CAGCATCAAA ATTGGGTGTA CCGATTGCCA TTTTAAATGT CGGTGGAACG CGTGCGGAAG CCGAGGGTTT GGATGTCCTG AAAATCGAGG CACCAACAGG GCAGACCTTG GAAGGAGTAG CGAAGGTA
|
Protein sequence | MMKARSRLTA SVSVTAAVSF VSFQQTRAMV SLHGNQALTT SFRMRHTDRL HAVTARPPDP NSLLSATDQQ ECIDTLLAWF AGKSQILCLT GAGLSTESGI PDYRGNNGSY HRGHKPMVHD QFMKSECQRK RYWGRGMVGW KSFDETAPNA GHVALTELER LGRIGGHDDD LEWTFRSGHR KLSLITQNVD TLHRRAGTKH LIELHGRTDQ LECMQCGTKR DRNSFHAELE GLNTDWLNRA LATTDNDDMR PDGDAAVGME DFESVQVPPC QSCGGFMKPS VVFFGDTVPR NRVAQCQTAV EKADGLLVVG SSLAVHSAFR HVRAASKLGV PIAILNVGGT RAEAEGLDVL KIEAPTGQTL EGVAKV
|
| |
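The derived fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch, not the database's own method: it assumes the Start bp and End bp coordinates are 1-based and inclusive (which is consistent with the listed Gene Length of 1158 bp), and it assumes GC content is the percentage of G and C bases over the whole sequence. The function names `gene_length` and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    # The record prints the sequence in space-separated blocks of 10,
    # so strip all whitespace before counting.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Gene Information table above.
print(gene_length(393904, 395061))  # 1158

# First two 10-base blocks of the gene sequence, used as a short example.
print(gc_percent("CCATCGTGTG TTTTCTGAAG"))  # 45.0
```

Passing the full Gene sequence value to `gc_percent` gives a figure that can be compared with the listed GC content, keeping in mind that the record rounds to a whole percent.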