Gene Information |
Locus tag | PHATRDRAFT_21794 |
Symbol | |
ID | 7202837 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 366548 |
End bp | 367867 |
Gene Length | 1320 bp |
Protein Length | 346 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182056 |
Protein GI | 219123489 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
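The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the coding sequence ends with a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 366548
end_bp = 367867
protein_length_aa = 346


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(n_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode n_aa residues, optionally plus one stop codon."""
    return 3 * (n_aa + (1 if include_stop else 0))


gene_len = span_length(start_bp, end_bp)      # genomic span of the gene
cds_len = coding_length(protein_length_aa)    # codons for 346 aa + stop
non_coding = gene_len - cds_len               # bases in the span not used as codons

print(gene_len, cds_len, non_coding)  # 1320 1041 279
```

The 279 bp of the span that do not encode protein are consistent with the record marking the gene as spliced; the arithmetic alone does not say how those bases divide between intronic and flanking sequence.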
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTACCGC CACGCAAGGA GATTTATACC TACGCGGCGC CTTGGACTGT ATTTTCAATG GCTTGGAGTC GAAGGTATGT AGGCATGCAT GTAGACGGAA CTGTGTTTCT AGCTACCACG ACTTCAATAC CGTATAAGTC GGAGAATGAT CAGCGAGAGA TAGTTGTGTC CCGAATTTCT TCAGGATTTG ATTGCCACTA CCATCTTCTT GCATCACACA AGATCTAACA CCTGGCATAC TATATACTCT TTGGCTGTCA TAGACAGGAC AAAACGTCCC AATTCCGTTT GGCGATTGGT AGCTACGTAG AGCAGTATTC CAACGCGGTG CAGATTGTGA AAAAGGTCCC CCAGCACGTA GATAGTGACT TTGCCGGCGG TGGGTCCGCT TCGCTCTATC AGGCGGGGTC GTTCGACCAC CCATATCCAT GTACCAAAAT TTTGTGGAGT CCGGATCAAT CACTCGCGGC GCCAGACCTG TTGGCTACGA CCGGGGACTA TTTGCGGGTA TGGAACATAC GGGACGACGG CAGTGGACAA GGCACGGTGC AATGCAAGAA GGAGTGCTTG CTCAACAACA ACAAAACGTC TGAGTACTGC GCTCCGCTTA CTAGTTTCGA CTGGAACGAA GCTGATCCGA ACATTGTAGG GACGTCCTCC ATTGATACCA CCTGCACAAT TTGGGATATT GAAACCCAAA CGGCGCGCAC CCAATTAATT GCGCATGATA GAGAAGTCTT CGATTTGGCC TTTGCCCGAG GAAAGGACGT GTTCGCCTCG GTCGGAGCGG ACGGGAGTGT TCGCATGTTT GATTTACGGA GCTTGGAGCA TTCTACTATT ATTTATGAGT CGCCCAATTT GGATCCTTTA TTACGGTTGG AATGGAACAA GCAAGATCCG AATTACCTGG CCACCTTTAT GGTGGATAGT CGAAGGACGG TCATTCTTGA CATCCGTGTC CCTAGCTTGC CGGTTGCAGA ACTTGGCGGT CATTTAGGAT GTGTAAACGC TACGGCTTGG GCGCCCCATT CTTCCTGTCA TATTTGCACG GCGGGAGACG ACAGTCAGGC CCTGATTTGG GATTTAAGTG CCATGTCCAA AAGGCCTGTT GAAGAACCAA TTTTGGCTTA CAATGCGTCC GGAGAAATCA ATAACTTGCA ATGGAGCGCA TCACAACCCG ATTGGGTGAG CATCGCGTTT CACGACAAAT TACAGATTCT CCGCGTCTAG GAAAAAGCAC TAGAGACTAC TGGAGAGCAA TAGACCGAAG TACGGCCCTG ATTCGACAAG ATGCCAATGA ACACGCCATT GAAGTATTCG
|
Protein sequence | MVPPRKEIYT YAAPWTVFSM AWSRRQDKTS QFRLAIGSYV EQYSNAVQIV KKVPQHVDSD FAGGGSASLY QAGSFDHPYP CTKILWSPDQ SLAAPDLLAT TGDYLRVWNI RDDGSGQGTV QCKKECLLNN NKTSEYCAPL TSFDWNEADP NIVGTSSIDT TCTIWDIETQ TARTQLIAHD REVFDLAFAR GKDVFASVGA DGSVRMFDLR SLEHSTIIYE SPNLDPLLRL EWNKQDPNYL ATFMVDSRRT VILDIRVPSL PVAELGGHLG CVNATAWAPH SSCHICTAGD DSQALIWDLS AMSKRPVEEP ILAYNASGEI NNLQWSASQP DWVSIAFHDK LQILRV
|
| |