Gene Information |
Locus tag | PHATRDRAFT_15329 |
Symbol | |
ID | 7195082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 372303 |
End bp | 373901 |
Gene Length | 1599 bp |
Protein Length | 533 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183351 |
Protein GI | 219126201 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCATC CGTACGTGCG TTTGGCGATT TTACTGAGTC TCTCGCCAGC CGCGCAACGC GATATCGGAG TGCTCGTCAA AATGCTTCCC GAAGTGATGA ATATTTTGCT GATTTTGGGA GTCTTTATGG TGTTTTACGC GTGGTTCGGC ACGGTCATGT TCGTGGGCAC AGAAGAAGGG TCGATGCACT TCAGCAGTTT GATTGAGTCC ATGTGGACTC TGTGGATCTG CGTCACGACG GCCAACTATC CGGACGTGAT GATGCCCGAG TATAATCAAA ACCGCTGGGT TACGTTGTAC TTTATCTCCT TTATGATCCT TTCCTTTTTC TTTCTCATGA ACCTGGTGCT GGCGGCAGTT TTCAATGAAT ACCAGTTGGC CTTTCAGACG CGCAAGCAAG ATAGAACCAA AGCTTCGGAC GAAAACTTGC GCAAGGCGTA CGCCTTGATG GACTCTGAGG GAATGGGTCG GATTGATCAA GAAACCGTCA TGGCTCTCTT TTGCATTCTC AATGAAGATT TCCCGGAATT CCGCACGCTA TCTGATGAAG ATACAAAGCT CTTGTTTGCT ATTCTTGACA AGGACGGATC TTCGACGATA ACAGAAGAGG AATTCATGGA TTTCGGTAGT GTCCTGTTAC TGGAATTTGT CAAAACAAGC GCGTATTCAA CATTTGTTGA ACTACGACTC CCCAAAATTT TCCATTCCAG CCTGTACCAG ACCTTTTGTA CGGTCGTCAA GTCCAATCTG TTTGAGTACT CCATTGATGC TATTTTAGTC ATGAACGCTG TGGTGATTGG CATCCAATCA TATCCAGAGT TGTCGGGACA AGCGGTCCAA ATCGATCCCA AGTACTGGGA CGGATCCATC GATACCATTT GGGAAGGAGT CGAATCTGTT TTTACAGTCA TATATGCGTT GGAAGTGGTG GTCAAAGTCT TGGTTCTGGG ATGGCGAGCG TATACGGAAA GTTACAAAAA TGTGTTCGAT TTCACGATTA CAATCCTTGC CGTGATCAGC TCGAGTATTG TCTATTATCC CAACGAGTTC AGTGACAGTC GCTTGATTCG AATGATTGTC ATGGCTCGCG TATTACGGCT GATTCGTTTG TTGACTGCCA TGAAACGATT CCAGTTGATT GGTATCATCT CAGTGGAGAT TCTTCCCGCC GCGTCTAGTG CTCTTATGGT CCTGTTTTGT ATCATGTACT TTTTCTCCGC ATTGGGAATG CATCTGTATG GCGGTCTCAT TACCCGTGAT CCCGCGAATT CGTTAGCTTA TTTGCTTTTG GGTACCGACT TTTCTGAAAA CGACTACTGG GCGAACAACT TCAACGATAT GATCAGCGGG ATGAACGTGT TATTCAATAT GCTGGTTGTA AATAACTGGA CGGAGTGCGA AGTCGGCTTC GAGGCGACAA CGCAGGAAAA GTGGGTCAGA TTCTTCTTTC TCTCTTTCCA CGTGTGCGGT GTCATCCTCG TTAACAATTT GGTCATTGCC TTTATCATCA ACGCATTCTT CGAAGAGTTG GCTATATATC GTGAGCGCAC GGACGAAGAA ATTGTCGGTG ACGGCGAAGC CGTAATACGC AACCGACGC
|
Protein sequence | MHHPYVRLAI LLSLSPAAQR DIGVLVKMLP EVMNILLILG VFMVFYAWFG TVMFVGTEEG SMHFSSLIES MWTLWICVTT ANYPDVMMPE YNQNRWVTLY FISFMILSFF FLMNLVLAAV FNEYQLAFQT RKQDRTKASD ENLRKAYALM DSEGMGRIDQ ETVMALFCIL NEDFPEFRTL SDEDTKLLFA ILDKDGSSTI TEEEFMDFGS VLLLEFVKTS AYSTFVELRL PKIFHSSLYQ TFCTVVKSNL FEYSIDAILV MNAVVIGIQS YPELSGQAVQ IDPKYWDGSI DTIWEGVESV FTVIYALEVV VKVLVLGWRA YTESYKNVFD FTITILAVIS SSIVYYPNEF SDSRLIRMIV MARVLRLIRL LTAMKRFQLI GIISVEILPA ASSALMVLFC IMYFFSALGM HLYGGLITRD PANSLAYLLL GTDFSENDYW ANNFNDMISG MNVLFNMLVV NNWTECEVGF EATTQEKWVR FFFLSFHVCG VILVNNLVIA FIINAFFEEL AIYRERTDEE IVGDGEAVIR NRR
|
| |
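The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (a common convention, but not stated in the record), and that the CDS interval excludes the stop codon, which matches the listed gene sequence ending in the codons for the final "NRR" residues. The function names `gene_length` and `codon_count` are hypothetical helpers introduced for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def codon_count(length_bp: int) -> int:
    """Number of whole codons in an unspliced CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3


# Values copied from the record above (locus tag PHATRDRAFT_15329).
start_bp, end_bp = 372303, 373901

length_bp = gene_length(start_bp, end_bp)
codons = codon_count(length_bp)

print(f"Gene length: {length_bp} bp")
print(f"Codons: {codons}")
```

Under these assumptions the computed values agree with the Gene Length (1599 bp) and Protein Length (533 aa) fields, which is consistent with the gene being reported as unspliced.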