Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44850 |
Symbol | |
ID | 7199566 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 443317 |
End bp | 444437 |
Gene Length | 1121 bp |
Protein Length | 289 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179000 |
Protein GI | 219116410 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGAAA CGCCCATACC TGTAGACCCC ATTCCGGTTT CAAGTCAGCT CCACGCCGAC GATGCTTTCG TCTCCTTGAC GGTCAGTACG GCCGAAGCCA TGAACCCGAC TCGACGGAAT ACAATGGAAG ACCGCTACGT TATCCACGCC CCTGGCACGT GGAATGCTCC CGACCCCGAC ATGGCGTATT TGGCGGTGTA TGATGGACAC GGCGGTAAGT ACGATCCGCG GAGACTCCTT TCTTCATATT TGGTGGCCCC TAAGTGGCTT ACTGTGCAAT ATTTTTGGGA ACAGGCCGTG ACATGGTGGA CTTTCTGGAA CATGGAATGG CCTATCACGT TGCCCAAGAG TTGACGGACC AAACCGACGA CGCCCCAATC ACCACCAGGC TTGAAAGGGC CTTTCTTATG GCCGACATTC ATTCGAAACA CGCAGGGATC ACAACATCCG GGGCTACTGT AGCCGTATGT CTCGTACAGG TGCGTTCGTA AGGAGTTTGC AGTCGACTCA CTATTCCTAC ATGGCTCTGA CACTCCGGAT GCTCTCGCTC AATTTGGCGC AACCATGTAT AGAGAGACCC GTCCACTCCC CAGAAACTCC GTATCTTTAC TGCCAATGCC GGAGACTCTC GGATTGTATT AGGTCACGAC GGGGAAGCAA CGCGTTTGAC GCTAGACCAT CGTGTGGATG ATCCGAAAGA AATAGCTCGT ATCCAAAAAG CTGGTGGCTT CATCTTCAAG GGCCGAGTTA TGGGGGTTTT GGCTGTAACT CGGAGTCTAG GGGATCACAT TTTCAAAACC TTTGTCATTG CGCACCCGGC GGTTCGTGAA TTAGACCTTC AACTGGAGAG CGGCTCCGTT GAACCCAGTT TTCTCATCAT CGCCTGCGAT GGATTGTATG ATGTCATGAC AGACAAAGAA GCCGTCGACT TAGTGCGGAA CTTTGCGGGG GAGAGAGAAG ATGCTGCACA GTTTTTGGTG CAGCAGGCAC TGCGGAGAGG GACTACAGAC AACGTAACTG CAATTGTGGC TTGGCTCGAG TAGGGAAAAA CTTTCACCCT ACCACGATGG GAATGCAATA ATTACATTCT AATTTGCTAC GTAATTTTCT GTTTCTTGTT C
|
Protein sequence | MIETPIPVDP IPVSSQLHAD DAFVSLTVST AEAMNPTRRN TMEDRYVIHA PGTWNAPDPD MAYLAVYDGH GGRDMVDFLE HGMAYHVAQE LTDQTDDAPI TTRLERAFLM ADIHSKHAGI TTSGATVAVC LVQRDPSTPQ KLRIFTANAG DSRIVLGHDG EATRLTLDHR VDDPKEIARI QKAGGFIFKG RVMGVLAVTR SLGDHIFKTF VIAHPAVREL DLQLESGSVE PSFLIIACDG LYDVMTDKEA VDLVRNFAGE REDAAQFLVQ QALRRGTTDN VTAIVAWLE
|
| |