Gene Information |
Locus tag | PHATRDRAFT_19692 |
Symbol | |
ID | 7200091 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 473206 |
End bp | 474499 |
Gene Length | 1294 bp |
Protein Length | 356 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179438 |
Protein GI | 219117287 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGGGTTTTA CCATCGACCG GAAGCTCAAA GTCTGTTATC TGATCTTCAG TATCGTGTCT TTGAAAATAT TCTTTCACTA TTAAGAGAGG GAGAAGCAAT TCTGCAAGGC TTGGACTTTC AAAAAAAAAC CTTTTCTTCC AATTATGGCC ATTAGCTCAG CCGAAGCCGC TTCTGGAACG CGCTTGACGA TTCGATTGCC CGACGACTTC CACCATCACT GCCGAGACGG AGCTAAGACA GCTGCAGTTC TTCACCATGC CGTACAGCGG TTTGGGTACT GTCTAATGAT GCCCAACTTG CAGCCTCCGG TGACGACCAC TGAGATGGCC TTGAACTACA AATCTCATAT CATGGCGTCG ATGCCTGAGG GGTCGTTCCC ATATTTCAAG CCAGTAATGA CTTTGTACCT TACTGATAAG ACAACACCGG ACGAAATTAA GAAAGCATCT TCGTCAGGAG TTGTTGGATG CAAGTTTTAT CCAGCTGGGG CCACAACAAA TTCTGCCTTG GGTGTAACGG ACATAAAAAA CTGCTATCCT GCGTTGCAAG AAATGAGCGA CCAAAATATG ATGTTGTGCA TTCATTCCGA GGTAACTCAT GCAGATATTT TTGACCGAGA ACCTGTATTC ATCGAAGAAA TTATGACACC GTTAGTGACC GACTTTCCGA ATCTCAGGAT TTCCATGGAG CATATAAGTA CAAAAGAAGC AGTTGATTTC GTTCTCTCCT CTCCTGACAA TGTCAAGGCG AGCATAACTT GCCACCATTT GCTTTACAAT CGAAATCGTG AGTATAAGGA CGTGGGAATG GGAACCATCC TGTGTAATGG ACTTTTTATC TTATATCACC TTTTCCTCTG CTTTAGACAT GCTTGTAGGC GGCATTCGAC CCCATCTATA CTGTTTACCG ATTCTCAAAG CGGAAATTCA TAGATTGGCA CTAGTTAAAG CTGCAACATC TGGAAGTAAG AAGTTTTTTC TTGGAACCGA CTCAGCACCA CATTCGACGG ATATGAAAGA GGCTTTTTGC GGATGCGCTG GAATTTTCAC GGCTCACGCT GCGGTTGAGT TGTACGCTGA AGTTTTCGAT AAAGTCGGGG CTTTGGACAA GCTTGAGGCT TTTTGTAGCA GCAACGGCGC GGACCATTAT GGCCTTGAAC GAAATACTGC AACAATTACT TTGGAAAAGA AATCTTGGAG GGTTCCGAGA ACGTTTGATT TCGGTGACGG AAAGGGTGTA ACACCCCTTC GAGCTGGAGA AGAAATTAAA TGGTCCATTG CTGCACAGGA ATAA
|
Protein sequence | MAISSAEAAS GTRLTIRLPD DFHHHCRDGA KTAAVLHHAV QRFGYCLMMP NLQPPVTTTE MALNYKSHIM ASMPEGSFPY FKPVMTLYLT DKTTPDEIKK ASSSGVVGCK FYPAGATTNS ALGVTDIKNC YPALQEMSDQ NMMLCIHSEV THADIFDREP VFIEEIMTPL VTDFPNLRIS MEHISTKEAV DFVLSSPDNV KASITCHHLL YNRNHMLVGG IRPHLYCLPI LKAEIHRLAL VKAATSGSKK FFLGTDSAPH STDMKEAFCG CAGIFTAHAA VELYAEVFDK VGALDKLEAF CSSNGADHYG LERNTATITL EKKSWRVPRT FDFGDGKGVT PLRAGEEIKW SIAAQE
|
| |
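The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the translated region ends in a single stop codon. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 473206
END_BP = 474499
PROTEIN_LENGTH_AA = 356


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
# Bases in the gene span not accounted for by the coding sequence
# (introns and/or untranslated sequence).
non_coding = gene_len - cds_len

print(gene_len)    # 1294, matches "Gene Length | 1294 bp"
print(cds_len)     # 1071
print(non_coding)  # 223
```

Under these assumptions the span length agrees with the listed gene length, and the 223 bases beyond the coding sequence are consistent with the record marking the gene as spliced.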