Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45518 |
Symbol | |
ID | 7200597 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 422181 |
End bp | 425391 |
Gene Length | 3211 bp |
Protein Length | 918 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179642 |
Protein GI | 219117704 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTGCAAAA CATGAAGCTC TCCATCGGAG GATGGGAATG GAAGAAGAAA GAAGGGACCT ACAAGAAGGA AGTGTCAAAC AAGGAAGGAA ACAAACTCAC CAACACATCG TCAGCCACCG AAAGAATTCT AGAGAGAAAT AGCATTATAC AATGAGCTTC TTCTGGACTT CCTCGATCAC AAACGAACCT GAGAATTCCT TGGATACCGG TGTCACTGTC AGCAATTCGA GACCATCGTC ATCGCTGCAA CAGCAGAACA AGAATATAAA GGAACGATTT GGAGGAATCG AGACAGAATC CCGTCAAGTT CCATTCTTGC CTGACGAAAA CTTGACTATT GAATCAATAT ACGACTTTTA TGTGCATACA CTATTCCCTG CTGTGATGAG TTCTTTGCCT TTATTGGGTC GTCGCCTTTT CCATGACACG CAATCGGTAC TTCATGATAG TTTTCAACGT TTTGTAAACT TTATTCTTCC GGACTATTGG ACGTCTTCCT CCTCATACTC ACATCCGCAG CTACCAAGCT TTCGGCGGAT TTGGGAAGAA AACGCCGCTG ACCTGTCCGT TTCCGACTTT TTGCTCCGAC ATTTCGATAC CAACCACGAC GGCAAGATAT CTCCCGAGGA ACTCTTTCAC GTGGACGAGA TGCTACTCTC TCGCATACTA CCTCGCTCAG ACGAAAGTTG GTGGAGCTGG TTCTCCCGCG AATGGCCCCT GTTGGATTGG AAAGTGGGAC TTTTCTTGTG GCGCACTTTT GGTGGCGTGC TCTTAACCCT GGCCGTTTTA TCGATTGTAC CGGGACGGTT GCATGGCTGG TCGGCTCGTG TCTTGCGATG GCCCATTCTC GGACTCACTT ACTTTCTCAT CGCCGTTGAA CTTATGGTCT ATATCGTAAT TCGCATTTTC ATTCGGATTG CCGAATACAT AATCGCACGG CCCAAGCATC GAGCACTGCG CCAACAAATG GCTAAATCCC AGTCCTATGA AGAGTGGTAC GGCTACGCAG AAGATTTGGA CAGATCACAG AAACGAGATG TATGGATTCG ACGAATCAAA GATCAAACAT CTTTTCACTA CAATTGGAGC TTTGTTCAGG ACCTCATTCG GGATTTGCGT AACGCTCGCG AGCAAGGCGA CTCCATACTA GCGCTTGCCG TTATTCAACA ATGCACGCGG AAAAATGTAG GCGGTATTAT GAGCGAAGAT TTATTTTCAT ATACGAATAC GGGGGAACCG AAAGCAGTCG TACGGGAGTT CATCGAGGAA GCTGTCCAAA CCTTGCACTG GCTCACTGAT GAAGCATTGC ATATTCCAGT AGATGATTCC GCACAACAGC AAAGCAACGA AAAACGCACC TATGAAGAGC AAATGGAAGA AACTGTTCAA GCCGAAAAGG ACAAAATATG GAAATCATTG ACCAATTTGT TTGGTAATGG CGAAGATGGA AAACAAAATG ACGATGTCAA GACGAAAGAT GGAAATGAAA CGCATTCGCC GGAATCCGCC CCCCACCAAG ACATAGAAAG TCGGACGGAT AATCCTTTGC CATCCTTTCA CCGACAAGAG GTTTTAACGT TTTTGAAGCG GGCTCGAGCA GCTTACGGTC GAACCGCTTT GTGCCTGAGC GGTGGAGCGA TGATGGGAGT TTATCATTTT GGACACGTTC GGGCTCTACT GGAAACTGGT TCTCTGCCCA ATATTGTACG TACCGGGATC GTGTTTTTAG ATCACACACT CGTTACAAAT AGGAAACTTT GTATTTATGT GCGACAACGC CTGATGTTTG TTTGCTTCTT TTTTGTGAAA AGATTTCAGG AACTAGCGCA GGAAGCATTA TTGGTGCGAT TTTGTGCACC AGGAAGGACG ATGAATTGGA TCGTGATTTA CGCCCAGAAA TTTTAGTACA TAAGCTTACG TGCTTTTCGC GTCCTTGGAG GGAGAGGATC TCTAGCCTTG TCAAAACCGG GAGCATGTTT GATGTCGATG AATGGTTGGA GCTACTGAAA TGGTGAGTTA CTTGCTGCTG CTTGTTTGCA TCCCTATACG TCAGGTCGTT CTTGTCTCAC CGTCATGTTT ATAAGGTTTT GTCGGGGTGA TATGACATTT GCTGAAGCAT ATCGTTTGAC AGGGCGCGTT TTTTGTATTA CTCTATCCCC TACTACGAAG AAGGCGCCGC CGGTGTTGAT AAATTATTTG TCTGCGCCCA ACGTTACGAT AGCCTCTGCT GTCGTAGCGA GTGCAGCTGT ACCTGGGTTT GTTGCGCCTG TTCGCTTGCG TATCAAGGAC ACCAATGGCG TAGTTCAGAG AGGCGGAGCA AAGGACGAAG CATACTTTGA TGGATCGATC AAACAAGACA TTCCGACTAC AGGATTAGCG GAGATGCTCA ATTGCCAGTT TTTTGTCACG GCTCAATGCA ATCCACATAT TGTTCCAATG TTTTACAACA GTAAAGGCGG AGTTGGGCGT CCCAGCCGAT GGTCCAGTGG AGCACAAGAA GATTCCTGGA GAGGCGGGTT CCTACTGGCC GCGTTGGAGA TGTATCTCAA GAATGACATG AAAGCCAAGT TTGTTTTTCT TCGTGATCTA GAGGCGGCTG TTGGCTTTAC ATCGGAGCTA CTGACGCAAG ACTTTGTAGG CACGACAACC ATCGTACCTC AAGTTTCCTT TAAGGATTAT TTCGGAGTAA GGACAAGCGC AAATGAGATG TGTTGCCCAA ACAATCGCAT GAAAAACTAA CTGCTTCTTT CTGTTCGTTC GTTCGCAGCT CTTTGAGAAT CCGTCTTTGG AGCAACTCCA GCGGTGCTGT CATGCAGGAT CTGTTGCTGC GTACGAACAT ACTGTTATGA TCCAGATGCA TTACAGCATT TCGGATGCAC TGGAGGAATG CATTGCAAAG CTTGAAACCA ATAAGCGCAA AGTACATATT CGGCGACGTA CAAAGCTAGG ATCGGCGAGT ATGACGAGAG GCGATCCAAA GGGAGGGGTC GTGGAGGAAT CAACGGAAGC GCGAATCCAG CCAACAGTGG AAAATACCCA GACGTTTCTG GTCGGTGGGT TGACATCAGA CGGACTGAAA GTACGTACAG CGTTCAATGA ATCATCGGAT GATACAGATC GAGAATCGGA GTACGATGAG TTCGAAGCTG ATTGGACGGA CCTTAAGTAA AAATTGCATC GGCTTTAAAA TACAAAGAAA GTTGAACTGT T
|
Protein sequence | MSFFWTSSIT NEPENSLDTG VTVSNSRPSS SLQQQNKNIK ERFGGIETES RQVPFLPDEN LTIESIYDFY VHTLFPAVMS SLPLLGRRLF HDTQSVLHDS FQRFVNFILP DYWTSSSSYS HPQLPSFRRI WEENAADLSV SDFLLRHFDT NHDGKISPEE LFHVDEMLLS RILPRSDESW WSWFSREWPL LDWKVGLFLW RTFGGVLLTL AVLSIVPGRL HGWSARVLRW PILGLTYFLI AVELMVYIVI RIFIRIAEYI IARPKHRALR QQMAKSQSYE EWYGYAEDLD RSQKRDVWIR RIKDQTSFHY NWSFVQDLIR DLRNAREQGD SILALAVIQQ CTRKNVGGIM SEDLFSYTNT GEPKAVVREF IEEAVQTLHW LTDEALHIPV DDSAQQQSNE KRTYEEQMEE TVQAEKDKIW KSLTNLFGNG EDGKQNDDVK TKDGNETHSP ESAPHQDIES RTDNPLPSFH RQEVLTFLKR ARAAYGRTAL CLSGGAMMGV YHFGHVRALL ETGSLPNIIS GTSAGSIIGA ILCTRKDDEL DRDLRPEILV HKLTCFSRPW RERISSLVKT GSMFDVDEWL ELLKWFCRGD MTFAEAYRLT GRVFCITLSP TTKKAPPVLI NYLSAPNVTI ASAVVASAAV PGFVAPVRLR IKDTNGVVQR GGAKDEAYFD GSIKQDIPTT GLAEMLNCQF FVTAQCNPHI VPMFYNSKGG VGRPSRWSSG AQEDSWRGGF LLAALEMYLK NDMKAKFVFL RDLEAAVGFT SELLTQDFVG TTTIVPQVSF KDYFGLFENP SLEQLQRCCH AGSVAAYEHT VMIQMHYSIS DALEECIAKL ETNKRKVHIR RRTKLGSASM TRGDPKGGVV EESTEARIQP TVENTQTFLV GGLTSDGLKV RTAFNESSDD TDRESEYDEF EADWTDLK
|
| |