Gene Information |
Locus tag | PHATRDRAFT_48542 |
Symbol | |
ID | 7194780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 162265 |
End bp | 165231 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183167 |
Protein GI | 219125814 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
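The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TAA and the gene is listed as unspliced). The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 162265
end_bp = 165231
gene_length_bp = 2967
protein_length_aa = 988


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions the three fields agree: the interval spans 2967 bp, which is 989 codons, or 988 residues plus one stop codon.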
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.295351 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCCG TCGTTACCCA CGGAGAAAAG ACTCCCAGTC TACCGAGAAC ATATTCAGTA CCGCAGATTC GGCACGCGGG TAGGGCGCAC GTGCGGGGGA AAATCCGTCG CGTTTGGCGA CACCGGTATC TGGAATTGTG GGACAACGGG CTCGTGCGGT ATTACGAACT TCCGTCGGCC CGGACGAACG AAGGGAACTT CGGCTCTTCC TCGACAACAG CTGAGAGCCC CCATCCGGAG GACGCCCAAC ACCCACACCG TCGCGTGCAG AAGTATACCT TGGCGATTTA TCACGCTCGC ATTCTCGACG TTACCACCTT ACGAGATATT CACGTGGGAT TGCCGAGGGG CAGTTTCGGC TTTTTGTTTC GGGGACAAAG ACTTTTGCAT TTGGAAACCG ACGAGGAGGA TGAACAAGAA GATGCCATAG TAGACGCTTT GGACGATCCT TCTGGGACCA CCACGATGCG TACGTTGGAG GCACAATCGC AGTCTTACCA CCAGCAGCAA CACCAACATC ATCATAGCCT CTCGGCACTT GCCCTCCTCT CCACGCCATC CATCATGATG CCTACCCGTC GACACAACAA ACTCACTGCT TCGTGTCAAG GCGAAACACC CAAAGAACAA CGGGACTTTC TGTGCGCCGT GAGTAGTTTG GAAGAAGCAC AATCCTGGGT GGTCGCCCTC CAGTGGGCCT CCACGCAACA GCAACACCAA CGGGGACTCG TGTCGGAACC GTGGTGGCTT TCTCCGTCTC CTACGGGAAC CCCTACGGAC ACTCGCTCTG TTTCTACCAC CGCCTTGACC ACCGCCCACA GCACCCCTGC TAATACCAGA AAGGGTGCCG AGCCCGAGCT TCCGATCGTC GTCGATGACT TTGACGATGA CACTTCGTCG TGGGAAATTC ACGACACGCC CAAAGTGTGG ACGCTGGATG GTTCGGCAAG CAGTCCATCT TCCCCACGGA ACAGTCCCGC ACGCACGACA AATGCGGTTG CTATATCCAC CGTTCCGCCA CTTTCCCCCA CCACGACTGG CACTACTAAC GCTAGTGTCT CCACGGCAAT TCCGGGCCAC CGTTCTGCCC ACAAAGCTAC CCATGCCATC GCGCCTTCTT CATCGCCATC CAAAGGCAAA GTCGTCGTTA CCAAAGTAAC CACCTTTCGT ATGGTGCGTC TCTTGGGAAT GTTGAAATTC GAAGTGGCCT ACGAAATTCA CGGACTACTT CTGAAATCCG TTGTGGTTCA CGATTCCGCC GAAACTGCCA CGCCTTGGCA AGCAGAGTCG TGGGTTTTAT TGCGAACGGC CGACGATTTT CAAACACTCC TGAGCGATTT ATGCCAAGAA CTTGGACCAT CCCTCTTGGA TCAGGTACAG CTGGAGCCAA TTCGAAAGTT GCCGCGCTAT CGGCAGTTTC CGTCCTTCCG GACCGTGCAA TCTTCACTAT CGACGGTGGA CAGTATTTTG CGCAGTCTCG TGATGGATGC AAGTATGGTT AACGCAAAGG CCATGAAGAA GTTTTTGGGC ATCGGAACAA CCACCGGAGC GGCGGACCAG ACAAACACGT TTTCGTCCGA CTCTTTCTTG CGACGCTTTT GGCAAGCTCA CGCTTCCCAG TCTACTTTGC AAAATAAAAC TCGCACGTTG CGTCCTCGTA CACGCACGGA CCAATACGTG AAATCCTGGT TGCAGTCCTG TCGCAGTGAT CCGTCGATAT GGGATGTCTA CGCGGTGCGT TGGCTACGAC GGCCTTGGTG GCTCGTGAGT GGTATTGGCG TCACGGCGGC CAGCATTGTA 
CCTCTGGCAC GGTGGTGGCA ACGCGCTATA CCTGTATTGG CGGTGCGGCT GGATGTACTT GTGGTGTCTT GGCTGGGGGC CGCCTACTTT GGCCGTTGGG TCCTTTTGAT TCCCGGCACA GAACGGTCTG CCGTGAACAC CATACTTTCT AACCGATCGG CAACAACCAC TCCCCAAAAA ACCGCCAAAT CAGTGTCGGC TACAACAGGT GATACACAAT CCAAGACAGC AATTACAAAG ATACCGAACG CTTTGATCGA ATCCCAGCAA ATGCAGTTGG AGGGGGACAA TTTTGTGGAT TCTGCTTCGG CATTGGATGA GATAGACTAC GATGGGGAAG AATCGGAAGG GGAAGAAGCG AGTGGACTAC CGGATACTTT AGATGTATCG TTCAACGAAG GTTTGCTATC GTCTCCACTG CCGCAATATC CACGAAACGA CGGGGTTTCG TGCTGGAGTC AACCTCCGTA CGGCATTTTT CATGTCCGGG GGAATACGTA CTTGCAAGAT CGCGTCAAAG TACCTTCCGG TCCCGCACCG TTGACTTGTC GGGGAGTCGA CGTTTGGATG ACGGACAATC CGGAACGACA CATCGCTCGA CACCCAGCGG TATTGGGAGG TAAGCTCGGT GAACACGACA CATTTTTGGT CAATTTTCTG CTGCCCTTTG GCAACTTTGT CGCGTATTTT AGTATCCCCC CTTTGGACAA ATTTCCCGAC AAGCTACGCC AAGTTTGGCT TAATTTTCTC AAAGGCGACC AGCAGTATCG AGATGCGCGT CTTAAGCTGT TGCCTATCGT TATTGAAGGT CCGTGGATTG TCAAGACGGC CGTCGGTCCG GGAAAGTCCC CAGCCCTGTT AGGGAAAGTA ATACCGTTGC AGTACTTTTT CCGCGATCCG GAGCCGGGGG GCCGCAAGGG AGTGTACGAA GTGGACGTAA TAATTACGGC ATCCACCATT GCGAAAGGAA TTTTGAGTGT CGTCAGGGGG CACACTAAAG CGGTAACGAT TGGATTCGCG TTTATCATAG AAGCTTCGAA GCAAGAGGAA TTGCCCGAAA CCGTACTGTG TTCCTTTCAG GTACACTCGT TGCATTTGGA AGATTGTCCC CTTTTACCCG TATGCAATTT GGATAAGGTG AACGATAGTG TGCTAACTGT AAGATAA
|
Protein sequence | MTSVVTHGEK TPSLPRTYSV PQIRHAGRAH VRGKIRRVWR HRYLELWDNG LVRYYELPSA RTNEGNFGSS STTAESPHPE DAQHPHRRVQ KYTLAIYHAR ILDVTTLRDI HVGLPRGSFG FLFRGQRLLH LETDEEDEQE DAIVDALDDP SGTTTMRTLE AQSQSYHQQQ HQHHHSLSAL ALLSTPSIMM PTRRHNKLTA SCQGETPKEQ RDFLCAVSSL EEAQSWVVAL QWASTQQQHQ RGLVSEPWWL SPSPTGTPTD TRSVSTTALT TAHSTPANTR KGAEPELPIV VDDFDDDTSS WEIHDTPKVW TLDGSASSPS SPRNSPARTT NAVAISTVPP LSPTTTGTTN ASVSTAIPGH RSAHKATHAI APSSSPSKGK VVVTKVTTFR MVRLLGMLKF EVAYEIHGLL LKSVVVHDSA ETATPWQAES WVLLRTADDF QTLLSDLCQE LGPSLLDQVQ LEPIRKLPRY RQFPSFRTVQ SSLSTVDSIL RSLVMDASMV NAKAMKKFLG IGTTTGAADQ TNTFSSDSFL RRFWQAHASQ STLQNKTRTL RPRTRTDQYV KSWLQSCRSD PSIWDVYAVR WLRRPWWLVS GIGVTAASIV PLARWWQRAI PVLAVRLDVL VVSWLGAAYF GRWVLLIPGT ERSAVNTILS NRSATTTPQK TAKSVSATTG DTQSKTAITK IPNALIESQQ MQLEGDNFVD SASALDEIDY DGEESEGEEA SGLPDTLDVS FNEGLLSSPL PQYPRNDGVS CWSQPPYGIF HVRGNTYLQD RVKVPSGPAP LTCRGVDVWM TDNPERHIAR HPAVLGGKLG EHDTFLVNFL LPFGNFVAYF SIPPLDKFPD KLRQVWLNFL KGDQQYRDAR LKLLPIVIEG PWIVKTAVGP GKSPALLGKV IPLQYFFRDP EPGGRKGVYE VDVIITASTI AKGILSVVRG HTKAVTIGFA FIIEASKQEE LPETVLCSFQ VHSLHLEDCP LLPVCNLDKV NDSVLTVR
|
| |
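The sequences above are printed in space-separated groups of ten letters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup plus a GC-content calculation of the kind reported in the GC content field. The helper names are hypothetical, and the demo uses only the first three groups of the gene sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-letter groups."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three groups of the gene sequence, used only as a short demo.
demo = clean_sequence("ATGACATCCG TCGTTACCCA CGGAGAAAAG")
print(len(demo), round(gc_percent(demo), 1))
```

To check the record itself, the full Gene sequence field would be passed to `clean_sequence` in place of the three-group demo string.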