Gene Information |
Locus tag | PHATRDRAFT_50630 |
Symbol | |
ID | 7199474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011701 |
Strand | - |
Start bp | 18925 |
End bp | 21887 |
Gene Length | 2963 bp |
Protein Length | 971 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185607 |
Protein GI | 219130934 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
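The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding region is the protein length plus one stop codon. Under those assumptions the coordinates reproduce the listed gene length, and the coding region accounts for all but 47 bases of it.

```python
# Hypothetical helpers for sanity-checking the record's length fields.
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate interval."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


length = gene_length(18925, 21887)   # values from the record
coding = coding_length(971)          # protein length from the record
non_coding = length - coding         # bases not explained by the CDS itself

print(length, coding, non_coding)
```

With the values from this record, the interval length matches the listed Gene Length of 2963 bp.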
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0698106 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACCATGGTA GTGGAAGCTA TCAAAACCAA ACCACGACGC CGTCGTGGTT CCAAATCGGA AAGTAGGCAA CACAGTCAAA GCGAGAGCGC CGCGGTGGAA AGTTCGTCAA CGTATCAGCC GCACAGTACC TTGCTGATTC AACTGAACGA AGATGCCCCA ACCTGGTACC GCCTCGGCAA CAAGCAGTAC GCGGAAGAGC GAGACGCTAC CACCGCATCC AATCCACCCG AAAAGGGACA ACGAGTTAGC AAAGATCTCG TTCTAAAGTA TCGGGCCCTG GGAGACGCAA TCTATCGACG GGAGGTGCAG CTCTTTGGCA AGGAATCAAA CACTTCGGAT GATCGATGGG TAGAGTCCAC CATGAAGAAA GGTACACTCA AGGATCGCGT CGCTGCTATG AGTGTGGTCG TTAGCACTGA TCCGGTACAT AAGTTCTACG TATTGGATGG ACTGTTGAAC ATGGCGGGCT GTTCCGATTC GAATTCTTCG ACTCAAACAA ATTCTCGTGT GGCTCAGCTT GCTGCAGAAG CACTGGAAGA CCTCTTCGTT AACACCTTTC TACCGAATCG GCGCAAACTG ATATCAATGG AACAACGTCC CCTATACCTG TACGAGAGCG AAGGCGTAAA GAACACGACG AAAAAGACCC TGTCCCCGCG GATTCTTCTT CTATGGCGCT TCGAAGAAAT GGTCAAGGCC AAGTATCATC TGTTTCTACG TCAATACGTA TCCCTAATAC TTCGAGAAGG AACGGAGTTA CAAAAAATTC CAACTATTCG TCTTGCTGCA GTTCTTTTGC GTTCCATTCC AGAAGGAGAA TCTACATTGT TACCAGTAAT TGTCAATAAA CTTGGCGACC CAGCCAAAAA AGTTTCGGCT GGCGCTGCCT TTGAGCTACG AAAACTTCTC CAACAGCATA CAGCTATGCA AGTGATTGTG GCGCGAGAAG TGCAACAGCT TGCTCACCGA CCACATCTAT CATCACGGGC TTTATATAAT TGTATCACCT TTTTGAACCA GCTGAAATTA AAACGGGAAG AGACTCAAGG GGGAGCCGAC GAAGCGACCG AGCCGTCACT ACCTGCGTCT CTGATCAGCA CGTACTTTCG TTTGTTCGAA CTTGCCGTAC AAAAACCTAA AAAAAAGGAA ACAGCGAATG AGGAAGAGGC AGGAATGAAA TCTCGACTTT TATCGGCACT GCTGACAGGC GTGAATCGAG CCCACCCCTA TTTGCCTCAT CATGATTCAA CAATGGAACA GCACATCGAT GCCCTCTACC GGGTCGTGCA TACAGCACCG GCCGCTGCCA CGACACAGGC GCTACTACTG CTTTTTCACT TGTCCGTTGG TGCGGAATTC GAACAGGACC AGCGACAAAC GATTTCGCGC AAATTACGCC CTGAAGAGCA TGCCCGCCGT GATCGCTTCT ATCGAGCCTT GTATTCAACG CTGGCGCAGC CCTCCCTTTT GGGTACCGGC AAACACCTTA CAATGTTCTT CAATCTTCTT TACAAAGCCA TGAAATATGA CAATGACCAA ACTCGCGTGG TTGCATTCGC GAAACGTATT TTATGTACAA CCATTCACTG TTCTTCATCG GTAGTTGCTG GTTCTCTTTT TCTGCTTAAC GAAATAACGA AACACCACGG GAACCTACTA TCCTGCTTTC AAGATGTCTT GGAAGGATCC GACGCTTTTC GTGTTTTGGA TCCAACCAAA CGAGAACCTC GCGGTGCTTT AGTCTTATCG GAGTATGTCG ATGCACCTGA AATAGCTTCC GAAGAAAACG AACAATCAAT 
CGAGAAAGCT ATAACGAAAG CTCCGCTGTG GGAATTGACG TTGTTGTTGA AACATTTTCA CCCATCGGTC TCAAGGTTCG CCAGTGCAAT TGGGAACATT GACTACAGTG GCGACCCTTT GCGCGATTTT GGTGTGGGAC CGTTCCTGGA TAAGTTCGCT TACCGTAATC CAAAGTCGAT TGATCGGGTA GCTGGCAAAT TTCAACGCGG TGAGAGCGTG GCAGAGCGAA AGAGTGGTAC TGGGCTTTTG GTAGAGTCAC AGGTTGCGCT ACCTCTGAAT GACCCAAGCT TTTTAAGCAA TCCCAACGTC GACGCACCTG ATGACTTTTT CCACAAATTC TTTTTGGAGC AAGCTCGGCG TGACAAACTC AAAGGCATTG TCCGGCATAA ACCAAAAGTT GATGCTGTAG AACATTTGGA GGAAGATGCT TTTGACGAGG CCGAAGTAGC AACTTTGGAC GTACAAAAAT TTGATGATTT GGAGCAAGGC TGGGAAACCG ATGATGATGA GGAGGCCTAT GTCGACGCTT TGGCACAAAA AATTATCGAA GACTCGATTA ACGAAAACGG GCCCGCGGAT CTTGACGAGG AAGATCCAGA TATGGAAGGT TGGGGGGATA TGTATAGTGA TGAAGAACTA GAGGATGAGA GCGACGACGA AAGTGAGTCG TCACAGAAGG GCAAAGCCCT GACCAGAAAC GACACCATTG TAGGCGACGG TATCAAAGAT CTTGACAGCG AAGAAAATGA TGTCGATGCG TTCATGGATA TTGACGGAGC CGATAGTAGC GTAAGTGACG ATGACGAGCT GTTCATGGAC GAGCAGTTGG TGTTGATGAG TGTCGACTTG GATGGTTTGG ATTCGTCGGA CGACCACATC TCCGACAGCG GTGTGGATGG CAACTTGACG TTGATAAATG AAGAGAAATC TAATGACGGC GAAGTTGTCG AGGAAGGAGC CTCGTTGAAC CATACAGTAG AGGGATTAGC CACCTTTGTA GATGCTGATG AATATGAAGC AATGATAACG AAATCCTGGA ACGAGAAAAA GCGCTCGAGA AAACGAATCG ATGGAAAAAG TGACCATGCA AGAGCGACTT CACCAAAGAG AAGGATGTAA ATATTGCATA GCTTACAACA GCTAAATCAC CACCGTGCAG TAC
|
Protein sequence | MVVEAIKTKP RRRRGSKSES RQHSQSESAA VESSSTYQPH STLLIQLNED APTWYRLGNK QYAEERDATT ASNPPEKGQR VSKDLVLKYR ALGDAIYRRE VQLFGKESNT SDDRWVESTM KKGTLKDRVA AMSVVVSTDP VHKFYVLDGL LNMAGCSDSN SSTQTNSRVA QLAAEALEDL FVNTFLPNRR KLISMEQRPL YLYESEGVKN TTKKTLSPRI LLLWRFEEMV KAKYHLFLRQ YVSLILREGT ELQKIPTIRL AAVLLRSIPE GESTLLPVIV NKLGDPAKKV SAGAAFELRK LLQQHTAMQV IVAREVQQLA HRPHLSSRAL YNCITFLNQL KLKREETQGG ADEATEPSLP ASLISTYFRL FELAVQKPKK KETANEEEAG MKSRLLSALL TGVNRAHPYL PHHDSTMEQH IDALYRVVHT APAAATTQAL LLLFHLSVGA EFEQDQRQTI SRKLRPEEHA RRDRFYRALY STLAQPSLLG TGKHLTMFFN LLYKAMKYDN DQTRVVAFAK RILCTTIHCS SSVVAGSLFL LNEITKHHGN LLSCFQDVLE GSDAFRVLDP TKREPRGALV LSEYVDAPEI ASEENEQSIE KAITKAPLWE LTLLLKHFHP SVSRFASAIG NIDYSGDPLR DFGVGPFLDK FAYRNPKSID RVAGKFQRGE SVAERKSGTG LLVESQVALP LNDPSFLSNP NVDAPDDFFH KFFLEQARRD KLKGIVRHKP KVDAVEHLEE DAFDEAEVAT LDVQKFDDLE QGWETDDDEE AYVDALAQKI IEDSINENGP ADLDEEDPDM EGWGDMYSDE ELEDESDDES ESSQKGKALT RNDTIVGDGI KDLDSEENDV DAFMDIDGAD SSVSDDDELF MDEQLVLMSV DLDGLDSSDD HISDSGVDGN LTLINEEKSN DGEVVEEGAS LNHTVEGLAT FVDADEYEAM ITKSWNEKKR SRKRIDGKSD HARATSPKRR M
|
| |
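The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method: the Translation table field is empty in this record, so the standard genetic code is assumed, and the function names are hypothetical. The sketch strips the 10-base spacing, starts at the first ATG, and translates until the first stop codon.

```python
from itertools import product

# Standard genetic code (an assumption; the record leaves
# "Translation table" blank). Codons are ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG up to, but not including, the first stop."""
    dna = clean(dna)
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate_from_first_atg("CACCATGGTA GTGGAAGCTA TCAAAACCAA"))
```

Applied to the first 30 bases of the gene sequence, this yields MVVEAIKT, which agrees with the start of the listed protein sequence.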