Gene Information |
Locus tag | PHATRDRAFT_42726 |
Symbol | |
ID | 7196361 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 906036 |
End bp | 908770 |
Gene Length | 2735 bp |
Protein Length | 814 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177187 |
Protein GI | 219110871 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
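The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the listed gene length) and that the gene span runs from the start codon through the stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 906036
END_BP = 908770
PROTEIN_LENGTH_AA = 814


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed, 1-based interval [start_bp, end_bp]."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length_aa + 1)


gene_length = span_length(START_BP, END_BP)      # 2735
cds_length = coding_length(PROTEIN_LENGTH_AA)    # 2445
# Assumption: whatever is left over is intronic, since the gene is spliced.
non_coding = gene_length - cds_length            # 290

print(gene_length, cds_length, non_coding)
```

Under these assumptions, the 2735 bp span holds 2445 coding bases, and the remaining 290 bp would be accounted for by introns, consistent with the "Is gene spliced | Yes" field.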
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATGT ATGTGCTCAA ACGCAACGGC AGACGCGAGT CTGTGCACTT TGACAAGATC ACAAGCCGCG TTTCCAAGCT CTGCTACGGA TTGGACGCCA AGGTAGGTTG CCACGCTCCT CGGAGTGACC CTGAAAGGCA TTGGTGCGGC TCTGCCGCGA CGTCGACACG TTCCGACGTC CTCTCACCGC ATGTGTGTAC GGTCGTTTTC TTTTAGCATG TGGATCCGGT GATTATTTCC CAGAAAGTGA TTCAGGGTGT CTACCCGGGT GTGACGACCT CGGAACTCGA TGAGTAAGTA ATTCCTGTAT CGCAAATCTC CACGCGAGAA TCAGTGGGTA CCGGAACGGG TGAACGGTTC TTACCGCGGG CATTTCTTGC TTTTACAGAC TCGCGGCACA GACCGCTGCT TCGTTCGCTA CCCAGCATCC CGATTATTCC ATCCTGGCGG CGCGCGTTTC GGTTTCCAAC TTGCACAAAA TGACGATTGC TCGTTTCTCC GATCTCGTCG AAGTCTTTTA CAATTACAAA CACCCCAAAA CGGGTGAACC CGCCTCTCTG ATTTCCGAAG AAGTGTATCA GATTGTGCAA AAGCACAAGG ACGAACTCGA CGGCGCCATT GTACACGCGC GTGATTTCGA GTACGACTAC TTCGGGTTCA AAACGCTCGA AAAGTCCTAC TTGCTCAAGG TCGATGGCAA GATTGCCGAG CGCCCGCAGC TTATGCTCAT GCGTGTGTCC GTCGGTATCC ACGGAGACGA TATCCCTCGG GTCATTGAGA CCTACAACTT TCTGTCGGAG CGATACTTTA CGCACGCCAC CCCCACGCTC TTCAACGCCG GCACCAACAT GCCACAGATG AGTTCCTGCT TTCTCATGAC AATGAAGGAA GACTCCATCG ACGGCATCTA CGATACGCTC AAGAACTGCG CCGTCATTTC CAAGTATGCT GGAGGTATTG GCCTCGCCAT TCATAACGTT CGGGCCAGCC AGAGCTACAT TCGCGGCACT AACGGCTCCT CCAACGGCAT CGTACCCATG CTCCGTGTCT TTAACAATAC GGCGCGGTAC GTCGACCAGG GCGGTGGCAA ACGCAAGGGA TCCATCGCCA TTTACTTGGA ACCCTGGCAC GCGGACGTCT TTGCCTTTTT GGATCTGCGC AAAAACCACG GGAACGAAAG TGATCGCGCG CGCGATTTGT TCTTCGCCAT GTGGGTGCCG GATCTCTTCA TGAAGCGCGT CAAGGACAAC GGCACCTGGT CCTTGTTATG CCCCAACGAG TGCCCCGGTC TCGCCGATTG TTTCGGCGAC GAGTTTGAAG CCCTGTACGA ACGCTACGAA CGAGAAGGCA AAGTCCGACA AACCATCAAG GCGCAGCAGC TGTGGTTTGC CATTCTTGAC TCCCAAGTCG AGACGGGAAC GCCGTACATG CTCTTCAAGG ATCACTGTAA CCGCAAGTCA AACCAGCAAA ATCTGGGTAC CATCAAGTGC TCAAACCTGT GTACCGAAAT TGTCGAATAC ACCGCACCGG ACGAGGCGGC CGTTTGCAAC TTGGCGTCCA TTTCGTTAAG CAAACTGGTG GTACCCGGCC AATATGGTCA GGGTGGGTCG TTTGATTTTG AGAAGCTCCG TGAAGTCAGT GGAGTCGTGA CCAAGAATCT CAATCGCATT ATTGACCGAA ATTTTTACCC CATCGAGGAA GCGAGGCGGT ACGTGCGAAT TCTTTTTTAA CGTGCGACGA ACACGGAAGT CACCCACGCT CACCTTTCCC ACCTACTCCA TTTTTGAAGC TCCAATATGC 
GCCATCGCCC AATTGGAATC GGCGTCCAAG GCTTGGCGGA CTGTTTCCAA ATGCTGCGAA TTCCTTTCGA TTCTCCAGAA GCTCGTAAAC TCAACACAGA TATCTTTGAG ACCATCTATT TTGGAGCCTG TACTGCCAGT TGTGATTTGG CGGCAGTGGA CGGCCACTAC GAGTCTTACC CTGGGTCTCC ATCGAGCAAA GGACAGCTTC AATTTGATCT TTGGAACGTA CAGCCCAGCA ACCGATGGGA CTGGGCCAGC CTCAAGGCAA AGATTGCGGA ACACGGTATC CGCAATTCAC TGTTGGTAGC TCCCATGCCA ACGGCGTCGA CCGCCCAAAT CCTTGGAAAT AACGAATCGA CTGAGCCCTT TACATCCAAC ATGTACAACC GACGCGTGCT AGCTGGCGAA TTTACCGTTG TTAACAAGCA CTTGTTGCGT GAGCTGACGG CTCGCGGCAT CTGGACTGAA AACGTCCGCA ACCGAATCAT TGCCGAGAAT GGATCTATCC AGAATGTCCC CGAGATTCCG GTGGAGATTC GCGAGATTTT CAAGACCGTG TGGGAAATCC CACAACGTGC GATTTTGGAT ATGGCAGCGG ATCGCGCTCC TTACATCTGC CAAAGCCAAA GCTTGAATGT GCACATTGCC GACCCTAACT CGAAGAAACT GACTTCAATG CACTTCTACG CCTGGCAGAA AGGGCTCAAG ACAGGAATGT ACTACTTGCG TACGCGCCCC AAGGCCGACG CAATCAAGTT TACCGTTGAC CAAGAACAAC TCGTCTCGAA CAAGATGAAG GAGGTCAAGC TGGTGGACAA GGAAAATTCA AGCAACATGG TTACGCCACC GCCACCAGGG GCGACCGCCA TGAAGGGCAC ACAGGGGCTA ACCACGGAGG ACGAAGAAGA TATGTGTCTA AGCTGCGGGG CCTAA
|
Protein sequence | MSMYVLKRNG RRESVHFDKI TSRVSKLCYG LDAKHVDPVI ISQKVIQGVY PGVTTSELDE LAAQTAASFA TQHPDYSILA ARVSVSNLHK MTIARFSDLV EVFYNYKHPK TGEPASLISE EVYQIVQKHK DELDGAIVHA RDFEYDYFGF KTLEKSYLLK VDGKIAERPQ LMLMRVSVGI HGDDIPRVIE TYNFLSERYF THATPTLFNA GTNMPQMSSC FLMTMKEDSI DGIYDTLKNC AVISKYAGGI GLAIHNVRAS QSYIRGTNGS SNGIVPMLRV FNNTARYVDQ GGGKRKGSIA IYLEPWHADV FAFLDLRKNH GNESDRARDL FFAMWVPDLF MKRVKDNGTW SLLCPNECPG LADCFGDEFE ALYERYEREG KVRQTIKAQQ LWFAILDSQV ETGTPYMLFK DHCNRKSNQQ NLGTIKCSNL CTEIVEYTAP DEAAVCNLAS ISLSKLVVPG QYGQGGSFDF EKLREVSGVV TKNLNRIIDR NFYPIEEARR SNMRHRPIGI GVQGLADCFQ MLRIPFDSPE ARKLNTDIFE TIYFGACTAS CDLAAVDGHY ESYPGSPSSK GQLQFDLWNV QPSNRWDWAS LKAKIAEHGI RNSLLVAPMP TASTAQILGN NESTEPFTSN MYNRRVLAGE FTVVNKHLLR ELTARGIWTE NVRNRIIAEN GSIQNVPEIP VEIREIFKTV WEIPQRAILD MAADRAPYIC QSQSLNVHIA DPNSKKLTSM HFYAWQKGLK TGMYYLRTRP KADAIKFTVD QEQLVSNKMK EVKLVDKENS SNMVTPPPPG ATAMKGTQGL TTEDEEDMCL SCGA
|
| |
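The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text might be cleaned, checked for GC content, and translated follows. The record leaves the translation table field empty, so the standard genetic code is assumed here, and the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed; the record's translation table field is empty).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 until a stop codon or the end of the sequence."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTCGATGT ATGTGCTCAA ACGCAACGGC"
print(translate(first_blocks))  # MSMYVLKRNG
```

The translated fragment matches the first ten residues of the listed protein. Translating the full gene sequence in one pass would not reproduce the protein, because the gene is spliced and the introns would have to be removed first.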