Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_22187 |
Symbol | GDCP |
ID | 7203267 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 205248 |
End bp | 208721 |
Gene Length | 3474 bp |
Protein Length | 1005 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | glycine decarboxylase p- protein |
Protein accession | XP_002182636 |
Protein GI | 219124701 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCAGTTGGT TTTTTTGCCG TTCCGGAACG ACAAATAAAC GAAGACCCAA AGTCGATCTC CCAGGACGTG CCAATCATCT TTTTTTATCT CCGCCACCAT GATTCGACGT TCTATTGCTC TTCGTCGAAT TTTGGCACGC GAAAAGTGTA GCCGAGCCTT TCACGCGTCG GCGGTTTTCG CGGACGCCTT GGACATGAAA GACACCTTTG CCCGCCGCCA CGGTAAGTAC TGGCTGGGTC CTGCGCTTCC AGGAAGTTCA ATTTTTGCAT TAGAGTTTTG TGTTTGCGTC ATCGTGACGT CAAGATTTTC TCTACTGTAG TTTTGTGTAA AATCTCTACT GGCACGTGCA ATTTCGATGA CGTCCGCTAG TGCGTGGTAT AAACCCTAAT CGAAGCCGGT TTTGATCGAC GATGAAAAAG ATCGAGTATG ACCATGGTGG TCCTGTGGCG GTTGATCTTC AGCTGAAGCC CTCTAACGAC TGATTTCTTA CCTATCAATT TCTGTTTTTA CAGTGGGACC GTCTCCCGAG GACTCCAAGT CCATGTTGGC AACGATTGGC TTTGACTCGT TTGAAGGTCT CATCAAATCG ACCGTCCCGC CCAATATTCT GTCGCCCCGA GATCTCGCCT TGGAGCCAGC TCGTACCGAG TCGGAAGCGC TCCACCGTAT CAAGGAAATG GCCAAAAAGA ACAAGGTCAT GAAGTCTTAC ATCGGAGCCG GCTACTACGA TACACAGGTT CCTCCCGTCA TCCTCCGTAA CATGCTGGAA AACCCCGGCT GGTACACGGC TTACACTCCT TACCAGGCGG AAATTTCGCA GGGACGCTTG GAGATGTTGC TAAACTTTCA GACACTGGTT GTCGATCTGA CGGGATTGCC CATGGCAGTA GCATCATTAT TGGACGAAGC TACGGCGGCT GGTGAAGCTA TGCAGATGAC GTTTGCGCTC AAGGGGAAAA AGGGCAAGAA GAACAAGTTC TTTGTCTCGC AGGATGTCCA CCCGCAGACC ATTGGTCTTA TCCAAACTCG TGCCGAAGCC ATTGGAATTG AGGTGATTGT TGGCGAGCAC AGCAAATCTG ACTTTTCGGC TGGTGACTAC TGCGGCGCTA TGGTGCAGTA CCCGAACACC TACGGAGAGA TTGAGAGCGG AGGCGAGTCT TACGAAGCCT TCACTGCTCG CGCCCATGAA GGCAACGCCA TGGTTATCGC CGCCACTGAC TTGCTCGCTT TGACCAAGTT GGCCCCGCCC TCTACCTGGG GTGCAGATAT TGCCGTTGGA TCGGCCCAGC GTTTTGGTGT CCCCATGGGT TTTGGGGGCC CACACGCTGG TTTTCTCTCC ACATCGGACC AGTATAGCCG TAAAATGCCC GGCCGTATTA TCGGTGTTAC GGTCGACTCG TACGGAAAAC CCTGTTTGCG AATGGCCATG CAGACTCGGG AACAGCACAT TCGTCGAGAC AAGGCGACCT CCAATATTTG TACCGCCCAG GCTTTACTGG CCAACATGGC TGCGGCCTAC GCTATTTACC ATGGCCCAAA AGGTCTTGCC GACATCGCCG GACGTGTCCA CGCTCTGGCA GCTGTTGGGC ATCGCGAAAT TGGCAAGGCT GGTTTCAAGG TTACGGAAGG ACCTTTTTTT GACACGTTTA CCGTCGATGT TTCCTCCAAG GGCATGAACG CCACGGAAGT CCAGGCTGGT GCCGCTAGTG TTGGCGCTAA CGTGCGTGTC ATCGACGAAA AGCGTGTTGG AGTAGCCATG GGTGAGGGCA TCACTCGTGA CGATCTCGGA AAACTACTGT CTGGGGCTTT TAAGATTTCC AGTCCTGATT TGTCAGCAGA CGATTCCCTC TCCAACTTGG ATCCTGCCGT CGCCCGTGAA GGTGAAATCT TGACACACCC TATTTTCCGT CAGCACCACT CCGAAACGCA AATGCTCCGG TACCTCAAGA CACTCGAAAA CCGTGATTTG GCTCTGAACC ACTCGATGAT TTCGCTTGGC TCCTGTACCA TGAAGCTCAA CGCAACGAGC GAAATGATTC CCGTGACTTG GCCTGAATTT TCTGACATAC ACCCGTTTGC GCCGCACGAT CAAACTATTG GCTATCACGA AATGATTGAA GATTTAAACA AGGATTTGTC GGAAATCACA GGTTTCGCTG CGGTGAGTGC CCAGCCTAAC TCGGGTGCGA CTGGCGAATA CGCGGGTCTA CTGGCCATTA CAAAGTACCT AGAAAGCACC GGTGAAGGAC ACCGGAACGT TTGCTTAATT CCCAAATCAG CCCATGGGAC CAACCCGGCT AGTGCAGTCA TGGCTGGTAT GAAGGTTGTT GTCGTAGAGA ATGACGACCA GGGCAACGTG GATTTTGGTG ACTTGACCGC CAAGATTTCC AAGCACAAAG ACAACTTGGC CGCCTTCATG GTCACATATC CTTCCACGTT CGGTGTCTTT GAAGAAAGAA TCGTAGAAAT TTGCGACGCA ATCCACGATG CCGGTGGCCA AGTATACATG GACGGTGCTA ACATGAACGC ACAGGTTGGT CTGACGAGTC CGGGTTTGAT TGGCGCTGAC GTGTGCCATT TGAACTTGCA CAAGACCTTC TGCATTCCTC ACGGAGGTGG TGGCCCAGGT AGGTTTGAAT TGCATGCAGT TTGATTGCTC AAGGTTTTCG GGTATGACTG ACACTTTTGT TCTAGGTGTC GGTTCTATCG GAGTTCGTGA GCACCTTGCT CCTTTTTTGC CAGGGCACGT AATGGATCCT CAGGCGTCGG GAAAGCTTTG CGGAAACGAC ATTTGTGTAC CAAAGACTGA GGGTGCTGTA GCTGCAGCCC CTTTCGGTTC TGCCGCCATT CTCCCGATTT CTTGGATGTA CATTAAGATG CTTGGAGCTG AAGGTCTTAA AGCTGCCACC AGCCATGCTA TCTTGAACGC GAACTACATG GCTGCGCGTA TGAATGGAGC ATATGATGTC CTCTTTACTG GAAAAAACGG ACAGTGTGCG CACGAATTTA TTTTGGATCT TCGTCCCTTG AAAGCCGCAA CCGGTGTAAC GGAAGAGGAC GTTGCTAAGC GTCTTCAAGA TTACGGATTC CACTCTCCTA CCATGTCTTG GCCCGTTGCC GGAACTCTCA TGATTGAGCC GACTGAGTCG GAAGATTTGG GGGAACTTGA TCGCTTTTGC GACGCCATGC TTTCCATTCG GGCAGAAATT GATGATATTG GCAGTGGCCG TATTGCTTTG GAGGATTCGC CATTGCACTA CGCCCCGCAC ACAATGAATG ACCTGGTCAA TGAGAAGTGG GATCGTCCGT ACTCGAAGGA GGTTGGCATT TACCCCGCTC CTTGGATCCG CGCCAACAAG TTCTGGCCCA GCTGTGGCCG TGTCGATAAT GTCTACGGGG ACCGAAACCT GGTATGCACA TGTCCCCCAC TGGATGTATA TGAAGACGAT GATGACAAGA AGCTTGCCGC TTAA
|
Protein sequence | MIRRSIALRR ILAREKCSRA FHASAVFADA LDMKDTFARR HVGPSPEDSK SMLATIGFDS FEGLIKSTVP PNILSPRDLA LEPARTESEA LHRIKEMAKK NKVMKSYIGA GYYDTQVPPV ILRNMLENPG WYTAYTPYQA EISQGRLEML LNFQTLVVDL TGLPMAVASL LDEATAAGEA MQMTFALKGK KGKKNKFFVS QDVHPQTIGL IQTRAEAIGI EVIVGEHSKS DFSAGDYCGA MVQYPNTYGE IESGGESYEA FTARAHEGNA MVIAATDLLA LTKLAPPSTW GADIAVGSAQ RFGVPMGFGG PHAGFLSTSD QYSRKMPGRI IGVTVDSYGK PCLRMAMQTR EQHIRRDKAT SNICTAQALL ANMAAAYAIY HGPKGLADIA GRVHALAAVG HREIGKAGFK VTEGPFFDTF TVDVSSKGMN ATEVQAGAAS VGANVRVIDE KRVGVAMGEG ITRDDLGKLL SGAFKISSPD LSADDSLSNL DPAVAREGEI LTHPIFRQHH SETQMLRYLK TLENRDLALN HSMISLGSCT MKLNATSEMI PVTWPEFSDI HPFAPHDQTI GYHEMIEDLN KDLSEITGFA AVSAQPNSGA TGEYAGLLAI TKYLESTGEG HRNVCLIPKS AHGTNPASAV MAGMKVVVVE NDDQGNVDFG DLTAKISKHK DNLAAFMVTY PSTFGVFEER IVEICDAIHD AGGQVYMDGA NMNAQVGLTS PGLIGADVCH LNLHKTFCIP HGGGGPGVGS IGVREHLAPF LPGHVMDPQA SGKLCGNDIC VPKTEGAVAA APFGSAAILP ISWMYIKMLG AEGLKAATSH AILNANYMAA RMNGAYDVLF TGKNGQCAHE FILDLRPLKA ATGVTEEDVA KRLQDYGFHS PTMSWPVAGT LMIEPTESED LGELDRFCDA MLSIRAEIDD IGSGRIALED SPLHYAPHTM NDLVNEKWDR PYSKEVGIYP APWIRANKFW PSCGRVDNVY GDRNLVCTCP PLDVYEDDDD KKLAA
|
| |