Gene PHATRDRAFT_22187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_22187 
SymbolGDCP 
ID7203267 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp205248 
End bp208721 
Gene Length3474 bp 
Protein Length1005 aa 
Translation table 
GC content52% 
IMG OID 
Productglycine decarboxylase p- protein 
Protein accessionXP_002182636 
Protein GI219124701 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCAGTTGGT TTTTTTGCCG TTCCGGAACG ACAAATAAAC GAAGACCCAA AGTCGATCTC 
CCAGGACGTG CCAATCATCT TTTTTTATCT CCGCCACCAT GATTCGACGT TCTATTGCTC
TTCGTCGAAT TTTGGCACGC GAAAAGTGTA GCCGAGCCTT TCACGCGTCG GCGGTTTTCG
CGGACGCCTT GGACATGAAA GACACCTTTG CCCGCCGCCA CGGTAAGTAC TGGCTGGGTC
CTGCGCTTCC AGGAAGTTCA ATTTTTGCAT TAGAGTTTTG TGTTTGCGTC ATCGTGACGT
CAAGATTTTC TCTACTGTAG TTTTGTGTAA AATCTCTACT GGCACGTGCA ATTTCGATGA
CGTCCGCTAG TGCGTGGTAT AAACCCTAAT CGAAGCCGGT TTTGATCGAC GATGAAAAAG
ATCGAGTATG ACCATGGTGG TCCTGTGGCG GTTGATCTTC AGCTGAAGCC CTCTAACGAC
TGATTTCTTA CCTATCAATT TCTGTTTTTA CAGTGGGACC GTCTCCCGAG GACTCCAAGT
CCATGTTGGC AACGATTGGC TTTGACTCGT TTGAAGGTCT CATCAAATCG ACCGTCCCGC
CCAATATTCT GTCGCCCCGA GATCTCGCCT TGGAGCCAGC TCGTACCGAG TCGGAAGCGC
TCCACCGTAT CAAGGAAATG GCCAAAAAGA ACAAGGTCAT GAAGTCTTAC ATCGGAGCCG
GCTACTACGA TACACAGGTT CCTCCCGTCA TCCTCCGTAA CATGCTGGAA AACCCCGGCT
GGTACACGGC TTACACTCCT TACCAGGCGG AAATTTCGCA GGGACGCTTG GAGATGTTGC
TAAACTTTCA GACACTGGTT GTCGATCTGA CGGGATTGCC CATGGCAGTA GCATCATTAT
TGGACGAAGC TACGGCGGCT GGTGAAGCTA TGCAGATGAC GTTTGCGCTC AAGGGGAAAA
AGGGCAAGAA GAACAAGTTC TTTGTCTCGC AGGATGTCCA CCCGCAGACC ATTGGTCTTA
TCCAAACTCG TGCCGAAGCC ATTGGAATTG AGGTGATTGT TGGCGAGCAC AGCAAATCTG
ACTTTTCGGC TGGTGACTAC TGCGGCGCTA TGGTGCAGTA CCCGAACACC TACGGAGAGA
TTGAGAGCGG AGGCGAGTCT TACGAAGCCT TCACTGCTCG CGCCCATGAA GGCAACGCCA
TGGTTATCGC CGCCACTGAC TTGCTCGCTT TGACCAAGTT GGCCCCGCCC TCTACCTGGG
GTGCAGATAT TGCCGTTGGA TCGGCCCAGC GTTTTGGTGT CCCCATGGGT TTTGGGGGCC
CACACGCTGG TTTTCTCTCC ACATCGGACC AGTATAGCCG TAAAATGCCC GGCCGTATTA
TCGGTGTTAC GGTCGACTCG TACGGAAAAC CCTGTTTGCG AATGGCCATG CAGACTCGGG
AACAGCACAT TCGTCGAGAC AAGGCGACCT CCAATATTTG TACCGCCCAG GCTTTACTGG
CCAACATGGC TGCGGCCTAC GCTATTTACC ATGGCCCAAA AGGTCTTGCC GACATCGCCG
GACGTGTCCA CGCTCTGGCA GCTGTTGGGC ATCGCGAAAT TGGCAAGGCT GGTTTCAAGG
TTACGGAAGG ACCTTTTTTT GACACGTTTA CCGTCGATGT TTCCTCCAAG GGCATGAACG
CCACGGAAGT CCAGGCTGGT GCCGCTAGTG TTGGCGCTAA CGTGCGTGTC ATCGACGAAA
AGCGTGTTGG AGTAGCCATG GGTGAGGGCA TCACTCGTGA CGATCTCGGA AAACTACTGT
CTGGGGCTTT TAAGATTTCC AGTCCTGATT TGTCAGCAGA CGATTCCCTC TCCAACTTGG
ATCCTGCCGT CGCCCGTGAA GGTGAAATCT TGACACACCC TATTTTCCGT CAGCACCACT
CCGAAACGCA AATGCTCCGG TACCTCAAGA CACTCGAAAA CCGTGATTTG GCTCTGAACC
ACTCGATGAT TTCGCTTGGC TCCTGTACCA TGAAGCTCAA CGCAACGAGC GAAATGATTC
CCGTGACTTG GCCTGAATTT TCTGACATAC ACCCGTTTGC GCCGCACGAT CAAACTATTG
GCTATCACGA AATGATTGAA GATTTAAACA AGGATTTGTC GGAAATCACA GGTTTCGCTG
CGGTGAGTGC CCAGCCTAAC TCGGGTGCGA CTGGCGAATA CGCGGGTCTA CTGGCCATTA
CAAAGTACCT AGAAAGCACC GGTGAAGGAC ACCGGAACGT TTGCTTAATT CCCAAATCAG
CCCATGGGAC CAACCCGGCT AGTGCAGTCA TGGCTGGTAT GAAGGTTGTT GTCGTAGAGA
ATGACGACCA GGGCAACGTG GATTTTGGTG ACTTGACCGC CAAGATTTCC AAGCACAAAG
ACAACTTGGC CGCCTTCATG GTCACATATC CTTCCACGTT CGGTGTCTTT GAAGAAAGAA
TCGTAGAAAT TTGCGACGCA ATCCACGATG CCGGTGGCCA AGTATACATG GACGGTGCTA
ACATGAACGC ACAGGTTGGT CTGACGAGTC CGGGTTTGAT TGGCGCTGAC GTGTGCCATT
TGAACTTGCA CAAGACCTTC TGCATTCCTC ACGGAGGTGG TGGCCCAGGT AGGTTTGAAT
TGCATGCAGT TTGATTGCTC AAGGTTTTCG GGTATGACTG ACACTTTTGT TCTAGGTGTC
GGTTCTATCG GAGTTCGTGA GCACCTTGCT CCTTTTTTGC CAGGGCACGT AATGGATCCT
CAGGCGTCGG GAAAGCTTTG CGGAAACGAC ATTTGTGTAC CAAAGACTGA GGGTGCTGTA
GCTGCAGCCC CTTTCGGTTC TGCCGCCATT CTCCCGATTT CTTGGATGTA CATTAAGATG
CTTGGAGCTG AAGGTCTTAA AGCTGCCACC AGCCATGCTA TCTTGAACGC GAACTACATG
GCTGCGCGTA TGAATGGAGC ATATGATGTC CTCTTTACTG GAAAAAACGG ACAGTGTGCG
CACGAATTTA TTTTGGATCT TCGTCCCTTG AAAGCCGCAA CCGGTGTAAC GGAAGAGGAC
GTTGCTAAGC GTCTTCAAGA TTACGGATTC CACTCTCCTA CCATGTCTTG GCCCGTTGCC
GGAACTCTCA TGATTGAGCC GACTGAGTCG GAAGATTTGG GGGAACTTGA TCGCTTTTGC
GACGCCATGC TTTCCATTCG GGCAGAAATT GATGATATTG GCAGTGGCCG TATTGCTTTG
GAGGATTCGC CATTGCACTA CGCCCCGCAC ACAATGAATG ACCTGGTCAA TGAGAAGTGG
GATCGTCCGT ACTCGAAGGA GGTTGGCATT TACCCCGCTC CTTGGATCCG CGCCAACAAG
TTCTGGCCCA GCTGTGGCCG TGTCGATAAT GTCTACGGGG ACCGAAACCT GGTATGCACA
TGTCCCCCAC TGGATGTATA TGAAGACGAT GATGACAAGA AGCTTGCCGC TTAA
 
Protein sequence
MIRRSIALRR ILAREKCSRA FHASAVFADA LDMKDTFARR HVGPSPEDSK SMLATIGFDS 
FEGLIKSTVP PNILSPRDLA LEPARTESEA LHRIKEMAKK NKVMKSYIGA GYYDTQVPPV
ILRNMLENPG WYTAYTPYQA EISQGRLEML LNFQTLVVDL TGLPMAVASL LDEATAAGEA
MQMTFALKGK KGKKNKFFVS QDVHPQTIGL IQTRAEAIGI EVIVGEHSKS DFSAGDYCGA
MVQYPNTYGE IESGGESYEA FTARAHEGNA MVIAATDLLA LTKLAPPSTW GADIAVGSAQ
RFGVPMGFGG PHAGFLSTSD QYSRKMPGRI IGVTVDSYGK PCLRMAMQTR EQHIRRDKAT
SNICTAQALL ANMAAAYAIY HGPKGLADIA GRVHALAAVG HREIGKAGFK VTEGPFFDTF
TVDVSSKGMN ATEVQAGAAS VGANVRVIDE KRVGVAMGEG ITRDDLGKLL SGAFKISSPD
LSADDSLSNL DPAVAREGEI LTHPIFRQHH SETQMLRYLK TLENRDLALN HSMISLGSCT
MKLNATSEMI PVTWPEFSDI HPFAPHDQTI GYHEMIEDLN KDLSEITGFA AVSAQPNSGA
TGEYAGLLAI TKYLESTGEG HRNVCLIPKS AHGTNPASAV MAGMKVVVVE NDDQGNVDFG
DLTAKISKHK DNLAAFMVTY PSTFGVFEER IVEICDAIHD AGGQVYMDGA NMNAQVGLTS
PGLIGADVCH LNLHKTFCIP HGGGGPGVGS IGVREHLAPF LPGHVMDPQA SGKLCGNDIC
VPKTEGAVAA APFGSAAILP ISWMYIKMLG AEGLKAATSH AILNANYMAA RMNGAYDVLF
TGKNGQCAHE FILDLRPLKA ATGVTEEDVA KRLQDYGFHS PTMSWPVAGT LMIEPTESED
LGELDRFCDA MLSIRAEIDD IGSGRIALED SPLHYAPHTM NDLVNEKWDR PYSKEVGIYP
APWIRANKFW PSCGRVDNVY GDRNLVCTCP PLDVYEDDDD KKLAA