Gene Information |
Locus tag | PHATRDRAFT_56477 |
Symbol | GDCT_1 |
ID | 7201643 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 173615 |
End bp | 176516 |
Gene Length | 2902 bp |
Protein Length | 854 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | glycine decarboxylase t-protein |
Protein accession | XP_002180958 |
Protein GI | 219120440 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
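Note on the coordinates above: the listed gene length matches the start and end positions if both are treated as 1-based and inclusive, which is an assumption here since the record does not state its coordinate convention. A minimal sketch of that arithmetic follows; the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Values copied from the record: Start bp 173615, End bp 176516.
length = gene_length(173615, 176516)
print(length)  # 2902, the value listed under "Gene Length"
```

The strand ("-") does not affect the length, only the direction in which the sequence is read.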
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.645348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAACAGTTGG CCGAAGACAC CATGAGAATT CTTCGAAAGA ATCATAGTCA TCGCTGCACA TCTGGCAAGA AAAGAATAAC TTACCGAAAG CACTGATCAT CCGCGTAAAG AACATACCGC AAATACTTGT AGCCACGTTT CCCAATCTTT CGACGGCCTA CTGTCACCGA AGTATGGGAA ACAGGACGTG CTAGCGAAAA TGAAAAGGTT GTCTTGTGTG CGGGGTCTTC GCCTACGGAA AGGTCGCGTG CATTTAAGAT GTGCATCGTC GCAGACAGTC CCGGAACGAG CAAACGTAGT GGTGGTGGGA GGTGGGATCA TCGGAACTTC GGTTGCCTAC CACTTAGCCA AGGCCGGCGT CGAAGATGTT CTCCTCTTGG AAAGAGACCG TCTGACTTCT GGAACAACGT GGCATGCCGC CGGTTTGATG AATAGTTTCG GATCCATGTC GTCGACATCT ACATGGTCCC GTCAATATAC GCAAGAACTG TATCGAGATA TTCTGCCTAC GGAGACTGGT TTGGAAACCG GATATATGGG TATTGGATTC ATTGAGCTGG CATGCGATGC GGATCGACTG GAAGCCTTTC GAAGAATAGC TGCCTTCAAC CGATTTCTAG GGGTAGACGT AGCAGAAATT TCACCCGAGC AAGTCAAAGA CCTATTCCCT CTCTGTGAGA CGTCGGACGT CCTCTCAGGC TTTTGGGTTG AGAACGATGG ACGAGCCAAC CCTACTGATG CGACGATGGC ATTGGCAAAA GGGGCGCGTT TGCATGGTGC CAATATTATT GAACAGTGTC ACGTAGCCGG TGTGACGACC TCGAAACCAA ATGGTAATTA TCGTGCCAAG GTAACAGGTG TCCGACTTGA AAATGAAACC GTAATTGCAG CGAATATCGT CGTTAATTGT GCTGGTATGT GGGCACGGCA GTTTGGGGAG GCCTGTGGAG TGTATAACAT TCCGAATCAA GCAGCTGAAC ACTACTATTT GATTACTGAG CCTATGAAGG AAATTGATCC TTCTTGGCCT GTCATAGAAG ACTCTTCGAA ATGCGTTTAC ATCAGACCGG AAGGGAAGGG GCTAATGCTG GGCTTTTTTG AATGGGAGGG AGCGGCATGG AAGCCCGAAG GGGTCCCTTT GGATTTTAGC TTTGGTGAGC TTGATCCAGA TTGGGATCGC ATGATGCCAT ACGTGGAGCA AGCCATGAAG CGGGTCCCGG CCGCTGAAAA TGTTGGTGTC AAAGCGCTTT TTTGTGGGCC GGAATCATTT ACCCCAGACA ACCGTCCCAT TGTAGGGGAA TCACCGGAGC TTCGCAATTA TTACATCGCT GCGGGGCTGA ACTCCATTGG AATTTTAACG GGAGGGGGTA TTGGGAAAAT TTTAGCTCAA TGGATACAGC AGGGATGCTC ACCCCATGAT GTCGACGTTA CTGCCATTGA TGCAAGTCGA TTCCAACGGT ATCAAAGTAA CATAACATAC CGAAATGACC GTACCGGTGA GGCGCTGGGA AATACTTACA AGGTTCATTA TCCAGACCAT CAACCAACGA CGTGTCGAAA CGCGAAGCAA TCTGTTCTGC ACGAGCGATT GGTAAACGCC AATGCATTTT TCCAGGAGAC CAGCGGTTGG GAATCTCCAT CCTGGTACGC TCCCCATGGA ACCAATCCAA AGGTCGAGAC TGAGAGTTTT GGCAGAGAAA ACTGGTTCCT ACACTGGGAG GCAGAACATA TCAGCTGCCG AAATAATGTT GCCCTGTTCG ATATGAGCTT CATGAGCAAG TTTCATGTAC AAGGAAATGA 
TGCAGGGAAG TTTCTCAATC GTCTGTCTAC AGCCAACGTA GACGGTGATT GGGGTATGAT AACATATACA CAGTGGCTCG ATGAACAGGG GTATATGGCG GCGGATTTGA CGATAACTAA AATGGCAGAG AATCACTTTA TGGTGGTAGC AACAGACACA ATGCTCAACA AAGTCTACAG TCATATGCTC GATCGACTGG TGCACGGAGA GCACGTTTTT GTAACTGACG TAACAGGTCG CTACGCGCAA CTTAATTTGC AGGGTCCACG ATCGAGAGAG TTACTGCAAG GCCTGACTTC TGTCGATCTG AACAACTTTG CTTTCCGTAG AGCAGAAGAG ATTGACATAG GCTTAGCGCG AGTTCTTTGT ATTCGAATTA CCTATGTCGG AGAGCTGGGA TACGAACTTT TTGTTCCAGT AGAACAAGCG AGGCACGTTT ACGATTGTAT CGTTGAATTA GGCCGGGAAT TTTCCCTCTC TCACGCGGGT CTCAAAGCTC TGGGAAGTCT AAGAATGGTA CGTCTTGTGG TTTGTTGGAA ATATCGATGG TTATCGTTAG TTCCTTACGC TTTCTTCATA CCACCTCAGG AAAAGGGATA TCGGGATTAC GGACACGATA TGGACAACAC GGATAGACTT CTGGACTGTG GGTTGGGATT CACCTGCGAT TTTGAGAAAG AAGGTGGCTT CATCGGCCAG AAGCACGTCC TTGCACAAAA GGATGCTGCG AAAGAGCGAG GAGGTTTATT GAAGCGAATT GTGAATGTTT TAGTCTTAGA CCCTGCACCT CTATTGCATC ATGGTGAAAT CCTTTGGAAG GACGGAAGGC GTATATCTGA TATTCGAGCT GCATCTTACG GACACACTGT CGGGGGCGCC GTGGGTTTGA GCATGCTTAC GCGTGATATT CCCGTAAAGA AAAATTGGTT GGATGGCAGC GACTGGGAGG TTGAAGTTGG CAGTCGAAAG CATCCTTGTA GGTTGTCGAT ACGCCCGATG TACGATCCCG CTAGCGTTCG CGTAAAAGAT GCGTAAATGT CATCACTTTT AAATGTAGTG AAGCGGAATA AATCGATGTC TAATTTTGCT TAGTTGATCC AA
|
Protein sequence | MKRLSCVRGL RLRKGRVHLR CASSQTVPER ANVVVVGGGI IGTSVAYHLA KAGVEDVLLL ERDRLTSGTT WHAAGLMNSF GSMSSTSTWS RQYTQELYRD ILPTETGLET GYMGIGFIEL ACDADRLEAF RRIAAFNRFL GVDVAEISPE QVKDLFPLCE TSDVLSGFWV ENDGRANPTD ATMALAKGAR LHGANIIEQC HVAGVTTSKP NGNYRAKVTG VRLENETVIA ANIVVNCAGM WARQFGEACG VYNIPNQAAE HYYLITEPMK EIDPSWPVIE DSSKCVYIRP EGKGLMLGFF EWEGAAWKPE GVPLDFSFGE LDPDWDRMMP YVEQAMKRVP AAENVGVKAL FCGPESFTPD NRPIVGESPE LRNYYIAAGL NSIGILTGGG IGKILAQWIQ QGCSPHDVDV TAIDASRFQR YQSNITYRND RTGEALGNTY KVHYPDHQPT TCRNAKQSVL HERLVNANAF FQETSGWESP SWYAPHGTNP KVETESFGRE NWFLHWEAEH ISCRNNVALF DMSFMSKFHV QGNDAGKFLN RLSTANVDGD WGMITYTQWL DEQGYMAADL TITKMAENHF MVVATDTMLN KVYSHMLDRL VHGEHVFVTD VTGRYAQLNL QGPRSRELLQ GLTSVDLNNF AFRRAEEIDI GLARVLCIRI TYVGELGYEL FVPVEQARHV YDCIVELGRE FSLSHAGLKA LGSLRMEKGY RDYGHDMDNT DRLLDCGLGF TCDFEKEGGF IGQKHVLAQK DAAKERGGLL KRIVNVLVLD PAPLLHHGEI LWKDGRRISD IRAASYGHTV GGAVGLSMLT RDIPVKKNWL DGSDWEVEVG SRKHPCRLSI RPMYDPASVR VKDA
|
| |
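Note on the sequence fields: both sequences are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that display formatting and compute a GC fraction comparable to the "GC content" field. The helper names `clean_sequence` and `gc_content` are hypothetical, and the record does not say how its own percentage was calculated or rounded, so an exact match with the listed value is not guaranteed.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two display blocks of the gene sequence, used only as a short demo.
demo = "AAACAGTTGG CCGAAGACAC"
print(clean_sequence(demo))  # AAACAGTTGGCCGAAGACAC
print(gc_content(demo))      # 0.5
```

To check the full record, paste the complete "Gene sequence" value into `gc_content` and compare `len(clean_sequence(...))` with the listed gene length.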