Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_55086 |
Symbol | P5CS |
ID | 7198171 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 443526 |
End bp | 446454 |
Gene Length | 2929 bp |
Protein Length | 747 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | delta l-pyrroline-5-carboxylate synthetase |
Protein accession | XP_002184463 |
Protein GI | 219128528 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATCGACCCG GATATCCTTG GCTTTCTTTG AAGTCGTCGT ACCAATTCAT CGGTCGCTAG ACACAACTTT CCCAACAGTA ACAACAAGAG TAACTACACA CGATACTCAT ACAACAACAA CAATCATATA ATCACGCAAG AATGATGCTC AAGATGCAAG CAACCGACTT TGGCAATTCC CGCAACGAGT GTCGTTCGCG TGCGGCGTTG CGCATGGCGC GACGCGTCGT CATCAAGGCC GGCACTTCGG TGGTGGCGAA CGACGACGGG TCACCGTCGC TCACGCGTCT CGGCGCCATT GTGGAGCAGA TTGCGGCACT GAATCAAGCC GGCGTCGAAG TCATCTTTGT GAGTTCCGGA GCCGTCGGCA TGGGCAAGAA TCTCTTGCGC AAACAGGCAC GCCTCAACAT GAGCTTCAAG GATCTGCACA AGTCCGACAG TCACCACGCG CACTTGGGGA GTGGACTGCA CGCCGTCACG GAAGACACGC ACGACACCAT TCTGCCGGAT TCGCATTCCG CACACGCACA ACGACTCACC CCGACAACCT CCATCTCCGA TCTACTCACC ATTAACGAAC ACATGGAAAC ATCCGCCGCA AAGTCCTACG CGGCGGCGGG ACAATTCGAA CTCATGAATC TCTACCAGAG TCTCTTCTCG CCCAAGGGCG TCACGGCGTC GCAAATGTTG CTCACACAGG CGGATTTCGG GGACGCACAT CATCTGCAGA ACCTACGCTA CGCCGTGGAT CGACTCCTCT CGCTCGGTAT TGTCCCCATT GTCAACGAAA ACGATGCCGT ATCCGTACTG CCCGGAACCA CGCAAGAAGC CGAACCCGTC TTTTCCGACA ACGACAGTCT CGCGGCGCTC TGCGCCCGCA CCTTCGGAGC CGAAGTACTC CTCCTGCTCA CCGATGTGGA CGGTGTCTAC ACGCTCCCGC CCACACACCC CAAGGCCAAA CTCATACCCT TTTATCACGC CAACACGGAA CAAGTCGGAA TCGGCGTCAA GAGTGCCCAA GGTCGCGGCG GGATGGCCGC CAAAATTGAT GCCGCCCGGA ACGCCGTCGC ACCCGGATCG GCCTGTACCG CCTGCGTTAT TTGCGCCGGG AGTGACCTCG ACGTCGTCCG TGCCGTACTC AGTAAGGACT GGGACGGGGC CGAAACCGGC AAGGGCACTC TCTTTGCCAC CCCCGGTAGT GACCTCGAAA CCCAAGCCCT CGATGAACTC GTTCATTCCG TCGAAGTAAG TTGACGTGTG CTTGGTTGGT TGTCGTGGGA CTGTTGTAGT CGTACAATTG TTGTTGGGTC ATCTGTCGTG CGAATGCTTG ATTATTCTTG TCAACCGCCC ACTCACATAT TCTTACACTG GACAGGACGA AGAATCGTGT TCGGATGGGG CCCGCGATAT GGCCTTGCGG GCTCGCCGAG AAGCCCGTAA ACTCCAGGCA CTCCCGGCCG AAGTTCGTCG GAATATACTC AACGCCGTTG CCGACGCCCT GCTGACCCGC AAGGACGAAA TCGTACAGGC CAACCAACTT GATTTGGCCA ACGCCGCCCA AGACAAAGTG GCGGGACCTC TCGTCAAGAG ACTCGGCTTG ACGGACGAAA AACTGGCCAC GCTTGTGACC GGTATTCGCC AACTCGCCAC ACTCCCAGAT CCCCTCGGCA ACGTCAAGGC CAAACGAGAA CTGGCCGACG GACTCGAATT GTCCCTCACC ACCGTCCCCA TTGGCGTACT CCTCATTATC TTTGAATCTC GACCGGATAG CCTACCCCAA ATTGCCTCGC TCGCACTAGC CTCCAGCAAT GGGCTCTTGC TCAAGGGCGG CAAGGAGGCC TACCATTCCA ACACGGCCTT GCACGACGTT ATTGGAGATG CCATTGAAAA AGGCAGTGAA GGACTCATTT CCAAAGACAT TATTGGTCTC GTCACGTCTC GTGGTCAAGT CAATGATCTA CTCTCCCTCG ACGACACGAT CGATTTGGTC ATTCCTCGGG GTTCCAACGC CCTGGTCAGC TACATTCAGG AGAATACCAA GATTCCGGTT TTGGGACACG CCGACGGCGT GTGTCACGTC TATATTGACG CCAGTGCGTC GGCAGAAGCC GCGTGTCGGA TTGCCGTCGA CGCCAAGACG GACTATCCCT CCGCCTGCAA CGCCATGGAA ACCTTGCTTT TACACGCCGA CACGGTCGAA AACGGCGTCG CCTTGAATGT CTTACAGGCC CTCCGCTCGG CCGGCGTCAA GTGCCTCGGT GGCCCCAAAG CCATGAAGGC TGGATTAAGC GACGTGGCCA CCCAAAAGAC CAAGCACGAG TACGGCGATT TGACGTGCCT GGTCGAAATT GTCGACAGCA TAGACGAGGC TATTGACTGG ATCCACGAAA ACGGTAGTGG CCACACCGAA GCGATTGTGT GTGCGCCGGA CTCGCCCGCC GGAGAAATTT TTCTACAAAA GGTCGACGCC GCTTGCGTCT TTTGCAATAC CTCGACCCGT TTCGCCGACG GTTACCGGCT CGGGCTCGGC GCCGAAGTGG GTATTTCCAC CGGTCGGATT CATGCGCGGG GACCCGTCGG AGTCGAAGGA CTTTTGACGA CCAAGTGGCA GTTGCGTAGT CTGTCGGGAG AATCTTACCT GGCGACCGAC TTTGCCGGGG CCGACCCGAA ACGGACGTTC ACCCACAAGA ATTTGCTGTA AGCTCAGGTT ATGGGTGGTA GTGGCTGGTA TCTCTGTTGG GCCTCGGTAC GTACGGACTT TTTTGGTTGG GATACGATAA ACGCCCTCGA ACTACGGTGT GATTGTGTGT AGCTCATTGA CTGTGAGTAG ACGATGGCAC TCTCGGCAGT TTTGCATCCG CACACATATC CACGCCACCT CTCGTTTTAG ATTAATCTCA TTGTTCTATT AAAACAAACT GGACAATGT
|
Protein sequence | MMLKMQATDF GNSRNECRSR AALRMARRVV IKAGTSVVAN DDGSPSLTRL GAIVEQIAAL NQAGVEVIFV SSGAVGMGKN LLRKQARLNM SFKDLHKSDS HHAHLGSGLH ASYAAAGQFE LMNLYQSLFS PKGVTASQML LTQADFGDAH HLQNLRYAVD RLLSLGIVPI VNENDAVSVL PGTTQEAEPV FSDNDSLAAL CARTFGAEVL LLLTDVDGVY TLPPTHPKAK LIPFYHANTE QVGIGVKSAQ GRGGMAAKID AARNAVAPGS ACTACVICAG SDLDVVRAVL SKDWDGAETG KGTLFATPES CSDGARDMAL RARREARKLQ ALPAEVRRNI LNAVADALLT RKDEIVQANQ LDLANAAQDK VAGPLVKRLG LTDEKLATLV TGIRQLATLP DPLGNVKAKR ELADGLELSL TTVPIGVLLI IFESRPDSLP QIASLALASS NGLLLKGGKE AYHSNTALHD VIGDAIEKGS EGLISKDIIG LVTSRGQVND LLSLDDTIDL VIPRGSNALV SYIQENTKIP VLGHADGVCH VYIDASASAE AACRIAVDAK TDYPSACNAM ETLLLHADTV ENGVALNVLQ ALRSAGVKCL GGPKAMKAGL SDVATQKTKH EYGDLTCLVE IVDSIDEAID WIHENGSGHT EAIVCAPDSP AGEIFLQKVD AACVFCNTST RFADGYRLGL GAEVGISTGR IHARGPVGVE GLLTTKWQLR SLSGESYLAT DFAGADPKRT FTHKNLL
|
| |