Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_22113 |
Symbol | |
ID | 7203242 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 726456 |
End bp | 729426 |
Gene Length | 2971 bp |
Protein Length | 870 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | bifunctional aspartokinase |
Protein accession | XP_002182284 |
Protein GI | 219123963 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCACACCA TTCCACCCTG CAGTGATAGC GTCGTCACCC ACGACACACG TATTACACAC AGAACCCACA CTCCCCGCTA GTATTCCCAT GACGCCGGCA CCGACAATGG CGGCGACGAC AACGCCCGGT GAAGGGGACT CTGATGCGGT GCACACGGGA TTTGCGGAGT TGCCCTGGCA GGTGCACAAG TTCGGGGGCA CTTCCGTCGC GACGGCGGAA TGTTTCCTCG CCGTCGCCCG TGTTCTGGAG CACGAGTTGG AGATTGACCC CGTACATTCT TGCATCGCTA TTGTAGTCTC TGCCGTGGGG GGAAAGCCCA AGGTTACCGA CTTGCTCCTG GACACCGTCA AGGCGGCGGC GCAGCGGGAT GCGCAGGGCG TCGACAGCCT CCTAGAGGTT GTTCTCCAAA AGCACCACGA CTGCCTCGCC GCGCTCTTCG TCAAGGAACC TAGCGAACGC GATCGACTCA TGGAGATCAT TCAGGGAGAT ATTATTGACA TTCGGGATAT TCTCAAAACG GTCGCCCTCA TGAAATGGCA AGCCGCCCGC ATCTCCGAAC TCGTTTCGGG ATACGGCGAG CTCTGGTCGG CACAAATTCT TACCGCTTTG ATGCGACTGC GTGTGCAAAC GAACGTGGAA GCTCTACGGA ACTCATCCAC ACCGACTCTG GAAGACGACG GACTCGCCCA TTCACATCGG AGAGAGTACG TCTACCTCGA CGCGCGGAGG GTCATTACGG TTGACGAAGA AGCCATCCAC AACGGCGCCG TCGTTTGGGA ACTCTCGCAA GAGAAACTGG ACCGAGTCTA TCGAGAGGAA TTACAGAAAC TCCCCCAAGC CGCGATTTTG CATTTCGTTA TTACTGGATA CGTCGCCAGC AACACGGAAG GAGTCGCCAC AACCTTACAA CGTGACGGTT CCGATTACTC CGCCGCCATT ATGGGACGAC TTCTGCAGGC ACACAAAATC ACCATTTGGA CCGATGTGGA CGGGGTACTC TCGGCCGATC CTCGACGTGT CCCCCTGGCG CAAATCCTGC CCGAAGTCTC CTACACCGAA GCCATGGAAT TGGCCTACTT TGGCGCCAAG GTCATTCACC CCAAAACCAT GCAGCCCGCT ATTTCCAGCT CACCACAGAT TCCTATTTTC ATTCGAAATA CCTTCAATCC GAGTTTCCGA GGTACCCGCA TTTACGCACC CGGTCTCAAC AAGGACAAGG ACAAGGTGGT GACGGGATTT AGCAGTGTCG AAGATATGGC GCTCCTCAAC GTGGAAGGAT CCGGACTCGT CGGTGTCTTG GGCGTCGATC GACGGCTCTT TGCCACACTC GAACGGATCG GCGTCAACGT CGTCCTTATT TCACAAGGCT CCTCCGAACA CTCCGTCACC TTTGCCACCA AGGAGAGCCA AGCCAACAGC GCCAAACTCG CCATTGAAGA AGAATTCCGC CGCGAATTGC TCCAGCACCG CATTTCTAAA ATCGAAATCC AGGCGCCCTG CTCCATTCTT GCTGCCGTCG GTGACAACAT GGCGCTCACG ACAGGCGTAG CCGGCCGCTT CTTTTCTGCT CTCGGCGACG CTAAAATTAA CATTCTCGCC ATTGCCCAGG GTTCCTCGGA ACGCAACATC TCCGCCGTCG TACTCACTTC GGAATCGTCC CGGGCCCTGC GTGCCGTCCA CGCGGCCTTT CGCCTCTCAC ATTCCACCGT ACGTGTCGCC ATTGTCGGTA TGAATGAATT GGGAATTTCG CTACTCAAGT TGCTGGAATC GCAAAGATCA TCCCTTCGAT CGACTTTCGA TATCGATCTA CAAGTGTGCA CTATTTTGTC CGACAGCACC AGTCCGCAGC TTATCACATT GCTCAACGAC CGCGATGGCG GCGCAGAATC TATCACGATG AACGGCTTCA ACCGGGCCAG TGGTGGTTCC AATTCTCTAT TGCTCGGGGC TCCGGCGACA AATGCTTCGC AGACGTCCTT TCGCGACGAC GAAACGACTT TTCTGGAAGA CGGCGGCCGG GACGTCCTGC TCAACCGCCT CATTCGCAAC GAATGCCCGA GTCACGTGAT TTTCGATTGT ACCAACGACG AAGAATTGGG TCAGTCGCAT GCCGCATGGT TGCGCGCCGG CGTCAACGTC GTGACAGCCA ACAACACGGG TATTTCGGGA CCTGCGGCGC AGCGCGAAGA AATCGCGGAA GCCGAACGAG CCCAAGGAAA GAATGGCGCC AAGTACCTAC GCGAAGTTAC CGTTGGTGGC GGCTTGCCAG TTATTAATAC GCTACGGTCA CTACTCCATT CGGGAGACAA GATTCGTCGT ATCGACGGTA TCTTGTCCGT GAGTCTCTCG TTCATTATGT TTCGCATTTC TCCGGCGACT GATATTGCGA AATGCAGTGA ATTTGACCAA ATGTCGAGTA AGGGTGCCTT TCACGAGGAC CGGTCCATGT CACCCACTGC AACTTTGACA AAAGCGTGCA GTTTTAGTCA AGCCGTGAAA GAAGCCATCG CGCTCGGATT GATGGAAGAA GACCCGACCA AGGATTTGAA CAACGAATAC ACGTCTCGTG TGCTCATGGT GCTGGCGAAA GAATTGAATA TGGACAAAGG TGTGGAAGTG AGCGATATTC GCGATTCCAG CGACAAGCTT TTGGAATTGA TTTGCGGCGA GACGGTGGAT TACACCAAGT TTTCGCCCGC AGTGGATGAG CTAGTACAGG CACGGGTGGA TGCCGCTAAA TCTCGCGGTT GTGTGCTCCG TCACATTGCA AGCGTGGATG TCAAGGCCAA GGAGCTCTCC ATTAAAGTTG TAGAAGTTCC GGAACACCAT GTTCTAGCCG TGACACCCCC GAGTTGCGAA TGTGTGCGGT TCTTCACCCA CCGTCACCAG CGCTACCCTC TAATTGTGCA AGGACCTAGT GCTGGTGCCG ACTCGACCGC AAGTGCTCTC TTAGCCGAAC TACTACAGCT T
|
Protein sequence | MTPAPTMAAT TTPGEGDSDA VHTGFAELPW QVHKFGGTSV ATAECFLAVA RVLEHELEID PVHSCIAIVV SAVGGKPKVT DLLLDTVKAA AQRDAQGVDS LLEVVLQKHH DCLAALFVKE PSERDRLMEI IQGDIIDIRD ILKTVALMKW QAARISELVS GYGELWSAQI LTALMRLQYV YLDARRVITV DEEAIHNGAV VWELSQEKLD RVYREELQKL PQAAILHFVI TGYVASNTEG VATTLQRDGS DYSAAIMGRL LQAHKITIWT DVDGVLSADP RRVPLAQILP EVSYTEAMEL AYFGAKVIHP KTMQPAISSS PQIPIFIRNT FNPSFRGTRI YAPGLNKDKD KVVTGFSSVE DMALLNVEGS GLVGVLGVDR RLFATLERIG VNVVLISQGS SEHSVTFATK ESQANSAKLA IEEEFRRELL QHRISKIEIQ APCSILAAVG DNMALTTGVA GRFFSALGDA KINILAIAQG SSERNISAVV LTSESSRALR AVHAAFRLSH STVRVAIVGM NELGISLLKL LESQRSSLRS TFDIDLQVCT IFGGSNSLLL GAPATNASQT SFRDDETTFL EDGGRDVLLN RLIRNECPSH VIFDCTNDEE LGQSHAAWLR AGVNVVTANN TGISGPAAQR EEIAEAERAQ GKNGAKYLRE VTVGGGLPVI NTLRSLLHSG DKIRRIDGIL SVSLSFIMFR ISPATDIAKC TVKEAIALGL MEEDPTKDLN NEYTSRVLMV LAKELNMDKG VEVSDIRDSS DKLLELICGE TVDYTKFSPA VDELVQARVD AAKSRGCVLR HIASVDVKAK ELSIKVVEVP EHHVLAVTPP SCECVRFFTH RHQRYPLIVQ GPSAGADSTA SALLAELLQL
|
| |