Gene PHATRDRAFT_22113 details


Gene Information

Locus tag: PHATRDRAFT_22113
Symbol:
ID: 7203242
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 726456
End bp: 729426
Gene Length: 2971 bp
Protein Length: 870 aa
Translation table:
GC content: 55%
IMG OID:
Product: bifunctional aspartokinase
Protein accession: XP_002182284
Protein GI: 219123963
COG category:
COG ID:
TIGRFAM ID:
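
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that reproduces the listed gene length); the function and constant names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 726456
END_BP = 729426
PROTEIN_LENGTH_AA = 870


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def residue_coding_bases(protein_length: int) -> int:
    """Number of bases needed to encode the residues (three per amino acid)."""
    return 3 * protein_length


gene_length = span_length(START_BP, END_BP)
coding = residue_coding_bases(PROTEIN_LENGTH_AA)

print(gene_length)           # 2971, matching the Gene Length field
print(coding)                # 2610
print(gene_length - coding)  # 361 bases in the span that do not encode residues
```

The 361 bases left over do not encode residues of the listed protein. That would be consistent with the record marking the gene as spliced, although the record itself does not list exon boundaries.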


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTCACACCA TTCCACCCTG CAGTGATAGC GTCGTCACCC ACGACACACG TATTACACAC 
AGAACCCACA CTCCCCGCTA GTATTCCCAT GACGCCGGCA CCGACAATGG CGGCGACGAC
AACGCCCGGT GAAGGGGACT CTGATGCGGT GCACACGGGA TTTGCGGAGT TGCCCTGGCA
GGTGCACAAG TTCGGGGGCA CTTCCGTCGC GACGGCGGAA TGTTTCCTCG CCGTCGCCCG
TGTTCTGGAG CACGAGTTGG AGATTGACCC CGTACATTCT TGCATCGCTA TTGTAGTCTC
TGCCGTGGGG GGAAAGCCCA AGGTTACCGA CTTGCTCCTG GACACCGTCA AGGCGGCGGC
GCAGCGGGAT GCGCAGGGCG TCGACAGCCT CCTAGAGGTT GTTCTCCAAA AGCACCACGA
CTGCCTCGCC GCGCTCTTCG TCAAGGAACC TAGCGAACGC GATCGACTCA TGGAGATCAT
TCAGGGAGAT ATTATTGACA TTCGGGATAT TCTCAAAACG GTCGCCCTCA TGAAATGGCA
AGCCGCCCGC ATCTCCGAAC TCGTTTCGGG ATACGGCGAG CTCTGGTCGG CACAAATTCT
TACCGCTTTG ATGCGACTGC GTGTGCAAAC GAACGTGGAA GCTCTACGGA ACTCATCCAC
ACCGACTCTG GAAGACGACG GACTCGCCCA TTCACATCGG AGAGAGTACG TCTACCTCGA
CGCGCGGAGG GTCATTACGG TTGACGAAGA AGCCATCCAC AACGGCGCCG TCGTTTGGGA
ACTCTCGCAA GAGAAACTGG ACCGAGTCTA TCGAGAGGAA TTACAGAAAC TCCCCCAAGC
CGCGATTTTG CATTTCGTTA TTACTGGATA CGTCGCCAGC AACACGGAAG GAGTCGCCAC
AACCTTACAA CGTGACGGTT CCGATTACTC CGCCGCCATT ATGGGACGAC TTCTGCAGGC
ACACAAAATC ACCATTTGGA CCGATGTGGA CGGGGTACTC TCGGCCGATC CTCGACGTGT
CCCCCTGGCG CAAATCCTGC CCGAAGTCTC CTACACCGAA GCCATGGAAT TGGCCTACTT
TGGCGCCAAG GTCATTCACC CCAAAACCAT GCAGCCCGCT ATTTCCAGCT CACCACAGAT
TCCTATTTTC ATTCGAAATA CCTTCAATCC GAGTTTCCGA GGTACCCGCA TTTACGCACC
CGGTCTCAAC AAGGACAAGG ACAAGGTGGT GACGGGATTT AGCAGTGTCG AAGATATGGC
GCTCCTCAAC GTGGAAGGAT CCGGACTCGT CGGTGTCTTG GGCGTCGATC GACGGCTCTT
TGCCACACTC GAACGGATCG GCGTCAACGT CGTCCTTATT TCACAAGGCT CCTCCGAACA
CTCCGTCACC TTTGCCACCA AGGAGAGCCA AGCCAACAGC GCCAAACTCG CCATTGAAGA
AGAATTCCGC CGCGAATTGC TCCAGCACCG CATTTCTAAA ATCGAAATCC AGGCGCCCTG
CTCCATTCTT GCTGCCGTCG GTGACAACAT GGCGCTCACG ACAGGCGTAG CCGGCCGCTT
CTTTTCTGCT CTCGGCGACG CTAAAATTAA CATTCTCGCC ATTGCCCAGG GTTCCTCGGA
ACGCAACATC TCCGCCGTCG TACTCACTTC GGAATCGTCC CGGGCCCTGC GTGCCGTCCA
CGCGGCCTTT CGCCTCTCAC ATTCCACCGT ACGTGTCGCC ATTGTCGGTA TGAATGAATT
GGGAATTTCG CTACTCAAGT TGCTGGAATC GCAAAGATCA TCCCTTCGAT CGACTTTCGA
TATCGATCTA CAAGTGTGCA CTATTTTGTC CGACAGCACC AGTCCGCAGC TTATCACATT
GCTCAACGAC CGCGATGGCG GCGCAGAATC TATCACGATG AACGGCTTCA ACCGGGCCAG
TGGTGGTTCC AATTCTCTAT TGCTCGGGGC TCCGGCGACA AATGCTTCGC AGACGTCCTT
TCGCGACGAC GAAACGACTT TTCTGGAAGA CGGCGGCCGG GACGTCCTGC TCAACCGCCT
CATTCGCAAC GAATGCCCGA GTCACGTGAT TTTCGATTGT ACCAACGACG AAGAATTGGG
TCAGTCGCAT GCCGCATGGT TGCGCGCCGG CGTCAACGTC GTGACAGCCA ACAACACGGG
TATTTCGGGA CCTGCGGCGC AGCGCGAAGA AATCGCGGAA GCCGAACGAG CCCAAGGAAA
GAATGGCGCC AAGTACCTAC GCGAAGTTAC CGTTGGTGGC GGCTTGCCAG TTATTAATAC
GCTACGGTCA CTACTCCATT CGGGAGACAA GATTCGTCGT ATCGACGGTA TCTTGTCCGT
GAGTCTCTCG TTCATTATGT TTCGCATTTC TCCGGCGACT GATATTGCGA AATGCAGTGA
ATTTGACCAA ATGTCGAGTA AGGGTGCCTT TCACGAGGAC CGGTCCATGT CACCCACTGC
AACTTTGACA AAAGCGTGCA GTTTTAGTCA AGCCGTGAAA GAAGCCATCG CGCTCGGATT
GATGGAAGAA GACCCGACCA AGGATTTGAA CAACGAATAC ACGTCTCGTG TGCTCATGGT
GCTGGCGAAA GAATTGAATA TGGACAAAGG TGTGGAAGTG AGCGATATTC GCGATTCCAG
CGACAAGCTT TTGGAATTGA TTTGCGGCGA GACGGTGGAT TACACCAAGT TTTCGCCCGC
AGTGGATGAG CTAGTACAGG CACGGGTGGA TGCCGCTAAA TCTCGCGGTT GTGTGCTCCG
TCACATTGCA AGCGTGGATG TCAAGGCCAA GGAGCTCTCC ATTAAAGTTG TAGAAGTTCC
GGAACACCAT GTTCTAGCCG TGACACCCCC GAGTTGCGAA TGTGTGCGGT TCTTCACCCA
CCGTCACCAG CGCTACCCTC TAATTGTGCA AGGACCTAGT GCTGGTGCCG ACTCGACCGC
AAGTGCTCTC TTAGCCGAAC TACTACAGCT T
 
Protein sequence
MTPAPTMAAT TTPGEGDSDA VHTGFAELPW QVHKFGGTSV ATAECFLAVA RVLEHELEID 
PVHSCIAIVV SAVGGKPKVT DLLLDTVKAA AQRDAQGVDS LLEVVLQKHH DCLAALFVKE
PSERDRLMEI IQGDIIDIRD ILKTVALMKW QAARISELVS GYGELWSAQI LTALMRLQYV
YLDARRVITV DEEAIHNGAV VWELSQEKLD RVYREELQKL PQAAILHFVI TGYVASNTEG
VATTLQRDGS DYSAAIMGRL LQAHKITIWT DVDGVLSADP RRVPLAQILP EVSYTEAMEL
AYFGAKVIHP KTMQPAISSS PQIPIFIRNT FNPSFRGTRI YAPGLNKDKD KVVTGFSSVE
DMALLNVEGS GLVGVLGVDR RLFATLERIG VNVVLISQGS SEHSVTFATK ESQANSAKLA
IEEEFRRELL QHRISKIEIQ APCSILAAVG DNMALTTGVA GRFFSALGDA KINILAIAQG
SSERNISAVV LTSESSRALR AVHAAFRLSH STVRVAIVGM NELGISLLKL LESQRSSLRS
TFDIDLQVCT IFGGSNSLLL GAPATNASQT SFRDDETTFL EDGGRDVLLN RLIRNECPSH
VIFDCTNDEE LGQSHAAWLR AGVNVVTANN TGISGPAAQR EEIAEAERAQ GKNGAKYLRE
VTVGGGLPVI NTLRSLLHSG DKIRRIDGIL SVSLSFIMFR ISPATDIAKC TVKEAIALGL
MEEDPTKDLN NEYTSRVLMV LAKELNMDKG VEVSDIRDSS DKLLELICGE TVDYTKFSPA
VDELVQARVD AAKSRGCVLR HIASVDVKAK ELSIKVVEVP EHHVLAVTPP SCECVRFFTH
RHQRYPLIVQ GPSAGADSTA SALLAELLQL
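
Both sequences above are printed in space-separated blocks of ten characters, sixty per line, so they need to be cleaned before they can be measured or compared with the length and GC content fields. The following is a minimal sketch of that cleaning step; the helper names `clean_sequence` and `gc_fraction` are hypothetical and do not come from the database.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence, ignoring whitespace."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "CTTCACACCA TTCCACCCTG CAGTGATAGC GTCGTCACCC ACGACACACG TATTACACAC"

print(len(clean_sequence(first_line)))  # 60
print(gc_fraction("GGCC AATT"))         # 0.5 (toy input, not from the record)
```

Passing the full gene sequence to `clean_sequence` gives a string whose length can be compared with the Gene Length field, and `gc_fraction` on the same text can be compared with the GC content field.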