Gene PHATRDRAFT_45989 details


Gene Information

Locus tag: PHATRDRAFT_45989
Symbol:
ID: 7201054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011676
Strand:
Start bp: 878783
End bp: 881883
Gene Length: 3101 bp
Protein Length: 1008 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002180133
Protein GI: 219118732
COG category:
COG ID:
TIGRFAM ID:
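
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a single stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 878783
end_bp = 881883
protein_length_aa = 1008

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene is unspliced (as the record states) and the coding
# region is one codon per residue plus a single stop codon.
coding_length_bp = 3 * (protein_length_aa + 1)

# Bases in the listed gene sequence that fall outside the coding region.
trailing_bp = gene_length_bp - coding_length_bp

print(gene_length_bp)    # 3101
print(coding_length_bp)  # 3027
print(trailing_bp)       # 74
```

Under these assumptions the computed gene length agrees with the listed 3101 bp, and the listed gene sequence would extend 74 bases past the stop codon.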


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.400632
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTATT TGAACAATCG TGTCGGCAAC GGCGGTCCCA GCGGCAGCAA AGCGGTTCGC 
CCCCAGAACT ACGTACCCGG ACTCGGGCGC GGGGCTGCCG GATTTACAAC GCGCTCCGAT
GTCGGGCCCG CAGCGAACGT TGCCCTTACG GCCGAATCCA CGGGAGGCAG TCGGGCAGCG
GATGCACGCG CTGCCAAACT GCAAGCCCAA AAGCAGAAGG GTCTCTTTGG AGATGCGCCT
CAGAATTACG TACCGGGAGC GGGACGGGGT GCGGGCAGCA TGGGTGCAGC GGGGACCGGG
GGGCCTGCCA CGGCGACAGT GGGTTCCTAC GACGCCTTTG GCGGCTATCA GGAACGCCCA
GTGAACGAAG TTCCGGGACA GTACGACGAA GATGATGACG AAGCGGATCG TATTTGGGCT
GCCATTGACG AACGTGTGCA GAGCAGGAAA CGAAAGTCCC AACAGCGAAA AGAGGAAACC
GTAGCAACGT CCGCCGAGAC GGACAATGCT CGTGTACGGA TAGGATCGCA ATTCAGAGAG
CTCAAAGAGA AACTGAAAGA TGTTTCGGAA GACCAGTGGG CAGCTATTCC CGACGTGGGG
GATCACAGCA TCAAGTACAA GAGACAAAGG CAGCAACAAA ACGAAATGTT TACTCCGCTC
TCCGATACCC TACTAGAGCA GAGGAATCAA GCCAATCTTG ACGCAACTGC GGGCAACACA
GCCTTGGCCG GGACTACTAC AGCCGCTGAT GGCATCCACA CCACGGGAGT AACTACGACC
ATGGCCAACA TGTCGGGATT GAGTGCCGCG CGGGGTACCG TCTTGGGCAT GAGTCTCGAT
AAAATGTCTG ACTCCGTATC CGGTCAAACC AACGTGGATC CTCAAGGCTA CCTGACGTCG
CTCGGCAGTA GCACTACAGC CCTCAACAAC GCTTCACAAG TGGCTGATAT TCATAAAGCT
CGACTGCTGC TGAAATCGGT TCGTGATACC AATCCACAGC ATGGACCTGG CTGGATTGCC
TCTGCCCGCG TAGAAGAAAC GGCAGGAAAG CTTCTGCAAG CCCGTAAGAT TATTCAGGAA
GGAACCCGTG TTTGCCCCGA CAATGAGGAC GTTTGGCTTG AAGCCGCCCG TCTGCATCCA
ATTCCCGTAG CCAAATCCAT TTTGGCGACA GCCGTCCGGA GAATACCTAC ATCGATACAA
ATCTTTTTGA AAGCTGCTTC CTTGGAAACG GCCGACTCCG CCAAGAAAGC TGTCCTGCGC
AAGGCTCTCG AAGCCAACCC AACCTCCACA CTGCTCTGGA AGGCTGCGAT TGACTTGGAG
GAGGCCGACG ATGCCCGAGT ACTATTAGCG GTCGCTGTCG AAAAGGTTCC GCAAGACGTC
GATTTATGGC TTGCCTTGGC ACGCCTCGAA ACATATCAGA GCGCTCAAAA GGTGCTGAAC
AAAGCCCGCA AAGCCTTACC ATCCGATCGC TCGGTCTGGC TGGCAGCGGC CAAGCTGGAA
GAATCGCAAG ATCACGTTGA TACGGTATCC AAGATTGTCG ATCGAGCCGT ACGGTCGCTT
CGGAAACAAG ACGCCGTTAT ATCGCGAGAA CAATGGTTAG AGGAAGCCGA GAAGGCGGAA
TCGGCCGACG CACCGATCAC AAGTGCGGCA ATCATACACC ATACGATTGG TCAAGACGTG
GAAGAAGAGG ATTGTCTGCG TACCTGGTCG GAAGACGCCA AAGCTTGTGT TGCCCGGGGT
TCCGTCGTGA CGGCACGGTC TATTCTCGCG CACGCGTTGC GAGTGTTTCC GAGCAAGCGT
GTTTTGTGGA TGCAAGCGGT AGAATTGGAA CGCCAGCATG GGACAGCGGT AACACTAGAA
GAGCGTTTAC GGGATGCCAC ACATGCTTTA CCGCGGGTGG AGATTTTCTG GTTATTGCGC
GCGAAGGAGC AATGGATGGC GGGCAAGGTC GACGAGGCAC GTCAGATCCT GACGGACGCC
TTTGCGGCCA ACCCGGATTC TGAGTCGGTC TGGTTGGCGG CGGCCAAGCT GGAGTGGGAA
AACGACGAAC TGGAGCGAGC GCGAGTCCTG TTCGCTCGGG CGCGTGAACG AGCACCGACG
GCCCGCGTAT ACATGAAATC GGCGATTCTG GAACGGGAGC AAAAGTGTTT CGGGGATGCG
CTGAAGCTGG TAGAAGAAGG AATCGAAAAG TATCCCAAAT TTGCCAAGCT GTACATGATC
GGGGGACAGA TTTACGCGGA CGACATGCCG AAGCACAAGG GAAGCTTGGA TCGAGCGCGC
AAGTTTTATC AGCGAGGACT GGAAGCTTGT TTGGAGAACG TGACGCTCTG GAAGTTGGCG
AGTCGGTTAG AAGAATCGGC GTGGCGGTTC GACGCAAAGG ATGCGGCTGG GGAATCCGAC
AAGGCTGTGA GCAACGGGAA CGTTGTAGCC AAACCTGGAG CTGCGGGTGC TACCAAAGCG
CGCAGTCTTT TGGAATTGGC CCGTCTGAAG AATCCCAAAA ACGCGGAATT GTGGTTAGAA
GCCGTCCGGT TAGAGCGTCG GAACGGGAGT CTCCGCATTT CCGAAAGTTT GCTGGCGAAA
GCGTTGCAGG AATGTCCGAC TTCGGGAATG CTGTTGGCCG AAACGATTTG GACAGCGCCG
CGCGCGACTC AAAAGTCGAA ATCGGCAGAC GCCATTCAAC TGTGTCCGGA CGACCCGCAG
GTAATTGTGG CCGTGGCGAG CCTATTTGCG TCGGAACGCA AGCACGAAAA GGCGCGGAAG
TGGTTCGATC GCGCTGTAAC ACTGAATCCG GATCTCGGTG ACTCGTGGGT CCGTTACTAC
GTGTTTGAAC TGCAATGGGG GACTGTGGAG CAGCAAGGGG CCGTGAAAGA ACGATGTATT
GCGGCGGAAC CCAAACACGG CGAGTTGTGG GCATCTACGA GAAAGGAGGT AACCCGACGA
CACGAGTCGA TCGGAGAAGG TCTCGAGGTG GCCGCCCAGA AGCTTCGCAA CGCGCAGGAA
AGCGAGAATC CCTCCGTTAT GCTCTAATGG AGTAGCAAAG CGTTTCCCTT AGTGTATGTC
CCTGTGGCTA TTTTCCAGTT ACTAGTGTAT GAGGTACAGT T
 
Protein sequence
MSYLNNRVGN GGPSGSKAVR PQNYVPGLGR GAAGFTTRSD VGPAANVALT AESTGGSRAA 
DARAAKLQAQ KQKGLFGDAP QNYVPGAGRG AGSMGAAGTG GPATATVGSY DAFGGYQERP
VNEVPGQYDE DDDEADRIWA AIDERVQSRK RKSQQRKEET VATSAETDNA RVRIGSQFRE
LKEKLKDVSE DQWAAIPDVG DHSIKYKRQR QQQNEMFTPL SDTLLEQRNQ ANLDATAGNT
ALAGTTTAAD GIHTTGVTTT MANMSGLSAA RGTVLGMSLD KMSDSVSGQT NVDPQGYLTS
LGSSTTALNN ASQVADIHKA RLLLKSVRDT NPQHGPGWIA SARVEETAGK LLQARKIIQE
GTRVCPDNED VWLEAARLHP IPVAKSILAT AVRRIPTSIQ IFLKAASLET ADSAKKAVLR
KALEANPTST LLWKAAIDLE EADDARVLLA VAVEKVPQDV DLWLALARLE TYQSAQKVLN
KARKALPSDR SVWLAAAKLE ESQDHVDTVS KIVDRAVRSL RKQDAVISRE QWLEEAEKAE
SADAPITSAA IIHHTIGQDV EEEDCLRTWS EDAKACVARG SVVTARSILA HALRVFPSKR
VLWMQAVELE RQHGTAVTLE ERLRDATHAL PRVEIFWLLR AKEQWMAGKV DEARQILTDA
FAANPDSESV WLAAAKLEWE NDELERARVL FARARERAPT ARVYMKSAIL EREQKCFGDA
LKLVEEGIEK YPKFAKLYMI GGQIYADDMP KHKGSLDRAR KFYQRGLEAC LENVTLWKLA
SRLEESAWRF DAKDAAGESD KAVSNGNVVA KPGAAGATKA RSLLELARLK NPKNAELWLE
AVRLERRNGS LRISESLLAK ALQECPTSGM LLAETIWTAP RATQKSKSAD AIQLCPDDPQ
VIVAVASLFA SERKHEKARK WFDRAVTLNP DLGDSWVRYY VFELQWGTVE QQGAVKERCI
AAEPKHGELW ASTRKEVTRR HESIGEGLEV AAQKLRNAQE SENPSVML
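
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one possible way to strip that layout, compute a GC fraction, and translate a nucleotide string. It assumes the standard genetic code (NCBI translation table 1), because the record leaves the "Translation table" field blank. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated over TCAG.
# Assumption: the record does not name a translation table, so table 1 is a guess.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGTCGTATT TGAACAATCG TGTCGGCAAC GGCGGTCCCA GCGGCAGCAA AGCGGTTCGC"
print(translate(first_line))  # MSYLNNRVGNGGPSGSKAVR
```

The translated prefix matches the first twenty residues of the listed protein sequence, which is consistent with, but does not prove, the table 1 assumption.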