Gene PHATR_33124 details

Gene Information

Locus tag: PHATR_33124
Symbol:
ID: 7204255
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 108080
End bp: 111525
Gene Length: 3446 bp
Protein Length: 1108 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002186280
Protein GI: 219113393
COG category:
COG ID:
TIGRFAM ID:
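
The length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the coding sequence ends with a single stop codon. The function names are illustrative and do not come from any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 108080
END_BP = 111525
PROTEIN_LENGTH_AA = 1108


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein, optionally counting one stop codon."""
    return 3 * protein_length_aa + (3 if include_stop else 0)


gene_bp = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
print(gene_bp)           # 3446
print(cds_bp)            # 3327
print(gene_bp - cds_bp)  # 119
```

The 119 bp by which the gene span exceeds the coding length is consistent with the gene being flagged as spliced, although this record does not list the intron coordinates themselves.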


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0372289
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCCTC TCGAACATGT CCTTGTGAAC CTCTTGGGAG CAACAACGTT GGATTCGTCG 
TACCGTCGGT TCTTTGAAGA GTATGGGATT ACTCAGGCCA GTGAATTGGC CTCAATCACT
GAACATTGTC TTGCAATGGT GTCTTATGGC GTCTTGACCC CTGCTGTGGG AGACGGCCCT
GCTTCAATTG TTCGTACATT CCTTCCGCCT GCGCAACAGG ATCGGATTTT GAAGATTGTA
CAATGGTTCC TTTTGAAAGG CACCAATGTG ACAAACAACA CCTGGCTTGA ACTTACCTCT
GATGTTCTTG AGTATTGGCA ACCAGCCTCT GCTATTGTTG CCCCCGCTAC TCCTGTTGGA
TCTGATGCTC GGAGTTCCTT TGCCGAAAGT GCTGCTGCAA AATTTCGAAA GACGATCAAG
AACCATTCCG TCCCGTACCC AAAGTTCAGT GAGGACCGTT TTTGGGTTAC TTGGAATACG
AATATTCGTA TTAAACTCCG TATCCACGGT GTTCAGTTTG TACTTGACCC GGACTACTTG
CCCCAGACCG TTGATGAGAT GGACACGTTC GTTGAGATGC AGAATTTTGT CTTCGGTGTA
TTCAACGACA TTTTGTTGAC CCCTCGTGCG CGTGGGATCC TCCACAAGCA TGTGGATGAG
CTGGATGCTC AGTCGGTCTA CCGCGACCTT GTTGCCTCGT ACGGTAAAGG TATTAACGCG
CAGATCACGG CTACATCCAT TGAAACGAAG CTCACCTTGT ATTCGTTTGC GACTTCAAAG
AGCAAAACTT GCGTTACTTT TTTGACAACT TGGCGCAATT TGATCTACGA TCTTGAACGG
ATCAACAAGT TCCCTTTGCC AGATCACCAA AAGAGCGTGC GACTGAAGTC AGCTGTCCGT
TCCCATCCTC AATTGAAACT TTTCCTTGGC AATGTGCAGC TTTACTCTTG TACCCATGTG
GGAAAGAGTT CCGACGACTC AGACTTCGAG TACGTCTATG ATCTGATGCT CGAGCATGCA
ACCAATATTG ATCAGACCGA TTTTGAAGAC CGCGGTAATA ACCGTGGTGG CCGTTCTGCA
AACAACGCGA AGTCCCAGTC TTCTTCAAAG AAGAAAACTA ACAAGCCGCT TGGTAAGAAG
CACAAGAATT ATGTGCCTCC TGAGAAATGG AATGCTCTCT CTCCTGAAGA GAAGCGCACC
ATTATGGACC AACGAGGACC TTGTCCTGCT GCAGCTCCAG CCTCTGCTCT GTCCGTGAAT
GCCGCTGCTA CTCAGCCTCC TCCCACCGTG TACGTGAGCG ACTCGACGGC TGTGGATAAC
CAAAGTCTTG CTTCGACTCA AGTTCCAAAT GCTGCTGCAT CCGGACACCT GCTACGGTCG
CTAATTTCAA ATTCTGCCGC TCGCCAGCCG TCCAACGGCG CAACCTCTGA TTCTTTTTCG
GTGAATGGTA CCACGTACCG CCGTGAGGTA AACCATGCCT CCGTCAGATA CCGTCTGTCT
ACTCACGATG TTTCTTTGAC TAAAGATTCT TTGATTGATG GTGGTGCCAA TGGTGGCCTT
AGCGGCTCGG ATGTGACCGT TATTTCCCAA TCTCTGTCGC AAGCTACTGT TTCTGGGATT
GGAAACTCGG AGCTCACCAA CCTTCGTCTG TCGACGGTTG CCGGACTCAT CCACACCACG
GATGGTCCTA TCATTGGAGT GTTTAATCAG TATGCTCATC TTGGTACTGG TAATACCATT
CATTTGTGCA ACCAAATGCG CTCCTGGGGA GTCACAGTTG ACGACGTCCC TCGTACTTTT
GGTGGCAAAT AGAGTATTGT CACGTCCGAT GGCCGTTTTG TCATCCCGCT TTCAGTTTCT
GGCGGACTCA CCTATTTGTC TATGCAGGCC CCCACCGAGG AGGACCTGGA CAATTTCGAG
TGGGTTCATT TTACCGCCGA CAACGAGTGG GACCCGAATG GCGTGTCTTC TCTCCTGCTG
CGACCGATGA TGATCTCAGT TTGCAGCTTT CTGCCGACCA TGTTCCGGGA TGAACGCTTC
AACAACTTTG GCCTTCTTGC GCACTCCACG GTTGTCAGTC GTTCCCCCTT GAACGCCGAT
GTCTTGCAAC CCAATTTTGG ATGGGTCCCC AGCGCTCGAA TCTCCCGCAC GTTCGAAAAT
ACCACACAAT TTGCTCGTGC CGATGCTCGT TTGCCTTTGC GCAAGCATTT CAAGTCGCGC
TTCCCTGCTG CCAATGTTTC TCGTTTGAAC GAAATTGTGG CAACCAATAC TTTTTTCTCG
GATACCCCTG CGGCCGATGA CGGCATTTTC AACCATGGGG GGGGGGGGGG GGTACAATGG
CCCAACTTTT CGTAGGAAAA AGTTCGCAAA TCACCTCTGT CTTCCCAATG AAGCGCAAAT
CTCAGTTTGC CCATGCTTTC GAGGACTTTA TTTGTACCCA TGGTGCTCCC AATGCACTCC
TCAGCGACAA TGCTCGTGCT CAGATCGGTA AGCAGGCGCT TCAGATTTTG CGAATGTACG
CAATTGACGA TATGCAGTGC GAGCCGCATC ACCAACACCA GAACTACGCG GAACGCCGCA
TTCAGGAAGT GAAAAAGATG GTAAATACCA TCATGGATTG TACAAACACT CCTCCGGAGT
ATTGGTTGCT TTGCTTATTT TATGTGACCT ACTTGCTGAA TTGCCTTGCA GTTGAGAGCT
TGAATTGGCG TACCCCGTTG CAAGTTGCGT ACGGCCAGCG TCCTGATATT TCCGCTTTGC
TCCTTTTTCG TTGGTTTGAG CCGGTCTACT ACTACGACCC TGACCATGCA TCTTTCCCGT
CGCAATCTCG CGAGAAGACT GGTCGTTGGA TTGGTGTTGC CGAACATAAA GGTGATGCTT
TGACTTATTG GATTTTGACC GATAATACTC ACCAAGCCGT TGCCCGTTCT GTTGTTTGTT
CAGCCAACGT TGATAACGGT CTGAAAAACC ACCGTGCTGC GAATTCCTCT CCCGATGGTG
GGGAGCCTTC GAATCCTAAG CCTATTGTGT TGGCTTTGAG TGATCTACGC AATCCTGCTG
CGATCAACCC ATCGCTCTTT GAATCCCCTG CGTTTTCTCC TGACGAATTA ATTGGTTGAT
ACTTGGTTCG TGAAGCCCCT GATGGCCAGA GCCACCGAGC CCTTGTTGCT CGTAAAATTG
TTGATGCCGA TTCCGACAAT CGCCAAGCAA TCCGTTTCCT ATTGCAAATT GATGAGAAGG
ATGCCGACGA GATCATTTTG TATAATGAAC TCTGTGACTT GATGGAAGCT CAGCAAACCG
ACCGTGTCAC GAATGGAAAT GTTGAAGGCC ACTTCAAATT TACTGGTGTC ATTGGACATC
AAGGACCGTT GCAACCGACT GATGTAAACT ATAAGGGATC GTCGTGGAAT GATTTGGTTC
AATGGGAAGA TGGTTCCCAG ACCTAG
 
Protein sequence
MDPLEHVLVN LLGATTLDSS YRRFFEEYGI TQASELASIT EHCLAMVSYG VLTPAVGDGP 
ASIVRTFLPP AQQDRILKIV QWFLLKGTNV TNNTWLELTS DVLEYWQPAS AIVAPATPVG
SDARSSFAES AAAKFRKTIK NHSVPYPKFS EDRFWVTWNT NIRIKLRIHG VQFVLDPDYL
PQTVDEMDTF VEMQNFVFGV FNDILLTPRA RGILHKHVDE LDAQSVYRDL VASYGKGINA
QITATSIETK LTLYSFATSK SKTCVTFLTT WRNLIYDLER INKFPLPDHQ KSVRLKSAVR
SHPQLKLFLG NVQLYSCTHV GKSSDDSDFE YVYDLMLEHA TNIDQTDFED RGNNRGGRSA
NNAKSQSSSK KKTNKPLGKK HKNYVPPEKW NALSPEEKRT IMDQRGPCPA AAPASALSVN
AAATQPPPTV YVSDSTAVDN QSLASTQVPN AAASGHLLRS LISNSAARQP SNGATSDSFS
VNGTTYRREV NHASVRYRLS THDVSLTKDS LIDGGANGGL SGSDVTVISQ SLSQATVSGI
GNSELTNLRL STVAGLIHTT DGPIIGVFNQ YAHLGTGNTI HLCNQMRSWG VTSIVTSDGR
FVIPLSVSGG LTYLSMQAPT EEDLDNFEWV HFTADNEWDP NGVSSLLLRP MMISVCSFLP
TMFRDERFNN FGLLAHSTVV SRSPLNADVL QPNFGWVPSA RISRTFENTT QFARADARLP
LRKHFKSRFP AANVSRLNEI VATNTFFSDT PAADDGIFNH GGGGGRKSQF AHAFEDFICT
HGAPNALLSD NARAQIGKQA LQILRMYAID DMQCEPHHQH QNYAERRIQE VKKMVNTIMD
CTNTPPEYWL LCLFYVTYLL NCLAVESLNW RTPLQVAYGQ RPDISALLLF RWFEPVYYYD
PDHASFPSQS REKTGRWIGV AEHKGDALTY WILTDNTHQA VARSVVCSAN VDNGLKNHRA
ANSSPDGGEP SNPKPIVLAL SDLRNPAAIN PSLFESPAFS PDELIAPDGQ SHRALVARKI
VDADSDNRQA IRFLLQIDEK DADEIILYNE LCDLMEAQQT DRVTNGNVEG HFKFTGVIGH
QGPLQPTDVN YKGSSWNDLV QWEDGSQT
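
The sequences above are printed in space-separated groups of ten over many lines, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that and to compute a GC percentage. The helper names are hypothetical, and the database may compute its reported GC content differently (for example with different rounding), so the result need not match the listed value exactly.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence block printed in whitespace-separated groups."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as a small demonstration input.
example = "ATGGATCCTC TCGAACATGT CCTTGTGAAC CTCTTGGGAG CAACAACGTT GGATTCGTCG"
seq = clean_sequence(example)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

To check the whole record, paste the full gene sequence block into `clean_sequence` and compare `len(...)` with the listed gene length of 3446 bp.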