Gene PHATRDRAFT_43268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43268 
Symbol 
ID7196977 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2485975 
End bp2489447 
Gene Length3473 bp 
Protein Length1050 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177530 
Protein GI219111557 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCGCAGCGTC ACCCGAAAAA GCTCAAAAGC AACCCAACAG TAACCATGAT CGAGAGTACT 
ACTAGACACA GCGCCAACGG TAGCGAAGGG GACTCGAACG AGGTGGTAAG CCACAGCACG
AGAACCCCAC TTTCCTCACA GCTGTCTTTG GGTCCCGAAC GCATTCCCCC AACTTCAGCA
AGAAAGAGTG GTGACGAACT AGTGAAGTAT TTTGCTCTTC CTTGGCGGAC ATTGCTTCGG
AAGCTTGCGC GTGTCGGCAC GAAGCAACTG TCCGCAGCCG TAGCAGCCAC GGGAAGGGCG
TACTACAACG TGGGAACAGT GAAGAAATCG GGGATTGGTC ATCGCGATGT AGCACCAATG
GCAGCGGGGC TGCGGTAGTT ACCGAAAAGA CCTCCTCGGG AGATGACGTC GAGTCACGTG
CCAACACTTC GAATGTCGCG TCTTTATTGG GAGATAGACG ATCCCTACCA ATGTCTGTAG
GCCATGACAA TGGTGAAGTG TTTTGCGAAC CCTCGGGAGC AGCTCAGCAA TTTCATCACT
ACCCGCATCG CCAGCGAAGA CACTCTGAGA GAGAGGGAGC TGGAACTACA GGAGAATTAC
CCCTGGGCGG GGATCCACTT CAGGATTGCA ATTCACACTT ACACGCAGCC AATCGCCGTC
GAGTCACCGC ATCATCACCG CTTTCCAAAC ATTGTCCTTC TCCAAAGGCA TCGTTCGAGG
CAATGCTGGC AACCACCACC AGCTCGATTC GCTCAGGAAA AAATGGGGCG GCAAAGAAGA
ACACAGGATT ATCGAAGGGA CGAAAGAACA CGAAAGGTGT ATTGGCACAT CATATGGCTG
GGACGTATCC GATGCAGAGA ACAACTCATC GATTTGCCCA ACTACAACAA AAACGCACCG
ACTCGGATGG GATGACCTCG AATGAACCTC GAGATGCTTT TAAAAAAGCA TGGATGAAAA
TGGATAGTGC TTCCTTCTCA AATGGAAGCC CTTCATCTCC CAGTAGCCGG ACTTCCGACG
ACGACATAGA AGGCTGGAAC GGGAGCTACT CTGGAAGTGG CACAGAAGGG GGTTATGCGG
CTTCGGCGTC GTCTAACGAA ACTCTTTCAG GAGCTGAAAG ATCATGCTCG TCTGATTTAT
CTGCAGAAGC TTTAAAGTTA AGACACATCA ACACTTCCTG CAATCCGGAC GCTTCTGAGC
ATGGCAGGAA GCAAGAGTCG ACGTTTTCAA CATCTTCCGA CATTGCCGAT TTCAGCTCAG
GTGGCTCCGA GTTTGAATCC GTTGACGAAA GCATACCTGA CGCGGTATCG GATTCCGCTT
CGAACGCATT GTCTTCGAAC GGTAGCCGCT TCTTCGACAA AAGAGAGGAT GAATTCAAGG
CCACCAAACG AAACCATCCA AGACGTCAAG CTTCCATACC TCGCAATCAC TTGAGCTTGT
CGCAACTTGC AATGCAGAAT AGTAATAAAG TTTCGAATCA GACCAACCTG ACTGTGAAAC
CAAGCGCTTG TCACGAAATT AATGGAAAGG CCCCTATTTT AGCCTTGGGA GGAGATGTTA
TGGCGCACGT CTTGGCATTT TTGGAGCCAC CAAAGATCCT GGAGGTTATT ACGGCTCCAC
TCTCGAAAGA TTGGCTGAAC GCCTTCTCCC ATCAGTCAGA GCTTTGGCGT GTGCTCTGTC
TTCAGGAGCC TTTGAAGGCT AGAATCGAGA ACGAACCGGA CAGCGACGAC GAATCTTTTA
CAGGTTCTTT CTCTTCTATC ACCGGATCCC AACACCGCCT CACATTTGGC AAGTTTAGAC
TCTTATACAC GGCATTTGTC CGTTGTATGA AATATTTGGC TCGCATTAAG GACGACGCAT
CAAATGGAAG ACCTCTATCG GTGGTAGATT ACGGTGTTGC CGACGGCATG GGAAGCCACG
ACATTGGTTC CAATCAGAAT CTTCGACAGT TTCTGGCAAG AAGTAGAGGT GTAAGTTTCC
TCACTGGACA GAATGAACAG GCCGATACGC CCCACAGCGG CAGCCGTCAT CACGCATTTG
CTACTATAAC TCAGTCGATT GGAGTCTCGG ATAACGGCAG TGCGACTAAA TCCAAACGGA
AGCATAAGGA GGAGGAGGAA TGTATGGCTG TCAAGAAGAT GCGACGCTTC GCCAGTGGGC
CCTCGGCGCT TACGCAGAGA TTGCTCGGAC CACCCAGCAC CGGAACTCCA GGTAACACCG
AGTTGCCATG GTCCTGTGCG ATCTTTTCGA TCGTTAATTG GATGCTTGCT TTCTCGGATG
TGGAGGGTAT TCAGACGATG TGCCTGAGAG ATTTGCCGTC TTTGCTCGAA GATGAACAAC
AGCGAATCAC TGCCCAGCGA GCAGGACTGA CGGACGTGGT ACTCCGCGCT ATGGTTACGT
TTCCCGACAG TAGCCCACTG CACACGGCAG CGTTTCACAC CATTGTCCTT CTGGCCCGTC
CATTGGGTGG TCGAGAGGGC ATGTTGTTCC ATACGTCAAT GGTGAATTCA TCGGGCATCT
TCAGTGCTAG CAGTGTGGCC TCTCGCAATG GTAAGAGCGG TATAGCTGTT ATGCTTGATT
CAATGAAAAG GTTTCAGCAA GACGAAGTAC TTCAAGCCAT GAGTTGTTGG TCCCTTGTAA
ATATTGCTCT AGCCCCGGCA CAGAAGGAAG TTCTTGTGAA TCTTGGTGGT ATCGAAGTGA
CATCCAGGGC TATGTGTGCT CATCCGCATA GCGCTGAAGT TCAATTCCGT GCGCTTTTTG
CTCTGATCAA TCTTGTAATC CCCTCGCGAG ACCAAGGAGA GCCTTTAAGA GGAGAAGTGA
TTACCGAAAA GGAAATGCTC GATGAAAGTG TTGATCAGAT CATCCACCTC GTTCTCCTGG
CGATGAAGAA CTTTTGCGCA TCTGAAGCAA TCGTGAATAG AGCGTGCCTT GTTCTTCACA
ATGTGTCACT TACTCGAGAG TACCACGAAA CACTCCTTTG TTGTCCAAAC TGTTACCAGA
TGCTGGAATG GTGCTTGGCC AACTACCCAA CTGATCAGGT CCTGCAGCAA AGCGCTTCGG
GAACCTTACA CCGCCTCCAG CTCACTTTGA ACAGTGACGA AATCCTTCGA ACTCGATTTG
CTACTACTTT ACAAGCGCAG CAGCAAATGT CCCTCGAGAA TGTGCATAGA GAGGCGATTG
TCGCTCATGA GCAGCACGCT CAAAGTCGGA CGATATAACT GTAAAATACA CCGCCGGAGC
ACATGTACTG AAATGTTGTT GACAGTGAGT GACTTTTGAT TATATTAAAT TTGACCATAA
GGACGTGATC AACTATTTGA CTGAATCGAC ATGATATTGC TACAACTGTT TGTTAATATA
CGCACAAACA ACGTAGCATC AATGGACAAT TGGTCTTCGC GCTTTCTGGT ATTAATAGTG
AGGAAAATAG GCGCTATACA TTAGACTAAA CGGTGTAATT TCACGCGATG CCG
 
Protein sequence
MIESTTRHSA NGSEGDSNEV VSHSTRTPLS SQLSLGPERI PPTSARKSGD ELVKYFALPW 
RTLLRKLARV GTKQLSAAVA ATGRAEEIGD WSSRCSTNGS GAAVVTEKTS SGDDVESRAN
TSNVASLLGD RRSLPMSVGH DNGEVFCEPS GAAQQFHHYP HRQRRHSERE GAGTTGELPL
GGDPLQDCNS HLHAANRRRV TASSPLSKHC PSPKASFEAM LATTTSSIRS GKNGAAKKNT
GLSKGRKNTK GVLAHHMAGT YPMQRTTHRF AQLQQKRTDS DGMTSNEPRD AFKKAWMKMD
SASFSNGSPS SPSSRTSDDD IEGWNGSYSG SGTEGGYAAS ASSNETLSGA ERSCSSDLSA
EALKLRHINT SCNPDASEHG RKQESTFSTS SDIADFSSGG SEFESVDESI PDAVSDSASN
ALSSNGSRFF DKREDEFKAT KRNHPRRQAS IPRNHLSLSQ LAMQNSNKVS NQTNLTVKPS
ACHEINGKAP ILALGGDVMA HVLAFLEPPK ILEVITAPLS KDWLNAFSHQ SELWRVLCLQ
EPLKARIENE PDSDDESFTG SFSSITGSQH RLTFGKFRLL YTAFVRCMKY LARIKDDASN
GRPLSVVDYG VADGMGSHDI GSNQNLRQFL ARSRGVSFLT GQNEQADTPH SGSRHHAFAT
ITQSIGVSDN GSATKSKRKH KEEEECMAVK KMRRFASGPS ALTQRLLGPP STGTPGNTEL
PWSCAIFSIV NWMLAFSDVE GIQTMCLRDL PSLLEDEQQR ITAQRAGLTD VVLRAMVTFP
DSSPLHTAAF HTIVLLARPL GGREGMLFHT SMVNSSGIFS ASSVASRNGK SGIAVMLDSM
KRFQQDEVLQ AMSCWSLVNI ALAPAQKEVL VNLGGIEVTS RAMCAHPHSA EVQFRALFAL
INLVIPSRDQ GEPLRGEVIT EKEMLDESVD QIIHLVLLAM KNFCASEAIV NRACLVLHNV
SLTREYHETL LCCPNCYQML EWCLANYPTD QVLQQSASGT LHRLQLTLNS DEILRTRFAT
TLQAQQQMSL ENVHREAIVA HEQHAQSRTI