Gene PHATRDRAFT_52174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_52174 
Symbol 
ID7202040 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp566051 
End bp569924 
Gene Length3874 bp 
Protein Length1143 aa 
Translation table 
GC content45% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181228 
Protein GI219121760 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.338516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAAAAAAAGA CGGTCGAGGA AACTTATCAG AAAATGTCGC AGCTCGAGCA TATCCTCATT 
CGTCCCGACA CCTACAGTAC GTATACATCT GAAGCAAAAA TAGCCATACG CTTGTTTTTA
TGTTGCTTAT TCGCTTACTC GTATTTTTTG AAATACCTTT AACAGTCGGC TCAGTGGAGC
CTCACACCAA GCTCATGTTT ATCTTGGAGG ACGATAAGAT TATCGAAAAA GAAATCACCT
ACACTCCAGG TCTTTTCAAG ATTTTTGACG AAATTGTTGT CAACGCAGCA GATAACAAAC
AGCGCGACCC GAACATGGAT AAGCTTGATG TGACTATAAA TGCCGAAGAG AATGTCATTT
CTATTCAGAA CAACGGAAAA GGTATTCCAA TCGTGATGCA TAAGGAACAC GGCATTTATG
TGCCAGAACT CATTCTAGGA CACCTTTTGA CGGGCTCCAA CTTTGATGAC GGAGAAAGGA
AAACAACAGG TAAGTTTAGC ATGCAATGGA AACCATCCCA AAGATCCTTG CCAAGTGAAT
AGACACTCAC CTATGGCATT ATTCCAAGGT GGACGAAATG GATATGGTGC CAAATTAGCG
AACGTATTTA GCACGGAATT CGTCGTAGAA TGTGCGGATA CGGAAAACGG GCTCAAGTAC
CGCCAGGTAT TTCGAGAAAA CATGAAGCAA AAGGACAAAC CAGTAGTCAA GGCGCTCACA
GCCAAAGAAA TGAAGGCTGC GGACTACGTC AAAATCACCT TTAGTCCCGA TTTAGAGCGC
TTCAAAATGG ATCGTTTGGA CGAAGACACA GTCGGTCTCC TGTCCAAACG TGCCTACGAT
ATTGCGGGAA CCATGTCTCA GAGCAGTGGC AAGAAACTTC TCGTCTCCCT CAATGGCAAA
AAACTTCCAA TTAAGTCTTT CAAAGATTTC ATAGGCGTAT TTGAGGGAAT TAACAAGCCG
TCCGCGTTCG AAACGGTACG TCTTGCAAAT GTAGTTCGCT TCGAATTTGG GTGCCCTTCA
GTATCTCACC TATGCGATTC CTTGTGGATA GTCTGATCGC TGGGAGGTTG GTGTTGCCGC
TTCCGACGAT GCGGCGATGA AACAGGTTTC GTTCGTCAAC TCGATTTCGA CGTCCAAAGG
AGGCACCCAC GTTCAGTACA TCGCAGATCA AGTGGCCTCT CACTTGGCCA AAGCCATTAC
CAAGAAAAAC AAGAAAGGTG GGGAGGTGAA GGCGTCGTTT ATCAAAAATC ATTTGGCGAT
TTTTGTCAAC TGTTTGATAG AAAATCCCGC CTTCGACAGC CAAACCAAGG AATTCATGAC
GAGCAAACCC AAGGACTTTG GTTCCACCTG TAAGCTCTCA GACAGCTTCT TGAAAAAGCT
CGAGAAAAGT GCAATCGTTG AGAATGTACT AGCATTTAGT CAGTTTCGGG ATCGTCAAGC
CTTGAAAAAC AAAAGCGGTA AGAAGAAGGC CAAACTCACT GGTATCGACA AGCTCGATGA
CGCCAACTTT GCAGGGACGG CAAAGTCAAA AGATTGCACT CTCATCATCA CAGAGGGAGA
CTCAGCCAAG TCGTTGGCCA TGGCGGGCTT ATCAGTCATT GGACGTGACT ATTACGGAGT
CTTTCCCCTT CGTGGGAAAC CATTGAACGT CCGTGATGCT CCCATCAAGG CCGTCACCAG
CAACGAAGAG ATCAAAAACG TGGTTGAGAT AATGGGTCTA AAGTTTCAGA CAGTGTACGA
TGAAACGAAT ATCAAGCAAC TCCGTTATGG ACATTTGATG ATTATGGCGG ATCAGGATAA
TGATGGGTCA CACATCAAGG GTTTGATCAT TAATTTTATT CACAGTGAGT GACTGTTGCT
CTTCCAAAAT TTTGCGGGTG TCGTTTTCTC ACTCGAATTC CGTTTCTTGC TAGACTTTTG
GCCGAGCCTG CTCGATATTC CTGGATTCCT CCAACAGTTT ATCACACCGA TTGTCAAGGT
ATCGAAAGGC CAAAAATCCC AGTCGTTTTT TAATTTACCA GAGTACGAGA ATTGGCTGGA
ATCAACGGGA AAAAACGGAC ACGGATGGAA AATCAAGTAC TACAAAGGGT TGGGAACTTC
AACGAGCGCT GAGGCAAAGG AGTACTTTTC CAATTTAGAC CTTCACGAAG TTCACTTCGG
GATGCTCTCT AACGACAAGA TTGAGGTAGC TATCGATGAC GATCTGCAGC AGGTCCTTCC
AGACACAGTG CAGTCTGGCA ATGACCTCAT CGACATGGTT TTTCGCAAAA ATCGTGTGGA
AGATCGCAAA CAATGGTTGA ACGCCATTGC TAAGGATACG TTCCTCAACT ATTCGGAAGT
ATCCAAGGAA GGGGTAATGT ATTCAGAATT TATCAACCGT GAATACATTT TGTTTTCAAA
AAGTGATAAC GAACGCTCTA TACCTCATCT ATTGGATGGC TTTAAGCCAT CTCAGCGCAA
AATCCTCTTC GCATGCTTTA AAAGAAAGCT GAAAGGCGAG ATAAAAGTTG CCCAGCTCAC
CGGCTATGTC GCCGAGCATT CGGCGTATCA TCACGGTGAA GCGTCGCTCC AAGCCACAAT
AGTGAACATG GCTCAGAACT TTTGTGGTTC TAACAACATC AATCTTCTCA CGCCGTCTGG
TCAATTCGGT ACGCGTCGAA TGGGTGGCAA GGATGCTGCC TCGGCTCGAT ACATTTTCAC
CAAACTCGAG CCAATCACAA GAGCCATCTT TCATCCGGAC GACGACGAGC TCTTAAGCTA
CATAAATGAC GACGGTGTGA CCGTCGAGCC AGAGTATTAT GTACCCGTCA TCCCCATGAT
TCTCGTCAAC GGAGCTGATG GAATTGGTAC GGGCTGGTCC ACGTCAGTCA ATAACTATAA
TCCAAGGGAA ATTGTACGCA ACCTGCGCCG TAAGATTGCT GGTGAAGATT TTGTCGCAAT
GGCGCCCTTT TACAGTGGCT TCAAAGGCGA GGTATGATGC TTAATGACCA GGAGAATGAT
ATGAGGGTAG TACTGATTGT TTCTCACATC TCGCTAAACC TTGCTCATCT CAGATTATTC
CTGTTGATTC CAACTCCCGC CGATCTGGAT CGTACGATAT GCTTGGCAAA GTCGAACGCA
TCAATGACAC AACGATCATT ATCTCTGAAT TACCCGTTCG GAAATGGACT CAAGACTACA
AAGCCTTCCT TGAGATCATG TTGACTGGCG ACGGGAAAAA GAAACTACCA GAGATCAAGG
ATTTTACTGA GAATCACACT GAGACAACTG TCTCATTCAC CATCATTGCC GAGAAAGAGA
AAATCGACGA ATTTGAGAAA GAGAAAGCCG GCTTGATGGG TAAATTTAAG TTGACTGGGT
CGCTCTCTAC TTCGAACATG ACGCTTTTCG ATGAAAGAGG AAGAATCACA AGATTCGAAG
ATCCTGAATC GATTATGAAT GCTTTCTACG ATATCCGTCT AGACTTCTAC GATAAACGAA
AGAGGTTGCT GGTCAAAAAG CTGAAAGAGG AACAACGAAA ACTCTCCAAC AAAGCCAGAT
TCGTGGAGGA AGTGTGTCGT GTGGAACTTG TTGTCAACAA TCGTAAAAGA CAGGACATTC
TACACGAGCT TCGAAATCGG GGCTATGAGA CTTTTGGGGC AGATGCACGT TCGAAAGAGA
CTAGCGACAG CGATGGCGAG GAGGATTCTA TCAATGAGAG CCAGTCCGAC GCTGAACTCG
CTCGAGGGTA TGAGTACCTC CTTGGAATGA AAATTTGGTC GCTGACCTTC GAGAAAGCTG
AGGAGCTGCG ACGTAAGCTT GGTGAAAAAA CTACGGAACT CAACGCGTTG CAGGGTACTT
CACCTTCTGA GTTATGGTTG AACGACCTGG ATGA
 
Protein sequence
KKKTVEETYQ KMSQLEHILI RPDTYIGSVE PHTKLMFILE DDKIIEKEIT YTPGLFKIFD 
EIVVNAADNK QRDPNMDKLD VTINAEENVI SIQNNGKGIP IVMHKEHGIY VPELILGHLL
TGSNFDDGER KTTGGRNGYG AKLANVFSTE FVVECADTEN GLKYRQVFRE NMKQKDKPVV
KALTAKEMKA ADYVKITFSP DLERFKMDRL DEDTVGLLSK RAYDIAGTMS QSSGKKLLVS
LNGKKLPIKS FKDFIGVFEG INKPSAFETV RLANSDRWEV GVAASDDAAM KQVSFVNSIS
TSKGGTHVQY IADQVASHLA KAITKKNKKG GEVKASFIKN HLAIFVNCLI ENPAFDSQTK
EFMTSKPKDF GSTCKLSDSF LKKLEKSAIV ENVLAFSQFR DRQALKNKSG KKKAKLTGID
KLDDANFAGT AKSKDCTLII TEGDSAKSLA MAGLSVIGRD YYGVFPLRGK PLNVRDAPIK
AVTSNEEIKN VVEIMGLKFQ TVYDETNIKQ LRYGHLMIMA DQDNDGSHIK GLIINFIHNF
WPSLLDIPGF LQQFITPIVK VSKGQKSQSF FNLPEYENWL ESTGKNGHGW KIKYYKGLGT
STSAEAKEYF SNLDLHEVLP DTVQSGNDLI DMVFRKNRVE DRKQWLNAIA KDTFLNYSEV
SKEGVMYSEF INREYILFSK SDNERSIPHL LDGFKPSQRK ILFACFKRKL KGEIKVAQLT
GYVAEHSAYH HGEASLQATI VNMAQNFCGS NNINLLTPSG QFGTRRMGGK DAASARYIFT
KLEPITRAIF HPDDDELLSY INDDGVTVEP EYYVPVIPMI LVNGADGIGT GWSTSVNNYN
PREIVRNLRR KIAGEDFVAM APFYSGFKGE IIPVDSNSRR SGSYDMLGKV ERINDTTIII
SELPVRKWTQ DYKAFLEIML TGDGKKKLPE IKDFTENHTE TTVSFTIIAE KEKIDEFEKE
KAGLMGKFKL TGSLSTSNMT LFDERGRITR FEDPESIMNA FYDIRLDFYD KRKRLLVKKL
KEEQRKLSNK ARFVEEVCRV ELVVNNRKRQ DILHELRNRG YETFGADARS KETSDSDGEE
DSINESQSDA ELARGYEYLL GMKIWSLTFE KAEELRRKLG EKTTELNALQ GTSPSELWLN
DLD