Gene PHATRDRAFT_45251 details

Gene Information

Locus tag: PHATRDRAFT_45251
Symbol:
ID: 7200121
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011674
Strand:
Start bp: 612316
End bp: 614525
Gene Length: 2210 bp
Protein Length: 656 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002179252
Protein GI: 219116915
COG category:
COG ID:
TIGRFAM ID:
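
The reported gene length can be checked against the Start bp and End bp values above. The sketch below assumes both coordinates are 1-based and inclusive; the record does not state its coordinate convention, and the function name is purely illustrative.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are 1-based, inclusive positions (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(612316, 614525)
print(gene_length)  # 2210, matching the reported gene length
```

Under that assumption the coordinates agree with the listed 2210 bp. The 656 aa protein accounts for 656 × 3 = 1968 coding nucleotides, which leaves room for non-coding sequence, as would be expected for a gene marked as spliced.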


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GGTCTACGAG TGGTACGGCA ACTCACAATC AATAACGTCT CAGGACCCAC CCTTTCATCA 
CCGATAGCCA CAATTGGCTG CGACCGCTAC ACACTGCTTC TCCTCTAAGC ACTACTGCAA
AGATGCAAAA GGCGATAGTC GACCCCACAG AACGAGAGTT AGGTATCTTG ACATACCAGT
CACCGGCGTT AACGGGATTT ACGGCGGTTC TCAAGGCCCG CTATTCCGAC TTTGTCGTTC
ACGAAGGTAA AATATATACG TGGTTGTTGT CTCAAATACC GTGTTTCTAG CACCCTAGCG
CTGACTCTCG TCCTTTTAAT CTTGCGTTTG CTCAGTCGGA ATCGATGGAC GGATTGCGAA
GTTGGAATCT GTGGAGCCCA AAGCCAGCAA CGGTGTCGTA AAAGAAGAAA AATCCAGGCC
GGAAGAACCT GAGATAAAAA AGCGTAAGCT GAACGATGAC GGTAGCCCAA ATTGGGGCGA
ACTTGAGTCT GAATTAAGAG AATTTGTTGG CGAGCAGGGG GCCACCGAGA CTATTGAACT
GCTCCAAAGA GCTAACCCCT CGGAAGAGAA AATGTTTGTC AAACTACCGC CTTGCGCCGA
AAAGGAAACC CGTCGGAACT TGCATCAATG GATTCGGTCT CGTCTGAACT TTTGCGCACG
GGCCGATACA ATCGACGAAA CGAATTCGGA TGGCAATGCA GTCACTAGAG GGCTTATTCG
CATTTGGCAC AAATCCTTTG AGAGGAAAAT GCCAAACTAC GGAAAGTTTG AGCGGAATCA
GCAAAACCGA ATGCCCCGGG AAAAGGCTCC TTCGGATAAG CAATACCTAC AGTTTGTGCT
ATATAAGGAA AACATGGATA CAGGAGCAGC CGTCGGGCAA ATTCAGCAAT TCGTGCAACC
ACCAGGAGGT GGTCGGGGCC GCGGTCGAGG AGGAAGAGGT AGCTTTAAAC ATGGTGGTCC
TAAGCTACGA ATGGGATATG CTGGAATGAA AGACAAACGT GGAATCACGA GTCAATACAT
CACAGTACCG GCGTCAACAT CTCTCCACGC GCTCGCCGGT TTAAATCGCC CCGGGCAAGG
GGGCGGCCAT ACGCGCAACG GAGGTGTCGG TATCATAAGG GTGGGTAACT TTGAGTACGT
TGCCAATGAA CTGAGATTGG GTCGGTTGAA AGGAAACCGA TTTGATATTG CTCTACGAAA
CATAGAATTA GGGTCCGAAA CGAAAAATAT TGGATCTTCT TTGGAAAGTG CTGCCCAAGC
GCTAAAGGAC AATGGATTTA TAAACTACTT TGGTGTGCAG CGATTTGGCA AGTATCACGA
TACACATTTG ACTGGTATCG CCATTTTGAA AGGCGACTAT GAAGGTGCGA TTGACATCAT
CATGTCACCG AAGCCTGACG AGAGGCCAAA TATTGCCGAG GCACGAGCCA TGTGGAAAGA
TCGGTTCTCT CACAATAGCG ATAGGGCAAG TGCCGAAAAA GAATGCGCGC AGAAGTTATC
CCGACAGTTC AATAGATTCA TGCACAGCGA AATGGCCATT GTGAATAGCC TAGCTCGCGA
TCCTTTGAAC TATGAGAAAG CTTTTTCTTG TATTAACAAA ACAATGCGCA TGATGTTTAT
CCATGCGGTG CAAAGCTATT TGTGGAATCA CGTCGTCTCA AATCGTATTG AAACGTTTGG
GGGCAAAGTG TTGGAAGGAG ATTTGGTGCT GGCGGACCCC AACAGTATTG ATGGGGAAGG
CATTCCTGAA ATTTTGGTGG TCTCAGGAGA GGATATTTCC GCAAATAAAT ACACGCTTAC
GGACGTTGTG GTACCGCTGA TCGGATCAAA AACGCGCGAT CCGGCCAACG CGTCAGCGGA
TCTTTTCGAT ACAATCCTAC TAGACAAGGG ACTTACTCGC GAAATGATGG TGAAAATGGA
AGACCGTGAT TTTAATAGCG CGGGAGACTA CCGTAAGATG ATCTGTCGTC CTGCGGATGT
GGATTTTCAA GTGCTCGAGT ACACCGATCC TCATCAGCCG TTACTACAAA CCGATTTAAT
GATGCTAGAT GGTATTGAGA TCAAGGCCTC GCTGACCTCC GAAAAGGCAT CTGCTACTGC
ACTGCTGGCC ATGATTATTC GATTTACTCT ACCACCGTCC GCGTATGCTA CTATAGCACT
CCGTGAGCTA ATGAAGCGAC CCACATCTAG CGAATATCAA AGCGAATTGA
 
Protein sequence
MQKAIVDPTE RELGILTYQS PALTGFTAVL KARYSDFVVH EVGIDGRIAK LESVEPKASN 
GVVKEEKSRP EEPEIKKRKL NDDGSPNWGE LESELREFVG EQGATETIEL LQRANPSEEK
MFVKLPPCAE KETRRNLHQW IRSRLNFCAR ADTIDETNSD GNAVTRGLIR IWHKSFERKM
PNYGKFERNQ QNRMPREKAP SDKQYLQFVL YKENMDTGAA VGQIQQFVQP PGGGRGRGRG
GRGSFKHGGP KLRMGYAGMK DKRGITSQYI TVPASTSLHA LAGLNRPGQG GGHTRNGGVG
IIRVGNFEYV ANELRLGRLK GNRFDIALRN IELGSETKNI GSSLESAAQA LKDNGFINYF
GVQRFGKYHD THLTGIAILK GDYEGAIDII MSPKPDERPN IAEARAMWKD RFSHNSDRAS
AEKECAQKLS RQFNRFMHSE MAIVNSLARD PLNYEKAFSC INKTMRMMFI HAVQSYLWNH
VVSNRIETFG GKVLEGDLVL ADPNSIDGEG IPEILVVSGE DISANKYTLT DVVVPLIGSK
TRDPANASAD LFDTILLDKG LTREMMVKME DRDFNSAGDY RKMICRPADV DFQVLEYTDP
HQPLLQTDLM MLDGIEIKAS LTSEKASATA LLAMIIRFTL PPSAYATIAL PNIKAN
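
Both sequences above are printed in blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that step, together with a simple GC fraction calculation for the nucleotide sequence. The helper names are hypothetical, and the sample uses only the first two blocks of the gene sequence rather than the full record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases among all characters in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence, used only as a small sample.
sample = join_blocks("GGTCTACGAG TGGTACGGCA")
print(len(sample), gc_fraction(sample))
```

Applying the same two functions to the complete gene sequence gives a value that can be compared with the GC content listed in the record.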