Gene PHATRDRAFT_44591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44591 
Symbol 
ID7197610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp987391 
End bp990639 
Gene Length3249 bp 
Protein Length972 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178620 
Protein GI219115649 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGTAAGGGC TTGCTTTCCT AAATTTTGGA GGCCATCACA ACCAAAACGA CTTTGTACAT 
CTCAGTCTCA CAGTCAATAT CCTCTTCGGC ATTATACCTT CTCCTAGGCT ATCAACGAGT
GGAAAAATAA AGTCCTCGGA AAGAATCTAT TCGCAAAAGG CTCCCAACCA TAGCTAAATC
GCCTTTGGAG TATTTGTTCC TGTGACGCAT CCTGTACAAG AAAGCAATCA TTCAAAACGG
TCCGTATCGA GTCCGAGGAA ACAAGCACCG TAGAACAATA GCAAGATTTT TCCCTCTCGC
CGTCGCCAGC GAAGGAAAAA ACTCTCCGTC ATGACGAGCG CGCGCGAGAG GTCCAGTAGT
GCTGGCGCTG GTGTCCCCCA CCAGCGTGAT CTACCAGTTT CTTTTCATCT CGCAGGGGAT
GCTCAAATTG ACGAGACTAT TAGTAACGGT CGTATGCGAG CAAATACGGC ACCATCGATA
GTTGATGAAG AAAGTGACAC AGTCCAGCAC CAGGAAAAAC AAGACATTCC TGAGATACCG
CCTTCAGGGA GTTCGGAAGA TGAAGAGCTG AAGCTGGCGC AAACAATGGC GCTTGCGATA
CAAAACAATC CCAATCTGAC TCGAGATGAA ATTCGTAAAT TGATTCAGAC AACTACAGGG
GGTACCAAGC CAGCTTCTTT GATGAGTACT AACAATGGAA GTGAGACTTC TGGTTCGGCC
GTCCTCCAGC CATCGTTGTT GATTCCGGGA AGGCTGATGT CGGGGTTTTT TCAATCGGAC
GAGGCCGAAA AAGAGGGAGA TAATGTAAGT TCTGGGAAAG ATGGTAAGAA AAAGTCGTTT
CGATTTGATG GAAGCTTGCG TGAACGCGCC AAAGGAAGGC TTTCAGAAAT TTGGACGTCG
AGCGGGGCCA ATAGTAATAG TATCCGTGGT AAAGAGGAAG GTGATATCAA GGACGCTCAG
GCTTCCGGTC ACTCTAGTCT TTCATCCACC ACGGGCAGAG ACAATTCGTC GCGGTCTTCT
TTTCGTAAAG AGAGGAATCA TGTACGATCT CCACCTGTCA GTCCCTTGAC TACGCCAATG
GTCATAGCTG ACGATAACAA ATTCAAGAGT ATCGTTTCCC CTGCGAGTGC TTCGATGACG
ACAAGTAAAT CCGTACCGAT ACGTATGATA GGAATTGCCT GGAAGCGGCG CGGCGGCATG
GGAAAATACT CTGCAAATGG GGCCTGGGAG CGTCGTCGTA TCGAGTTGCA AGGAACCAAA
CTGCTTTACT ATCGGACCGA CGGAGACACG ACTTATGAGC AAGTTTCACC TTCAGCATCG
CGTGACGATG CTTCCTTGAC CGCCTCCGCC GTTATGCCGT CTATGGATGA TAACAACAAT
CCAGAAGGTG TAGTCATTGC CCGTCGCCAG ACTTGGCTTG AATCTGCTAC TTCCAAGGCG
GCCTCGAGTT GGGTTGTCAA CGATCAAGAC CACACTACTC CTCGTGGACA CATTGATTTG
GCCAAGGAAA AGGCGACAGT GAATGCTGCC TTTGGGCATT CAAATGCACC TTCGCCCTTC
GCCATCTCGA TCAAAGTCCG CGCGGAAACA AAATGGAAAT TGTGTTTCGA CTACCATCGG
ACCCAAATGG AATGGTTAGC TGCGTTGACA GATGTAGTGG TTCAAGGCAG TGTAGACTCA
TACAATTCCA ACATCTTGAT AGCAGCCGAT CCCTCGAACC AGACAGAATC AGCTCTCTTC
CATCCGCCGC AGGTGAATCA TCCTCCCACG GACAGCAACG AACCAGCTGT GTCGCGGCGT
TTGTGGATGA TGGAGAGTTA CACAGTTTCT ACAACGCAAA ATGTAGATAA TGATCACTCG
GACAGTGAAA ATGAAGACGA TAGCGAATCT TCGTCCCTAG ACGAAAGAGA CGATGATGTG
GAGGAGTCTG TCACAATTTC CCGGGGAGCA TCGGATCCGA TGACCTTATC GGCATCACAC
ACAAGTATTG TTGAAGGAGC GAAGAAGGTA CTTTTCATTC CCGAACGATA TCTCGCTTTC
GTTTCAGGAA TCGTCAATGC ATCATTGGCC TATGCTCGAG CGTCTTCCAC TTCCACAGAA
AATTTTTGGT ACGCTGTTGT CGTCGTAAAT TTCTGCGTTT TGTGGTGTTT GGTGAAAGAA
CCCGATTGGA GGGGCATGGT TTCTCAGCCA GAAATGAGAA GCGCTCAACC TTCGCGTTGT
GACAAGCAGC AACGGCGTGA AAACACGAAG CGAAAAGCAG CAGCGATAGA GCCAAAGTCT
ACTGTAACAG GAAGATCAAC GGATCCATCT GCTTATATAC CCATGGCCGG ATCAACAACA
GTGAAGCTCA AGCACACAAC GGATCTTCCC GTAAACGATA AAAATGAGGT TTTTGCTGGA
TGGTGTGATC CGCCAGGAAA TATTTTGGCT GTTCGGTCCC ATGGATATAG TTTGACGAAG
AAGAAAATTC CGTCGCCTGG TAGTCTCTAC AACTGTGCAA GGGTGGATAT ATTTGAATCC
CCAAGTCGGT ATCCCGATAT GGCGCTTCGC GTCAAACTCC CATCGGTGGA CTTCAAAGAC
GATGACAGGC CGAAAACGTG GAGGACACCA GATGTTCTGA TTGTCAGTAT TGCTTTGCCA
ACGGACCCGC CCAAGCTTGG TCGGTCCAGC AGCGATGGAG GCGGATACAC AGTTACGTGT
TACTTCACAA TGACACAGGA AACACGCGAT ATTCTACGTC GTGTTACGGC AGATGACTAT
GATCCTTCAA AAGAGAACAT CGATGACATT CAAAAGAGTA CAGTCAACGC TGTTCGACTA
CTAGAAGAGT GGGTTCGAAG GGCTCCAACG GATCCGTCAT GGTTTTCTCG TTTCAAGGTT
GTCCCCAATG CTCACAACTT GAAGGAAATT GGCATGCCGG CATGGATATC AAAATACAAC
GGTAAGCCTT TTTTGATCAA ACGCCCTGGA ACAACTGGTT TTATTTTCGA ACATCCCGAG
CTGTCTTGTT TGGAGTTTGA CGTCTCGCTG CACCCTTTTC CGTATATTGC CAAGCAGGCG
ATATGTTTTA TGAAGGAAAG CTATTTCAAA AAGGTTCTTG TCAGCTTTGG TTTTGTTATT
GAAGGGAAAA GTGACGATCA ACTACCGGAG TGTGTCATTG GGTGCATGCA GCTTTGTTAT
CCTGACCCAG CTCACGCTAT CTCCGGGGCA TCGTTTTTCG ACGGCACTTC GAAACGATCT
TTCCAGTAG
 
Protein sequence
MTSARERSSS AGAGVPHQRD LPVSFHLAGD AQIDETISNG RMRANTAPSI VDEESDTVQH 
QEKQDIPEIP PSGSSEDEEL KLAQTMALAI QNNPNLTRDE IRKLIQTTTG GTKPASLMST
NNGSETSGSA VLQPSLLIPG RLMSGFFQSD EAEKEGDNVS SGKDGKKKSF RFDGSLRERA
KGRLSEIWTS SGANSNSIRG KEEGDIKDAQ ASGHSSLSST TGRDNSSRSS FRKERNHVRS
PPVSPLTTPM VIADDNKFKS IVSPASASMT TSKSVPIRMI GIAWKRRGGM GKYSANGAWE
RRRIELQGTK LLYYRTDGDT TYEQVSPSAS RDDASLTASA VMPSMDDNNN PEGVVIARRQ
TWLESATSKA ASSWVVNDQD HTTPRGHIDL AKEKATVNAA FGHSNAPSPF AISIKVRAET
KWKLCFDYHR TQMEWLAALT DVVVQGSVDS YNSNILIAAD PSNQTESALF HPPQVNHPPT
DSNEPAVSRR LWMMESYTVS TTQNVDNDHS DSENEDDSES SSLDERDDDV EESVTISRGA
SDPMTLSASH TSIVEGAKKV LFIPERYLAF VSGIVNASLA YARASSTSTE NFWYAVVVVN
FCVLWCLVKE PDWRGMVSQP EMRSAQPSRC DKQQRRENTK RKAAAIEPKS TVTGRSTDPS
AYIPMAGSTT VKLKHTTDLP VNDKNEVFAG WCDPPGNILA VRSHGYSLTK KKIPSPGSLY
NCARVDIFES PSRYPDMALR VKLPSVDFKD DDRPKTWRTP DVLIVSIALP TDPPKLGRSS
SDGGGYTVTC YFTMTQETRD ILRRVTADDY DPSKENIDDI QKSTVNAVRL LEEWVRRAPT
DPSWFSRFKV VPNAHNLKEI GMPAWISKYN GKPFLIKRPG TTGFIFEHPE LSCLEFDVSL
HPFPYIAKQA ICFMKESYFK KVLVSFGFVI EGKSDDQLPE CVIGCMQLCY PDPAHAISGA
SFFDGTSKRS FQ