Gene PHATRDRAFT_38931 details

Gene Information

Locus tag: PHATRDRAFT_38931
Symbol:
ID: 7203696
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011685
Strand:
Start bp: 579206
End bp: 582580
Gene Length: 3375 bp
Protein Length: 1031 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002182864
Protein GI: 219125179
COG category:
COG ID:
TIGRFAM ID:
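
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene span consists of coding sequence, one stop codon, and introns. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 579206
end_bp = 582580
protein_length_aa = 1031

# Assumption: 1-based inclusive coordinates, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein length, plus one stop codon.
coding_bp = protein_length_aa * 3 + 3

# Bases in the gene span not explained by coding sequence.
# Assumption: for a spliced gene these would be intronic.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)
```

Under these assumptions the computed span matches the listed gene length of 3375 bp, and the surplus over the coding bases is consistent with the record marking the gene as spliced.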


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAAGTT GGGAAATCGA GTCCGTGCAG GATTGTCTGT GCCGGACAAA TCGTTCCTTC 
GGTGGTCGTA GGGACCATAA AGAGCGACCT TCTTTTCTCA CGATTGGCTC ATACTGCTTT
AGCCCAATAT TCGTCGAACC TTTCCGGGGG ACGACAAAGA CAGTAGCGAA GAAGACGACT
CGTCTCCACA AGTGTTCCAC CGTAATGGTG GGGGCAAGCA GTTAGTCCGA CCCGGCGGCA
AGTATCTCCG ACCGCGGCAC GACGACGAAG ACGATGATGA AGAATCTTCC TCAGAGGAGG
AAGAAGACCC CCCGCAACAG CAGATTTTTA GTCGAGCCGG TGGTGGCAAG CAGCTGCGCC
CGTCGGGAGG AGGCGGAAAA TATTTGCGTC CATCGTAAGT AAGCCCCCGG TCCCGCGTTT
TGATCAAATA ACTGTATCTA CCTAGTTTTC TGTCTGTCTA ATTCTGTGAT TGTTTACAAC
AGTGGAGGTC AGGAAGACGA CGACGATGAG AGTGAAGATG ACGATGAAGA GCTGGACGAC
GACGAAGACG ACGAGGAAGA CCAGACTGCT CCACAATCCC AGACCGTACG CGTAATGCCG
CGTGGAGGGG GCAAAAACTT TATTCGAATG CACGCCCAGC AACAGCAAGC GGAAGACAGC
GAAAGCGAGA GCGAAGACGA CGAGGATGTG GTGGTGGTGG AAGAAGAAAC GCCAGCTTCT
GCGCCGGCCC CGGTTGTAGT GCCGGCGGTT CCGCAAGCCC GAGGCAAAAT GCTGTCCAGC
ATGAGACCTC CGCCACAAGA AGACGACGAT GACTCGGAAG AAGACGATGA CGAAGTAGAA
GTTCTGGAAA ATGTCCAACC AATTAAGCCG GTGCAGCGTG GCGGAGGGAA AGCAATTTAT
GGTGGAGCGG GTGGGGGAAA GCAGCTTCGC CAACCAGAAC CCGAACACGA GGAAGAAAAG
CCCAAGCCAA CAACCATTTC GCGATCTCCC GCCGGTGGCA AGCAATTACG TCGCCCTGAA
GCTGAGAAAG AAGAGGAAAC GCCGAAGCCA ACAAATATTT CTCGATCCCC GGCAGGTGGA
AAACAACTAC GTCGGGCGGA ATCAGAAGAG GATATTCCAA AGGTTGCTGC AATCATACGT
TCACCCGAAG GGGGAAAGCA ACTTCGAAGG CCTGAACCCG AACAGGAGAA AAAGGAGCCG
TCTGAGGCAT CGAAAATTTC CCGCTCGCCG GCTGGGGGTA AACAGCTCCG ACGTCCTATT
CCGAAGGAAG ACACTGAAGA CGACGACGAT GAAGATGATG AAATCTTTGG CGACGATGAC
GATGATGAGG AGTCAGACTA CGAAGAAGCA GTCGCGGCGA AAAGATCGAC TGCAAACGGT
CGACCCAAAC GCAGAACTGC GGCAAAGAAG GTCATTGTTG AGGAAGACGA GGAAATCTTT
GGTGACGACG ATGTCAGTAG CGACGAAGAA CGGGAACTCG ACATTAATAA TACCGAAGCT
TTGATCCGCG ACGAAGACGA TCGCAAATAT CTAGATACCC TGCCAGAGCT CGAACGTGAA
GCAATATTGG GCGAGCGTTT TGAAAAACTC AAGAACGAAC AGGATTTGAA AAAAGCCATT
AAAGAGGCCA AGTAAGTAAA TTGTGGTGCT TTCAACGGCA CGGTGCATTG ACCTCCGCTC
ATTTTTGGGT TCTTTATTAA CAGACGCCAG GCAGACGAGA AATCGGGAAA TGTTCAATCT
ACAGCACAGC GTAAGGCAAC CCCGGGCAAG AAATCTGCCA AAAAAGGTAC GGCAGATGAC
GACCAAGCCC TCGCGAGAAA ACTCGCTGGA GCCAGTCGGC GAGAATCTAC TCGTGACAAG
GATGCGAAAG GAGCGAAGAG CAAAAAGGCC GCGGCTCTAG CGGCCTTGAA GAAAGAGCGC
AAAATACAGA AGCAGCAAGA TTCCGATGAC AGTGAAATGG ACTTCGGCGA CGATTCCGAT
GATGATTCCG ATGAAGATTA TGATGATGGG GGCTTTATGC CGTGGCAAAA GAAGGCCAAA
ACACCGAAAT CGACAGTTTC ACGTCTCGAC AAAGATGACG AGAAAATGGA TTCCGAAGAT
GACCGCGACG GCGCCGACGT ATCCAGGAGT AAGACGACTT CGGATCGGAG TGGAGGTTCG
GCGGTTGAAG CCACTTTGGA GGACTTCAAA AAAGTAACAG TTCCTCGCCG GCGCCTTGCA
CGGTGGTGCA ATGAACCTTT CTTCGAAGCA GCTATTTTGA ATTGCTTCGT ACGGGTGCTT
ATCGGAGAAG ACGAGAATGG CGACAAGGTC TATCGCTTGT GTGAAATCAC GGATGTCAAA
ACAGGGATGA AAGTGTATAA GTTTCCGATC GCTAAGAAAG GTGACAAGCC CATCATGACT
ACGAAGACTC TGACTCTGAA ATTTGGGAAA AACGAGAAAG AGTTTCCTAT GTCGTTGGTC
TCTGATGCGC CGCCGGACGA AGTGGACATG AAGAAGTACG TGACTGTGAT GAGAAACAAT
CGCCAAGAGC CTTTAACGAA GCGACAAGGA AACAAACTTC ATCGTCTCCA GCACGACTTG
GTTCACAACT ACGTATATAC TACTGAAGAC ATTGAGCGCA ATCTTCAACA ACGAAAGAAA
CAAGGAAAAA AGCTGGGAAA TTTTGGAGCG GAGCTGACCA AGGCTGCGAT CGCAGTTCAA
GCTGGCAAAG ATTTTGTCAA TGAAGCAGAG AAAAAATTGA ACGATGCCAA GCGAAGCTTG
ATGGAATCTG ACAGTAATGA TGCGTCTTTT GAGAAGAGCG TGAAGGATGC TGAACAGACT
CTTGAGCGTG CAAAGGCGAA CCTGGAGGAG ATTATACAAG ATGAAAGAAA GATGCTTGAT
GTCGTGGATA ATCGTAAGCG ATTGCTAAAC CAACGAGCGA AAGATCGAAA TTGGGCCAAA
GTTAACCTGC GTGCCGTCCA AGCAAATCAA AAAGCAGACC GGGAGGCTAA CAAGCCACTT
GACAGTGCAC TATCGGGTTC CGCCAAAAAA GATACGTTCA ACCCGTATGC TCGCCGTCGA
GTGAAACCGA AGATTCTCTG GGAGGTAGGG CAAGATGACG ACACGGAAGA AGCGAAAGTA
GGGGAAGTCG GGGGAAGCGA AGACGCTCCG AAAGAATACT CGAACATTCC GCCACCTAAT
CTCGTTCAAG AAACCGACGA CAACACCGCT GCCCTTAGCG AGAGCCATCA GTTTGCTATA
GATGAAGAAG GGCTTGCTCA GGCATCGGCT ACATCTATTC TGTTCGGATC AAATAGTTCA
ATGAAGCGAA AACGCAACCG AAGAGGTCTA AGCCTTTCGG ATTACATGGA GCAAAAAGCA
AGCGGGCGTT TATAG
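
The gene sequence is printed in blocks of 10 bases, which has to be undone before any computation. The following is a minimal sketch of how the block layout might be collapsed and the GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the sequence contains only A, C, G, and T.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and newlines alike.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTAAGTT GGGAAATCGA GTCCGTGCAG GATTGTCTGT GCCGGACAAA TCGTTCCTTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Pasting the whole sequence block into `clean_sequence` instead of the first line would allow the result to be compared against the Gene Length and GC content fields.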
 
Protein sequence
MPNIRRTFPG DDKDSSEEDD SSPQVFHRNG GGKQLVRPGG KYLRPRHDDE DDDEESSSEE 
EEDPPQQQIF SRAGGGKQLR PSGGGGKYLR PSGGQEDDDD ESEDDDEELD DDEDDEEDQT
APQSQTVRVM PRGGGKNFIR MHAQQQQAED SESESEDDED VVVVEEETPA SAPAPVVVPA
VPQARGKMLS SMRPPPQEDD DDSEEDDDEV EVLENVQPIK PVQRGGGKAI YGGAGGGKQL
RQPEPEHEEE KPKPTTISRS PAGGKQLRRP EAEKEEETPK PTNISRSPAG GKQLRRAESE
EDIPKVAAII RSPEGGKQLR RPEPEQEKKE PSEASKISRS PAGGKQLRRP IPKEDTEDDD
DEDDEIFGDD DDDEESDYEE AVAAKRSTAN GRPKRRTAAK KVIVEEDEEI FGDDDVSSDE
ERELDINNTE ALIRDEDDRK YLDTLPELER EAILGERFEK LKNEQDLKKA IKEAKRQADE
KSGNVQSTAQ RKATPGKKSA KKGTADDDQA LARKLAGASR RESTRDKDAK GAKSKKAAAL
AALKKERKIQ KQQDSDDSEM DFGDDSDDDS DEDYDDGGFM PWQKKAKTPK STVSRLDKDD
EKMDSEDDRD GADVSRSKTT SDRSGGSAVE ATLEDFKKVT VPRRRLARWC NEPFFEAAIL
NCFVRVLIGE DENGDKVYRL CEITDVKTGM KVYKFPIAKK GDKPIMTTKT LTLKFGKNEK
EFPMSLVSDA PPDEVDMKKY VTVMRNNRQE PLTKRQGNKL HRLQHDLVHN YVYTTEDIER
NLQQRKKQGK KLGNFGAELT KAAIAVQAGK DFVNEAEKKL NDAKRSLMES DSNDASFEKS
VKDAEQTLER AKANLEEIIQ DERKMLDVVD NRKRLLNQRA KDRNWAKVNL RAVQANQKAD
REANKPLDSA LSGSAKKDTF NPYARRRVKP KILWEVGQDD DTEEAKVGEV GGSEDAPKEY
SNIPPPNLVQ ETDDNTAALS ESHQFAIDEE GLAQASATSI LFGSNSSMKR KRNRRGLSLS
DYMEQKASGR L
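
The protein sequence uses the same 10-character block layout. As a rough sketch, residues can be tallied with the standard library to check the length or inspect composition. The helper name `residue_counts` is hypothetical, and only the final line of the record is used here for brevity.

```python
from collections import Counter


def residue_counts(text: str) -> Counter:
    """Count each amino acid letter in a block-formatted protein sequence."""
    # Remove all whitespace so block separators are not counted.
    return Counter("".join(text.split()).upper())


# Last line of the protein sequence, copied from the record.
last_line = "DYMEQKASGR L"
counts = residue_counts(last_line)
total = sum(counts.values())
print(total, counts.most_common(3))
```

Running the same function over the full protein block gives a total that can be compared with the Protein Length field.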