Gene PHATRDRAFT_37484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37484 
Symbol 
ID7202482 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp249194 
End bp252136 
Gene Length2943 bp 
Protein Length929 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181687 
Protein GI219122717 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGCGG ACGAGCTCCA AAAGAATGCC GCCCAACAAA GGGCTGCCGA AAACCGCCAA 
CGGCGCTTGA AAATCAAACA GCAACAAGCG AAAGCTTCGT TCGCATCCAC TTCACCGACA
AATGCTTTCA AAGCCAAAGA AACAAGTACA GCGACGTCCG TTACTATTAC GGATGCGTCG
ACGCTACCGG TTAAGACGGA AACAAACCAG ACAATCCCCT TTTCCTCCGG GACTTCGATT
TCTGCCGTGA CCAAGGCATC GCAGGAACGC GCCCGACGAG CTAACCAAGT GAGAGAACAC
CAGCGTGCCG TCCAAATTCA GAGCAGGGCT CGGGGATGCA TCACTCGCGA TCGAATGCAA
CGAAATATTC GTGCCGATCT CGTCAGTAAA ATGTACGACT TGAGGTCCGT CCGCGATCTA
CTTTCGCGGT CCCAGAACAT CACTACGTAT ATCCCGCCTC CGGCTACAAC AACGACACTA
GTCCGCAGCC TCTGGTTCAT CACATCCCGT CGTGAAAGTC AGGGCCATAC TTCTGTTCCG
TGGGGAAGTA GGCGAGTGAT CGTGTGGAAC GATCGCCAAG ATGTCATCTT GCTGTCGCAG
GTTTTACAGT ATGCCGTAAC TCCGGGCTTG CAAAGCCCGG ACGAAAACGT CAACCCTTTT
GCATGCTGGA ATTCTTCGGA AGAGGGAAAT TACCGGATGA AGTTTGTGAT GAGGATCATT
CTAGTGGCCC TGGTTGATCC AAGTGTGGAG CCGTTGGTTG GGAATGACGT GTTTGATGCC
TGTCAACAGT GTTTGAAGGC TGTCATGGCC GTACCGCGAT CGATTTCTTC TCTGCAGGCT
TTGTCAGATT GCCGCGTTAC AGTTTTTCGA ACATCTTGGG ATTGGTTGTT GCCAAACCAA
ACGGAGGCAG GGCCCCGAAC ACGGCTTACC ACTACGACCA CAGCCTCGCA ACTTCATCAA
CCTTGTGCAT ATTTCTCATC CCCCTTGGAT ATGCTGTCAA TTTGGCGACA CTATTTACTC
TTTACAGTTG CCGGTCCCAA ACCTATTCCA CCCATGCTGG ACTTGAATCG AGAAACCTGC
GTGTCTCACT TACGTAAACA ACGTATAGGA GTGTGGATCC GTAACGTTTG TGATGCAGTG
GAACAAGCGA GTGATCACGA TGCAAGGGGT GACTTTTTGA TGATTCGTCT CGTTCGCGAG
ATTTATACTA TTCCGCTCTT GACGTGGAAA GTATCAAATG AGCTCTTGGC ATACTGGGTC
ACATCGTCGG GGACAAAAGG TTGTCCATTC GTGTCCGCCT TGCGACTTTT TGGCGATAAA
GGCGACGGCT TGCTTCAAAA TGACGGCGTC GAAAATTTGT TTCCGTTCGA CGATGTGCCC
ATGACGGTAT GTCCTGCCAC GCCGACACAA TGCTTCTTAG CCAATGTGGT CCAGCTAGGC
CTTCTATGTC CTACGCTGAA TGGTACTGAC AGTCTTAAGC TGGACTTTGG AGCTGCCGTG
GTGTTTTTTA ATCTCATTAC AGTTTTGGTA CAAGCCATTC CTTTAGCTAC ATTCTCCTCC
CGTGACTCGG CCGTTGTCTG GGTTGACGGA GTAAATGGTC ATACGATCCC AATTGTCTTG
TCGAAGGTCA TTCAAGACCA GTGCAGGGCT ATGTATGGTG GATTCCTTTG TCCGTCGCAT
CTTTCAAATC GCCCTGGATC CGCAAGTACT TGGTACAGAA AACATCTTGT CGACCAAGAG
CGATAAGGAT TTAAAACACG AGAAGGACAT GCTGGAGGTG GGAAGCTCGT CGGCAGCAAG
TTTGGCAGCC AAAGAAGCAC GTGTGGACCG CAATAAGAGC TTTTGGAACT CTTCAAAATG
GGCGCGGAAG CTAACAAAGG GCATGTCAAG CCTGCTGGTC GGTGAGGACG CCAAAAAGCG
AGCGGCCAAG AATTTGAAAC CGTCGTCTTT AAGGAATCAG TCTACGGTTT CGCGCAACCT
AGCGGAAGGG GTGGAAGGTG ATTGTAGCGA TATCGGGACC ATTTTCTCAA CTAACATTGT
CCCACGATCC GACTATACAG TGACATTTTT GTTTTGCTTG TGTCGCTGCT TCTCTGTTGT
CGTGGCACGA TGGGGTGGCG CTGGGAACGA TGATATGCTT CTTAGTCAGA ACAGGGAATC
CGCCAAGGGC GAATTCCCCA AGGCTTGCAT AAGAGCGGAA CCATTCGTTG TTATTATTTT
GAACGCCCTG TGCTTTTCTA CGCCATTTGT GCAATGTGCG TGGGGAATCA TGCAGTCAGA
CCGTCGGATT GCATCCCAAA TTCACAGCAT TGTCGAGATG GATGAGGGCA AATCTTTCAT
ACGGTGTTTG GACATGCAGA CTGGCCTTAC TGGGATATCT AGTGGAATTA GCGACTTAGA
CGGGGCAGCG TTGCTGTTTA TGTTTGCTGT AGTGTTGTCA CACACCTTGA TTATTACAGA
CGACGTTGAA ATCCACGACA TGGACCGCCC TTTGCCCAAA CATCAACTAA GACGTGTCAT
TCAGCTTCTC AAAAAGCTCT TGTATCGGGC TTGCTGCATC GATTCGACAA GTCTTTCTGT
TCATTCTAAT TACTTTGGAG TAGCTCTGAT ATCGGCCTCG TCAAGGGCCA TGCGTGATTT
GTATGATCGT TCAAGTCGGA GGCCAATCTG TGTACCCAAG CTATGGCTGC TGCCAAACCT
GTTGGAAAAG GATCTCTCGA AATGCAACTG CCATGCCGAA TATGTAGCGT TGCTCTCGAC
ACCCGTGTTG CGTATGTGTC CGTTTCTTGT TTCTTTTAAG CGAAGACTTA AACTCTTTGA
ACGAATCGTG ACTACGAATC GAGTAGAAAT TCAAGGAGAG GTAAGTTACT GACTGTGAAT
ATCTCTTTAC CTTGCAAGCA GAAATGTCTG ACAACGGTAA GGGCCTTCAA TATACTGTTT
TAG
 
Protein sequence
MFADELQKNA AQQRAAENRQ RRLKIKQQQA KASFASTSPT NAFKAKETST ATSVTITDAS 
TLPVKTETNQ TIPFSSGTSI SAVTKASQER ARRANQVREH QRAVQIQSRA RGCITRDRMQ
RNIRADLVSK MYDLRSVRDL LSRSQNITTY IPPPATTTTL VRSLWFITSR RESQGHTSVP
WGSRRVIVWN DRQDVILLSQ VLQYAVTPGL QSPDENVNPF ACWNSSEEGN YRMKFVMRII
LVALVDPSVE PLVGNDVFDA CQQCLKAVMA VPRSISSLQA LSDCRVTVFR TSWDWLLPNQ
TEAGPRTRLT TTTTASQLHQ PCAYFSSPLD MLSIWRHYLL FTVAGPKPIP PMLDLNRETC
VSHLRKQRIG VWIRNVCDAV EQASDHDARG DFLMIRLVRE IYTIPLLTWK VSNELLAYWV
TSSGTKGCPF VSALRLFGDK GDGLLQNDGV ENLFPFDDVP MTVCPATPTQ CFLANVVQLG
LLCPTLNGTD SLKLDFGAAV VFFNLITVLV QAIPLATFSS RDSAVVWVDG VNGHTIPIVL
SKSDKDLKHE KDMLEVGSSS AASLAAKEAR VDRNKSFWNS SKWARKLTKG MSSLLVGEDA
KKRAAKNLKP SSLRNQSTVS RNLAEGVEGD CSDIGTIFST NIVPRSDYTV TFLFCLCRCF
SVVVARWGGA GNDDMLLSQN RESAKGEFPK ACIRAEPFVV IILNALCFST PFVQCAWGIM
QSDRRIASQI HSIVEMDEGK SFIRCLDMQT GLTGISSGIS DLDGAALLFM FAVVLSHTLI
ITDDVEIHDM DRPLPKHQLR RVIQLLKKLL YRACCIDSTS LSVHSNYFGV ALISASSRAM
RDLYDRSSRR PICVPKLWLL PNLLEKDLSK CNCHAEYVAL LSTPVLRMCP FLVSFKRRLK
LFERIVTTNR VEIQGEKCLT TVRAFNILF