Gene PHATRDRAFT_45879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45879 
Symbol 
ID7201117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp559373 
End bp562495 
Gene Length3123 bp 
Protein Length1040 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180260 
Protein GI219118987 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAATC TACAGTTGCA CCATCGTCCT TCTGCTAGAA TTGCGATGCA GAAGCTTTTG 
TTCAAATCGT CGCATCCCCG CGTTCCTTCT CGTTCAGTGG TACAAATACT ACGTCACCGC
ACGTTTTCAA CCAAAGCTGT CTTTCCAAAT ACTTTGACCA CTTCGCGGAT ACAAGGAGAT
CCAGACAATG GTCGGCAAGA TTCTGTGATG GGAGAGAGGG TTGCCTCCCA AAAATTTGGC
GGCATGGTGC ACTTCGTTCA AAAGCTCCCG ACAATTTCGT CACCGACACA AACTGGGCTT
CGCTCGGCAT TTCTGTATTG GCTCAGTAAA CCAACTCGTC GAGTAGCGTC GAAGCGAAAC
TTGCCGGAAC TCCCCTTAAA TTTTGCTCAG AAAATTCTGG ACTACTGTGT TGACCAAAAC
GACCCAGCCC TTATCGAGTA CGTCGTCGAC AATCCGGGCG TGTCTTGCAA CGGCGGCTTT
GACCGACTGA TATATCTCTA TCTCGAGCCA TTGCAAGGGA CAGGAGCTCA ATTGCGCGGT
CACAAATATC GAAAGAGCTT GGAGCGGTCA CCCGAACAAA CAATTGATAT GCTTGCCAAG
GCGTCGGCTG TGCGGGCTCT CGCGGATAGA CTGCACCGCG ACCCACGGTA TCCCAACATA
GTTCCAGATT TGACTACGTG CGAATCGACG CTGTACCTGT GGAGCAAACG TTCCCAGTTT
CTTGCAAACA ACAATCCATC GTGGCGTGAT AGTGTGGCCC AAAGTAAGGC GGATTTGGCT
TTTGGCGGAA AGTCGAATTC ACAGGAATGT ATCGATGCCA TGAAGGAGTT TGTTTTCCAA
AGAAAACAAA GTATTCACGG ACCGCAGCCA GACACTGTTA TGTATTCTAT CTTGCTAACG
GCAATTTCTC AAGGAAAGCA CTTTGATGCA GCGGATGAAG CCTTCTCGCT ACTGAAAGAA
TTGGAACTCG ACGATTCTGT CAAAAAGACC ATTCATTTGT ACACGGCAGT CCTTTTGGCT
TACCGCTCTG AAGTCACACA TTCGACACGG GCACAAGAAA AGGCTATCGA ATTGTGGAAT
CATATGATAA GTATGGACGA CCCAACAATA TCGCGGAATC CAATAGCCGC CGGCATTATG
ATGTCCATGT TTGCGAAAGT TGGAAAAGCT GAGGAAGCTC AAAAGCTCCT GGACGAAATG
GAAGCATCTG CTAAGGAAAA GAGCGAATAT CCGACGCGCA TTCACTATAA TACGCTTCTT
CATGCTCTTA CGAAGGCACA ACTAGACGAC GCTACAATAA GGGCCGAAAA AATACTTCAA
CGTATGGAAT CTCTGGCCAT TAACAACCTT CGCGATACAT TCCCGGACCG CATCAGTTAT
ACATCGGCTC TAAATGTTTT TCTCCAAAAC GGAGGAACCG ACTGCATTGA AAAGGCTGAA
GCAGTTCTTG ATAATTTGGA AGAAAGCAGC GGCAGAAATC TTCGTCCTGA TAAAATGACG
TACAGTAGAT TTATGCAATC TTTGTCAATG CGGCGTGGGC GAGAAGTTGA TACTCAAATT
AAGGAAAGCT TCTGCATCAA GATCGAGGAA GTCCTCCAAC GCTGGCGTCG TCGCTCAGAG
TCTGATGTGA CGGTCAAGCC TCCAGATCTG GAGGCGTACA AACTCTGCCT TCACGCGTGG
GCAACTTCTT ACAGCACCCT CTCGCCGGAA CGGGGAATGC TTCTCTTTAA TGAGATTGAA
TCTCGTTACC AAGCCGGTCA AAGAAATTTA CGCCCGGATG TTTACACTTT CGAATCTGTG
CTGCACTGTC TGTCTATAAA GGTAGATGAA GCGTCCGTAC GTCTTTCCGA ATCGCTCCTT
CGGAAATTAG ATGAGTACAA TATTTCCCAA ACCGGTTGCA TGGTTAAGTA CTACATCGGT
TTGGTTGCTA GGCAAAATGT CCAAAAAGCG GCTACGATCC TGAATGAATT GGAAGACAAC
TTTGCCTCTG GCGCAAGTTC ACTTCGACCT AACGAGCAAA TATACAATGC TGTAATTCGA
GGATTTTGCG TGCGTGAAGA CGGAGCACTT GAAGCCCAAC GCCTACTGGA CAGGATGAAG
CGACTGGCTC TTCTACCTGG TAGGACTGAC TTGACTCCAT CAGCAGTTGT CTACTCTTCA
CTCATTGAAG CATGGGCAAA ATCGGGACGA AAAGATGCAG TTGACCATAT TGAAGCTTTA
TTTGCCGAAG TGTGCGATCG AATGATTCCA AACCATTTTG TTTATGCGAC ATACCAGAAC
GCCATCAGCC GCTCAAATCT CCCCGATGCT CCTGAGCGCG TTGAAGCGAT ACTCACAAAA
ATGCAGGAGG ACTACGAACA AGGGCGCAAC AAGCTAGGCC GTCCAGACGC AAACAACTTT
GCAGCTGTGA TTTCCTGCTG GAGTTTTAGT CGACATAAAG AGGCTGCCGA ACGTGCTGAG
GCAATCTTAA ACAGAATGGA AAGCCTCTAC CTGCACAGCT TAAAATATGC TCACCTAAGA
CCAACAGCGC GATGCTTCAA AGGCGCAATT GCGGCGTGGG CGATGAGTGG TCATCCAGAT
GGAGGAAAAC GGGCTCTGGT ATTACTGGAT CGGATGAGTA TCGCTAGTCG AGGCCAAAAC
ATTGTCCACT TACGGCCATC TCGGGCCTGT TACGATTATT GTATTGTAGC TATCGGTCGA
TCGAAAGATT CTAATAGGGC AAGGAAGTCT TTGGATCTTT TGAAACGCAT GCAGCGAGAC
GTACGGGAAG GATATCGACA CTCACAGCCG GGGATTTCGA CGATGGAAAA CATACTAGAA
GTATGTAATA CGTACGCGCA TGCTCTGGCG AACGAACGAG AGGAAGCCCT CGAAGTGGCT
GAGAAAGCAA TAGATCTATT CGCAGAAGCA GACGGGGAAG TCCGAGATAT GGTCAATGTT
TATACACGAT ATGTTTGGGT GTTGAGGCAA CTCGTGCAGA CGTGTGAAAA ACGCGACGAA
GTTGTCCATA ACGTAAGAAA GAAATGGCCC GAACACATTC TGAGCGCGTC GGATGTTAAT
AAGGCTCTCC ACAACTTTGA AACTTCGGAA TTACCGACAG AGCTCAAATC CGTAGAAGAC
TGA
 
Protein sequence
MRNLQLHHRP SARIAMQKLL FKSSHPRVPS RSVVQILRHR TFSTKAVFPN TLTTSRIQGD 
PDNGRQDSVM GERVASQKFG GMVHFVQKLP TISSPTQTGL RSAFLYWLSK PTRRVASKRN
LPELPLNFAQ KILDYCVDQN DPALIEYVVD NPGVSCNGGF DRLIYLYLEP LQGTGAQLRG
HKYRKSLERS PEQTIDMLAK ASAVRALADR LHRDPRYPNI VPDLTTCEST LYLWSKRSQF
LANNNPSWRD SVAQSKADLA FGGKSNSQEC IDAMKEFVFQ RKQSIHGPQP DTVMYSILLT
AISQGKHFDA ADEAFSLLKE LELDDSVKKT IHLYTAVLLA YRSEVTHSTR AQEKAIELWN
HMISMDDPTI SRNPIAAGIM MSMFAKVGKA EEAQKLLDEM EASAKEKSEY PTRIHYNTLL
HALTKAQLDD ATIRAEKILQ RMESLAINNL RDTFPDRISY TSALNVFLQN GGTDCIEKAE
AVLDNLEESS GRNLRPDKMT YSRFMQSLSM RRGREVDTQI KESFCIKIEE VLQRWRRRSE
SDVTVKPPDL EAYKLCLHAW ATSYSTLSPE RGMLLFNEIE SRYQAGQRNL RPDVYTFESV
LHCLSIKVDE ASVRLSESLL RKLDEYNISQ TGCMVKYYIG LVARQNVQKA ATILNELEDN
FASGASSLRP NEQIYNAVIR GFCVREDGAL EAQRLLDRMK RLALLPGRTD LTPSAVVYSS
LIEAWAKSGR KDAVDHIEAL FAEVCDRMIP NHFVYATYQN AISRSNLPDA PERVEAILTK
MQEDYEQGRN KLGRPDANNF AAVISCWSFS RHKEAAERAE AILNRMESLY LHSLKYAHLR
PTARCFKGAI AAWAMSGHPD GGKRALVLLD RMSIASRGQN IVHLRPSRAC YDYCIVAIGR
SKDSNRARKS LDLLKRMQRD VREGYRHSQP GISTMENILE VCNTYAHALA NEREEALEVA
EKAIDLFAEA DGEVRDMVNV YTRYVWVLRQ LVQTCEKRDE VVHNVRKKWP EHILSASDVN
KALHNFETSE LPTELKSVED