Gene PHATRDRAFT_37844 details


Gene Information

Locus tag: PHATRDRAFT_37844
Symbol:
ID: 7202649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 200020
End bp: 203877
Gene Length: 3858 bp
Protein Length: 1285 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002182025
Protein GI: 219123424
COG category:
COG ID:
TIGRFAM ID:
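
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function name check_lengths is hypothetical.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinates against the reported gene and protein lengths.

    Assumes 1-based inclusive coordinates and an unspliced CDS whose final
    codon is a stop codon (both are assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    codons, remainder = divmod(span, 3)   # whole codons and leftover bases
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # one codon is assumed to be the stop codon
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields
report = check_lengths(200020, 203877, 3858, 1285)
print(report)
```

Under these assumptions the span is 3858 bp, which is 1286 codons, or 1285 residues plus one stop codon, in agreement with the reported lengths.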


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.378594
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTCGCTAT CTTCGACAGG TCAGTACTCT TATGCTGTAC GTACTGAGAA AGGAGTTCGC 
TATCTCAGTA ACGAGGATGA GATCAAAGAC TTGGTCCGAA AATCAATTCA GCGCGCGAAG
AAAACGTCCT GCTTACTGTC ACCGGAACGT GTAAGTCTTC CAGATAGTCC GGGGAAAAGG
AGTGTTCTTT CCTCACTAGA TAGAGTCGAA ACACAAAGGC GGGCATCAGA GTCTTGTCCA
CGTGAAAACC TGATGGCGCC GGGAATGAAT TCTGATGGAG ATAAATCAGA GACTGCTAGT
CGCATCCAAC CTCCATCATC CCCGCTATCA CCTTCGTACG TGGTAAAGAC CGAGTCTGGT
GACCAGATCA TGATGTCAAA GGAACAAATG AATCGATACG TTAGATCTTC CATTCAAAGA
GCCAAGGAAA GCGCCGCTAC GCAGCAGAAG AAGACAATGT CAAAAAAGAA TTCTGTCTGG
CCGAAAGTTC GATGTTCCAA CACGTCGGAC TCACCCAAAA CAACGAATCA CTTTTCCTTT
TCTGATTATG GAGAACTAGC TTTGGATGAT CGGAGTACAC CAGCATCTTG TCCTCCGATG
AAACCGGCGA AAATCATTCC CAAGAACTCG TGTTCACCAT TCACTGCCAA TGAAGCACTC
ATTTCACCAT TTACTAGCAG CAAATCATTT GTTAGCAAGA CTCAGCAGCT GGAGGTTTCT
CGTCCATTCT TTACTCCGCC CGATGACAGA CACAATAGAC AAGAGCATAG CGAAAAGTAC
GATAAAAGGC AACGCCATTC TCTGATGGAA GATGAATCCA GCAGCACGTT ATGGGTCAAT
CCTCTCCCTC GATTCTCGTG GTCTTCCTTA GACAATAGCC AGGAAGGGGA GGAAGAATGC
TCACTTGCTT CACCTATGAT CGAATCCAAG CTACCGATTG TTATTGGTGA ATCAAAGTCG
AATTTGCACC ATTCATGTGT CAACTCAGCT AAAGCAACAA AGAGTGGTAT AGAAATCGGC
GGCGAAAAGA AGTGCAGTGC ATTGAACAAT GCTTCAAATT CTCAAGATCC AATTTACCAA
GCAAATAACA GGATAATGAA TTCTGATATT GAGCGGAAAA TTTCAGCGAA GATAACCATA
ATAAAAGATG GCTTGTTTTC GGACGTCCTG TTGCAGGCAC AGGAGGGTGA TGTGTCGACA
GAAACCGCTT CTGCGAGCAG CCTGACGTCC AAAAGCTCGC GCAACAAGCT TATGAGAGAC
CGATACACTT TGAAAGTAGC TGAGGATCCT GAAGTTGCCA ACATGACACT CGAATCTACG
AACGGAAACT CCAGCCCAAT TCCATCGAAC CTTGCAAATG AGATGAAATT AAATTCGGCT
GGAAAGGCGG CGTCACCGAG TTCTCGTAGT CCAGATCCCA AAGCAATGCA GTCACCAAGT
TTGCCAAATT CTATCGAAGA TAAATCTAAG GCTGAGAGTG GCCGCGCATC AACATCACCT
CGAAAACTTC GACTCCAAGC GATGATGGCC GCTCGACGGG CAGCTCTTTG TAAATCTCCT
CCAAAAGAGA AAATCGATGT GAACACCAAG TCTGAAGAAT TCACTTTATG CGATAGCAAA
AATGCTATAC CTGGTTGTTC AACCTCTCCT ATTTACGTGA CGGAAGTCGA TCTTATCAAT
AGTCACACAC AATCCCGACA ACGCAGGAGC CAGATAAATG TTACAAAAAT TTCGACACAG
CCGGAAAGGT CCGCAACACA CAATCACTTG CCTGAGCAAC TGCCTACCCT CCTTTTACTG
AAAGATGAAA AGAATTCAAG TGAATACTTG ACCATGTCGA GTCACGCAAG TAAGGATGAA
CAGATAAAGG ATCCAGTAAT TGACCGTTCT TCTTATTCCA ATGAAATCAC GCCACCTACG
ACTTCGGATT TAGGTAAATA CCATCAGCTG GTACACACAT CATCTGATAA AAAGAAACAT
TCGACTCAGC CATGCGCTGA AAACGCAGAA TTTACAATGG AAGAGTGCGA TGTCGATCAC
TCGCCGCTTC CAATGAGAAT ACGGGAAGAC CTGGACTCAC CTTCGAACGG CAGTAAGGTG
GACAAGCCCG ATCAATCCGA CGAGAAAGAA GATCAGAAAA CCCTTCCCTC TATTCCTGTA
GATGAAATTG CGGCAACATC AACACAAATG ACGACTGAGA GCAGACCGGT CGATTGGCAT
ACCAAAGACG ACCGACCGTT CGTTGAGAAT GAAGGAAAGA CAGAAAGTGC TAGATCTCAC
TCACCGGACG TGTTCGACGG TTTACTGTTA ACAAGCAGTG ACGACAATGT AAATTCAATT
CCAACACAAC TTCAATCCCT AGATTCCAAA TTGCCGCCGA AAGGAGTGCC GGCATTAATC
CAAGAAACGG ATTGTGAATC GATGCCAACC ATTGCCGAGC CAAATGAATT CGAAATCGGA
CACAGAAAAC TGGACAAGGT TGAAGCTTCC GTAGTTTTAC ATTTCACTGA TGCAAACGAA
AAGAGTCAGG TGATTGCGCC TCGAGAAAGT AGTGTCCTTC CGATATCGAA TGGCGAAAAC
GAAACCTCAA TGGAGCAACC ATCCGGACTT ATGGAGCATA GCAATTCCAT CACAAAGCCT
TGTGCAGTCT TGAAGCCCGT CGATGTGGCT ATAGTCAGTA GTCCATGCGT AGAGAGCTCT
AGTCGGCAAA CTGTTGCGAC AACTATTGAA GACCAGAGTG GCCCTTGGAC TCGGACCGAT
TTCGGCAATA GGCCCCAGTC GGAGCAGCCT ACGACACCAA AGTGGTCACA CAACCGGCAA
CGAACTGTTG TTGGAGAAAA GTGCGACACT AGATACAAAG ATTTAAGTCC GGACATTGAC
ATCTCGCTAA CGCTGAGTGA TGGGGGTCTC GCCGAGGCAG CAAAAGTGGA AATGATGCGA
AGCCGACCTG CCTCTGGCTC TGTTTCCATA GCAAACTCTC CGTTTGCGCA GAGCGATGAT
GAGTCTCTAG AGGCTAATCA AGACAGGCGT ATTTCCCACA CGCCGAAGCG AAAACGCGCA
ATTAAGCGGC ATTTACCCCT TCATCACACG ACTTTCAGAC ACTCAGTTCA GGTTGATTCT
CCAGCGCAAA CTCTTCGCCG GAGCAATCGT TGTCGAAACG CTGGTAGGTC GTCTCGAAAG
ATTAAGAAGT ACGCCAAACC TCGCCATGAC GAAGGCGATG ACCCATCGGA AGTGATTGTA
ACCGATGAGC TATTGTTAGC AATCGCTGCA TGGAAAGGAG TCAAACTTAC TGCAGGCGAT
TTTAATAAGA TTCTTGAATC CCAATCGGTT AGCGGCTCTG TTGCCACAAG GGAGACTATC
AACCGCACCA AAGGTCATTC AGTTTCATTT TCTCCTCACG CCATTGCGGC ATCTAAAATG
CAGATTTCGA GCCAGCATGA TGAAAAGGAT CGGCGTTCCA AGTTCTGTGT AGATTTCATG
GATATCTTGG ATCTGGGTCC GGATGAGTTG ACCGACGAAG CAGGTAGTTT TGTGGATGAT
GAGTCATCGA AACTGTTCTC GAGATCGAAT CTAGCTTGTT CTATGGTCTT GGAAAACACA
ACAAACTCTA GCTTCGAAGA GACCGACGAA GACAGCAGAA TAAGTTTCAA GGAGAAGAAC
ATATTCGACT CACTAGCCGA CAAATTAAAT ACTATAATGG AGGGCAGAGA CAGCGAATCG
GACATGGAAC GATCAAAAGA ATTTAATCTA CGATCCTTTT CATTCTCAAG AGGAGATCAA
AGTGACACAG AAGACCACAC AAGTCGATCT TTCAGTGAAG CTGAGGAAAT CAGGCAGGGT
TGGTTCAAAA TGGGATGA
 
Protein sequence
MSLSSTGQYS YAVRTEKGVR YLSNEDEIKD LVRKSIQRAK KTSCLLSPER VSLPDSPGKR 
SVLSSLDRVE TQRRASESCP RENLMAPGMN SDGDKSETAS RIQPPSSPLS PSYVVKTESG
DQIMMSKEQM NRYVRSSIQR AKESAATQQK KTMSKKNSVW PKVRCSNTSD SPKTTNHFSF
SDYGELALDD RSTPASCPPM KPAKIIPKNS CSPFTANEAL ISPFTSSKSF VSKTQQLEVS
RPFFTPPDDR HNRQEHSEKY DKRQRHSLME DESSSTLWVN PLPRFSWSSL DNSQEGEEEC
SLASPMIESK LPIVIGESKS NLHHSCVNSA KATKSGIEIG GEKKCSALNN ASNSQDPIYQ
ANNRIMNSDI ERKISAKITI IKDGLFSDVL LQAQEGDVST ETASASSLTS KSSRNKLMRD
RYTLKVAEDP EVANMTLEST NGNSSPIPSN LANEMKLNSA GKAASPSSRS PDPKAMQSPS
LPNSIEDKSK AESGRASTSP RKLRLQAMMA ARRAALCKSP PKEKIDVNTK SEEFTLCDSK
NAIPGCSTSP IYVTEVDLIN SHTQSRQRRS QINVTKISTQ PERSATHNHL PEQLPTLLLL
KDEKNSSEYL TMSSHASKDE QIKDPVIDRS SYSNEITPPT TSDLGKYHQL VHTSSDKKKH
STQPCAENAE FTMEECDVDH SPLPMRIRED LDSPSNGSKV DKPDQSDEKE DQKTLPSIPV
DEIAATSTQM TTESRPVDWH TKDDRPFVEN EGKTESARSH SPDVFDGLLL TSSDDNVNSI
PTQLQSLDSK LPPKGVPALI QETDCESMPT IAEPNEFEIG HRKLDKVEAS VVLHFTDANE
KSQVIAPRES SVLPISNGEN ETSMEQPSGL MEHSNSITKP CAVLKPVDVA IVSSPCVESS
SRQTVATTIE DQSGPWTRTD FGNRPQSEQP TTPKWSHNRQ RTVVGEKCDT RYKDLSPDID
ISLTLSDGGL AEAAKVEMMR SRPASGSVSI ANSPFAQSDD ESLEANQDRR ISHTPKRKRA
IKRHLPLHHT TFRHSVQVDS PAQTLRRSNR CRNAGRSSRK IKKYAKPRHD EGDDPSEVIV
TDELLLAIAA WKGVKLTAGD FNKILESQSV SGSVATRETI NRTKGHSVSF SPHAIAASKM
QISSQHDEKD RRSKFCVDFM DILDLGPDEL TDEAGSFVDD ESSKLFSRSN LACSMVLENT
TNSSFEETDE DSRISFKEKN IFDSLADKLN TIMEGRDSES DMERSKEFNL RSFSFSRGDQ
SDTEDHTSRS FSEAEEIRQG WFKMG
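
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the method used to produce this record. It assumes the standard genetic code (the Translation table field is blank in the record) and that the gene sequence is shown in coding orientation. The names CODON_TABLE, clean and translate are hypothetical.

```python
# Standard genetic code, bases ordered T, C, A, G (an assumption: the
# record leaves the "Translation table" field blank).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above
first_blocks = "ATGTCGCTAT CTTCGACAGG TCAGTACTCT"
last_line = "TGGTTCAAAA TGGGATGA"
print(translate(first_blocks))  # MSLSSTGQYS
print(translate(last_line))     # WFKMG
```

Both results match the first ten and last five residues of the protein sequence listed above, which is consistent with the assumed standard code and orientation.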