Gene PHATR_43937 details


Gene Information

Locus tag: PHATR_43937
Symbol:
ID: 7204168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 476811
End bp: 479810
Gene Length: 3000 bp
Protein Length: 999 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002186068
Protein GI: 219112969
COG category:
COG ID:
TIGRFAM ID:
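
The length fields above are consistent with the listed coordinates if positions are 1-based and inclusive and the CDS includes its stop codon. Both conventions are assumptions here, not something the record states. A minimal Python sketch of that arithmetic follows; the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(476811, 479810)
print(length)                  # 3000
print(protein_length(length))  # 999
```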


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.499306
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTTGT ATATTGAGCC TAACAGATGT CACAGCGGCT CGCCACGCCA AAGGGTGGGT 
AACCCAACCC TCGAGAACGT GGCGGAAGAA ATGCTGAACG GGCTCTTCTG CCATTACGAG
CCTCCCGACC AGTCTCGGCG AGATCGAAGT GATTCGGGCC TGACCAGGTC CATCCTCCTC
CCTCCTCGTG ATTTCCGCCA GCAAAGTGTC CGCTCTAAAA AAGAAGTGCG TTGGTCCGAC
AGTATCAAAG AATCGCAAGA CAACGGGTCT CCCAGAGCGA TTCACTGCAA CGTCGGTTCA
CACTGCGTCG ACGCTGGCAT CGAAGCCGGA TGCGTCGACC CTGAACACAA GGGAGTTACC
CGCGATTACT TTGCCAAGTA CGACTACGAT TTGCAGCAAG TCGACAGTAC TGAAAGTGGC
GCGCCGTTTC GTCCTTCTTG TCTCCACAAG AATGTAGTCT TTGATGACGA CGGGAACCCG
ATTGCATCCG TTCCCGGGAC CCCGATCCCT ACGCTCCCCC CTCTCCCTCC GCTTCCATCC
AGGAGGGAGC GATTTATTCC CGAGCTTTGT GGTATTGGTA TCAATTGTGG CGAAGACGAA
GAGGAATTGC GTGAGATGTG CGCTTCCAAG CCGTATCGTA GGCCTTCGCA ACCGTACGAC
TACTCCGAAG ACGATGAAGA CAACTCTCTT CTTCTTGAAG AAAACAACTA TGAAATTCAA
GGAAGGGCAA CATTCAAGAC TCGCGGGCGC AAATCTCTTC CCACCCCCGC CTCCAGTCCG
CCCACCGAAA CCAAGTCGTC CCCAACAAAG AACAACGTAA ATGGATCAAC ATTACACGAG
GCCGACTGGG AGACTACTTC CAACGCATCC TCGTCACTGT TTCGCCGTAT ACTTCCTCGT
TGGAAACGTA ACGGCAGCAA GCATAGAGAA GCAGTTTCTG AAGAGCAGGC AGTTCTCGAT
CCAATTGAAC GGCAGCGGGC AGAAGACAAT AGAAAGGCTG AAAAGGCTTC CTTCTTGCTT
AAGGATTCGA AACTAAGAGC TTCGCAAGCA TCCACATGCG AAGAAAACAA CGACTGGGAC
GACATTGTAC CAGTCGCTCC ACTTTTGAAA ACGGAAATTT CGAGTCAGCT CCCTCATAGG
CCCGCTGCGT GTCAACCCAC TAGTAACGCC GCTGCCCTTT ATACCGTTCC AAAATTGGAA
GCAGCAAAGC GCGACCCGTC TCCTAGCCGG GGGCGAGGCC GCAGTGATAG TCTCGACAAG
AAAGAGAAAC CCAGACGCGG TTTTCTGAAA CGACGAAAGA GATCCAAGTC GCCCTCGTTG
CCCGGAAATT CTTTAGTTGT CTTGAACGCG CTTCAAGATA AAACCGACTG TGAGCGCAAA
AGTGCCATTG TGGCTCACAG GCAAGGACTA AATCAAACGG GATTGCAACA GCCTCTTCCG
ATGGGAAAAA ACCCTTATCA GCCTTCTCCA GCAAACCCAT CCCGTGAAGA ACTTCTTCTC
AAAAGGCAAC AACAGAAATC TGCATCTATT CCTCCAAATT TGATTTCGAG GGAACATGTG
GACCAACAAC AACCGCCAAT CATACAACAG CAACGCAATG CTTTGGCGCA ACCGCAGTTA
ATCCATGATT CTCGCTTTCA TCCAGCACAT CCTTTGAATC GTTCTGTTAG CGGCAGCAAT
GGCAACAGTC ATTCTCTTCA TGCAACCCAC AAACAGCTTG TGGATGCAGT AGATGATCGA
ACAGAGCGAA TGAGTTTCAT TGCAGCTCAC CGTCAGGGCT TCGATGAAGA CATAAGGGTG
GCCACTCTCG CCCAGTCCTC CCGTCACGAT GTACCAGACT GTCTTGACGA TCAAATGGAG
AGAATGAGTT ACATCACCGC TCATCGTAAG AGTTTTGGCG ACGGGTCATT TCATTCTGCG
AAAACGCCGT CGCAGATGAC ATTGATACAT GAATGCCAGC CTCTTGCAGA TTACCGCGGA
GGTATACCAT CGACCGGGCG TCATTATTCT CCCCACGATC AACGCATTTC CTTAGAGCAA
CCTTTCAAAG ACAAACGGAA AATAACAAGG GTGGAGCAAA AAGCTCTGAA GAAGCTCGAT
AGGGAACAAC GCAACGAAAT GCGAAGCGTA GAGCGCTTGA TTCGAGTACG TAATCCTACG
CAGCACTTAA AGACTTTAGG GTCATCGAAT CCGCAAAATT CGGCCATATT CCGAAAAGTT
GATGTACCCG CTCGAAGTCC AAGCCATCAA TCTTCCTACC AAGAACCACC GTTGCCTTTC
CGTCCAGTTG AGAATGTGGA TCAAACCAAC CCCAGCCAAG ATCCTCCAGT CCAGACCACT
GGCTCTTTTG AAAAGTTTGA GGGAGAGAAA CCTATTGAAA CTACTTGGGA GGAAAGAACA
CGATTAGCTT GGGAGCGTCT GCGGGGAAGC TTCACCATGG AGAGCGCGAC GGAAAAATCC
AACGAAGAGG CCCAACTGTT ACGCCCGCCT ATATCGAACC AGTCGCCCTC CCAGCAAGTA
CGGGGCATTC TGAAGATACC GTCGCACGAG AAGCGCGTGA CCTTCGGTCA AGACACAGAA
CACATTTTTG ACAAGGCAGA AGAGAACATA CCAATGCAGA GTGAAAACGG GTCTTACGTT
CAAAATACCG GTAGGCTATC CACGACAATG CCACAGCCAC TGGGTATGGT CGACCTTACG
GATCAGCACG GTCGAGTCAC CATGCCATCC ATGTATCCCC AGAACCTGCC ACCCATTTCA
TCCAGCAAGA AAACAAAAAA AGTCTTTCGC GGCGCAAGGA TTTTCAGCGA ACTATTTCGA
AAAGGGAAGA AGCGTAGCAG CCGTAGCAAA ATGACAAAGG GATCGTTCCG AGTCAACCCT
ATGGAAATCG TCAATTCACT CTCAACTGAA ATTACCACCC CGTCTATCTC ACGTTCTGGG
GATGAGTATG ATATGCTTGG AAATACCGCG ATGGGCTACA ACGTAATGCA CGCCGTATAG
 
Protein sequence
MALYIEPNRC HSGSPRQRVG NPTLENVAEE MLNGLFCHYE PPDQSRRDRS DSGLTRSILL 
PPRDFRQQSV RSKKEVRWSD SIKESQDNGS PRAIHCNVGS HCVDAGIEAG CVDPEHKGVT
RDYFAKYDYD LQQVDSTESG APFRPSCLHK NVVFDDDGNP IASVPGTPIP TLPPLPPLPS
RRERFIPELC GIGINCGEDE EELREMCASK PYRRPSQPYD YSEDDEDNSL LLEENNYEIQ
GRATFKTRGR KSLPTPASSP PTETKSSPTK NNVNGSTLHE ADWETTSNAS SSLFRRILPR
WKRNGSKHRE AVSEEQAVLD PIERQRAEDN RKAEKASFLL KDSKLRASQA STCEENNDWD
DIVPVAPLLK TEISSQLPHR PAACQPTSNA AALYTVPKLE AAKRDPSPSR GRGRSDSLDK
KEKPRRGFLK RRKRSKSPSL PGNSLVVLNA LQDKTDCERK SAIVAHRQGL NQTGLQQPLP
MGKNPYQPSP ANPSREELLL KRQQQKSASI PPNLISREHV DQQQPPIIQQ QRNALAQPQL
IHDSRFHPAH PLNRSVSGSN GNSHSLHATH KQLVDAVDDR TERMSFIAAH RQGFDEDIRV
ATLAQSSRHD VPDCLDDQME RMSYITAHRK SFGDGSFHSA KTPSQMTLIH ECQPLADYRG
GIPSTGRHYS PHDQRISLEQ PFKDKRKITR VEQKALKKLD REQRNEMRSV ERLIRVRNPT
QHLKTLGSSN PQNSAIFRKV DVPARSPSHQ SSYQEPPLPF RPVENVDQTN PSQDPPVQTT
GSFEKFEGEK PIETTWEERT RLAWERLRGS FTMESATEKS NEEAQLLRPP ISNQSPSQQV
RGILKIPSHE KRVTFGQDTE HIFDKAEENI PMQSENGSYV QNTGRLSTTM PQPLGMVDLT
DQHGRVTMPS MYPQNLPPIS SSKKTKKVFR GARIFSELFR KGKKRSSRSK MTKGSFRVNP
MEIVNSLSTE ITTPSISRSG DEYDMLGNTA MGYNVMHAV
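
Both sequences above are printed in blocks of 10 characters, six blocks per line. The Python sketch below shows one way to join such a listing into a contiguous string and compute its length and GC percentage. The helper names are hypothetical, and the sample is only the first line of the gene sequence, so its GC value will not necessarily match the whole-gene figure reported in the record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a blocked sequence listing."""
    return "".join(text.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here as a short sample.
sample = "ATGGCTTTGT ATATTGAGCC TAACAGATGT CACAGCGGCT CGCCACGCCA AAGGGTGGGT"
dna = join_blocks(sample)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Applied to the full gene sequence, `len(join_blocks(...))` can be compared with the Gene Length field, and `gc_percent` with the GC content field.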