Gene PHATRDRAFT_37283 details


Gene Information

Locus tag: PHATRDRAFT_37283
Symbol:
ID: 7201938
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 613568
End bp: 617034
Gene Length: 3467 bp
Protein Length: 1033 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002181236
Protein GI: 219121777
COG category:
COG ID:
TIGRFAM ID:
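
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the coding sequence is the protein's codons plus one stop codon. The helper name `span_length` is hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 613568
END_BP = 617034
PROTEIN_AA = 1033


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


gene_length = span_length(START_BP, END_BP)

# Assumption: coding bases = one codon per residue plus one stop codon.
coding_bp = (PROTEIN_AA + 1) * 3

# Whatever is left over is not translated (for example, intronic sequence).
non_coding_bp = gene_length - coding_bp

print(gene_length)    # 3467
print(coding_bp)      # 3102
print(non_coding_bp)  # 365
```

Under these assumptions the span matches the listed Gene Length of 3467 bp, and the roughly 365 bp that are not accounted for by the 1033 aa protein are consistent with the record marking the gene as spliced.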


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACCGT TAGAATTGTT GGCCCAGGAG CGTCTTCCAA GACGTGACTT CAACCGGAAA 
CGGGTCCCGT ACCGGGACGC GTACGTGTTG GTAGGTTGGT CGGTCACGGA TGCGTACGGG
CACTGAGGCA AAACGGCCAA ACAGGGTTGG ACACTGCGAA TGTCGGCGAT TTCCGCCGGA
TCTTTTGGTC ATTTCGAACC GCCTCTCGCG CAATTGCACA CTTTTGTATT TCCGGCACAC
GACCAACGAA ACAGACCCCC AAAAAAAAAG AACCGAACGG GTATCTCTAC CGGATTTGTG
CTACCCAACC GTACCTCGTG GTGATTGTCT CCACTCCAGC ACCACCACTA CCACCCATTT
CCTCGTAAGA CCAAAGAGAA CACGGATATC GAAGGCACCT TTATTTCAAC AAGCCGAACG
AGAATAGTCG AACGTCGAGT ACAGCCTTCA TCCACAGTCC CATACATTCC CACCTACATC
CTTGGATTAA CCCACCCACA CGTCCACCGA CCAGCATGCC ACCGTCCATT CTATCGGACG
CCTCGGGCGA TGCCTTTAAT TCACCCGAAC AATTCGTCGT GGTGGTCAAT CCCTACTCGA
CGGGCTGTTT GATTGCCAAA GAAATGCACA AGCGCGGATA CGTACTCATT GTCGTCTGGA
CCAAAGGCTT CTCCGCGGAT ATGAAGACGC ACATTCCCAT GAGTGCCGGG CCGATGGAGT
ACTACGCCGA AATTGACGAA CAAGACAATC TCGACGACAC CGCCGCGCTC GTCCGCAAAA
CGGCCAACGA ACAGGGAATC GCGGCATGCC TCGCTGGAGG CGAAGCCGGA GTAGACCTCG
CCGACGCACT TTCCGAACAT CTCAACCTAC TCACCAACGG AACCACTATT CCCAATCGAC
GTGACAAAAA GATTCAACAA GAACTCATTC GACAAGTAGG TCTGCGCTCC GTTCGACAAG
CCGGGGGTGA CCAGTTCTCG GACGTCGAGG ACTTTCTCCG GACGGAACCG TATCCCCTCG
TCCTCAAGCC CATCGAATCC GCCGGATCGG ACGGAGTCAA GCTCTGCGAG AACTTTCCGC
AAGCCAAGGA TCACTTTGAG TTATTGATGA AGTCACAAAT GGTCAACGGA GGGTCCTGTC
CCGCCGTACT CTGCCAAGAA TTTCTCCGCG GCAAAGAATA CGTCGTAGAT CATGTTTCTC
GCAACGGCGT CCACAAAACT GTCATGGTAT GGGTCTACGA CAAGCGTCCC GCCAACGGAT
CCGCCTTTGT TTACTTTGGC TGTGTCCCGG TCGACTCCAA TTCCCCGGAA GCGCTTATCG
TCATTCCCTA TATTCGCGGC GTCCTCGACG CCATGGGGGT ACAGAACGGA CCATCGCACG
GGGAAGTCAT GATGACCGAA GACGGTCCTT GCTTGATCGA AATGAATTGT CGCGCCCACG
GTGGAGATGG TAACTGGAGA CCCCTCTGCC GGGCCCTCAC TGGAGGCTAC TCCCAAGTCG
AAGCAACCGC CGACGCCTAT CTAGACGCTT TCCAATTCAA TCGACTCCCC GATAAACCTC
CCAGTCCCTT CAAAGCGTCC GGACAAGAAA TTATTCTCGT CAGTTATTCC CAAGGGACGG
TCCAGAGTTG TCCAGGATAC GACGTGATCA AAAGTTTACC CTCGTTCGTC TGTTTGGAAA
CGGGCGTTAA ACCGGGATCC GAGATAGACT ACACCGTCGA CCTCTTCACC GGCATCGGGA
GTGTCATTCT CATGCACAAG GACCCGGCTG TGTTGGACCG CGACATTGAC TTTGTCCGGT
ATATGGAGAC CATCAACGGC CTTTTTGTGT ACGAAACCAA GTTGGAAAAT CTCAAACGCC
CCCGTGGCGA GGCCGTTACC GAAAAGGGCC ACCGTCGCGT TTTTTCTGCG GAAGGCCCTG
GTCTCATTCG CCACATGTCC AACGATCGTC CCGAATTGCG CAATCCGCTG GTGAAACGTA
TGACCACGGT CGATGCCTCG CGGGAAGTGG TCGTTATAAT CGACCCTTAC TCGACTGGAT
GCTGCATAGC GGAAGAGATT ATCAAACGGG GCTTCAACGT GATTGCACTC TGGACCGAGG
GATTTTCTGA AGAAATGAAA AAGCATGTAC CCCTCAGTGT GGGAAATGTG ACATACTTCA
AAGAAGTGAA TCAGGCCGAG ACGTTGGAAA AGACGGAAGC GGTAGTTCGC AAAGCGGCGG
AACTGTTCCG TATTGTTGCC TGTATCGCCG GCGGGGAAGC CGGAGTCGAC CTTGCCGACG
CTCTTTCCGA ACAGCTCAAG GTTCGGACCA ATGGTACCGG CATTCCCAAC AAGCGCGACA
AGAAACTCCA ACAAGAGCTT GTCAAAAAGG TGGGACTGCG GTCAGTACGC CAAGCGGGAA
GCGATAAATT TGCCGATGTC GAGCCTTTCT TGCGCCGAGA ACCGTACCCG GTGGTCCTGA
AGCCAGTTGA ATCGGCCGGG TCAGATGGTG TCAAACTGTG CCACAATTAC GACGAAGCAA
AGCAGCATTT CGGGGTACTC ATGAAATCGC AAATGGTCAA TGGGGGGGAT TGTCCAGCGG
TACTCTGTCA AGAATTTTTG CGCGGGAAGG AGTACGTCGT CGACCACGTG TCTCGCGACG
GTAAGCACAA AACCGTCATG GTTTGGGTGT ATGACAAGCG CCCGGCAAAC GGATCAGCCT
TTGTCTACTT TGGTTGCGTC CCAGTAGATT CCGATTCACC CGAAGCTCGT CTATTGATTC
CGTACGTACG CCGAGTACTA GACGCCTTGC AAATCAAGAA CGGTCCATCG CACGGCGAAG
TCATGATGAC GAACAACGGT CCCTGTTTGG TAGAAATGAA TTGTCGCGCA CACGGTGGTG
ACGGGAACTG GCGTCCCCTC TGCCGCGCAC TCAATGGTGG CTACTCCCAG GTCGAATCCA
CGGTCGATTC GTACTTGGAC AGTCGTCAAT TCATGATTAC CCCCGAAAAA CCACCTAGTC
CCTTTAAAGC TCACGGCCAG GAAGCAATTT TGGTTTCATT TTCACGCGGT GTAGTCAAGG
CCACTCCCGG TACGTTGGAT TGCTTATCTG TGTTGAGTTG ATTCCGGCGT TTTCTATCTT
AAAATGTTTT ACAGTCAGAG ACTCACTCAA TCCCTTCTTT ACTATGGGCA TCCCCAGGTT
TTGAGGAAAT TCAAAAGCTC GAGTCATTCG TCTATTTGGA GACGGGTGTT CGCGTCGGCA
CCTTTGTTGA CTACACGGTC GACCTCTTTA CCGGAATAGG TAGTGTCATC GTCATGCACC
AGGACGAAGA CGTATTGGAA CGCGACGTTC GTCGCATTCG GCAGTTGGAA TCGGAAAATT
TGCTGTTTGA ATACGAAACT GGCAAGGTAG TCTTTTCCTC GCCGAGCAAC ATTCACGACA
CTGGAAGCGT GACGGTGGCC TCCGCGAACC GTCCCGACTT GTATTAG
 
Protein sequence
MEPLELLAQE RLPRRDFNRK RVPYRDAYVL TPKKKEPNGY LYRICATQPY LVVIVSTPAP 
PLPPISSPIH SHLHPWINPP TRPPTSMPPS ILSDASGDAF NSPEQFVVVV NPYSTGCLIA
KEMHKRGYVL IVVWTKGFSA DMKTHIPMSA GPMEYYAEID EQDNLDDTAA LVRKTANEQG
IAACLAGGEA GVDLADALSE HLNLLTNGTT IPNRRDKKIQ QELIRQVGLR SVRQAGGDQF
SDVEDFLRTE PYPLVLKPIE SAGSDGVKLC ENFPQAKDHF ELLMKSQMVN GGSCPAVLCQ
EFLRGKEYVV DHVSRNGVHK TVMVWVYDKR PANGSAFVYF GCVPVDSNSP EALIVIPYIR
GVLDAMGVQN GPSHGEVMMT EDGPCLIEMN CRAHGGDGNW RPLCRALTGG YSQVEATADA
YLDAFQFNRL PDKPPSPFKA SGQEIILVSY SQGTVQSCPG YDVIKSLPSF VCLETGVKPG
SEIDYTVDLF TGIGSVILMH KDPAVLDRDI DFVRYMETIN GLFVYETKLE NLKRPRGEAV
TEKGHRRVFS AEGPGLIRHM SNDRPELRNP LVKRMTTVDA SREVVVIIDP YSTGCCIAEE
IIKRGFNVIA LWTEGFSEEM KKHVPLSVGN VTYFKEVNQA ETLEKTEAVV RKAAELFRIV
ACIAGGEAGV DLADALSEQL KVRTNGTGIP NKRDKKLQQE LVKKVGLRSV RQAGSDKFAD
VEPFLRREPY PVVLKPVESA GSDGVKLCHN YDEAKQHFGV LMKSQMVNGG DCPAVLCQEF
LRGKEYVVDH VSRDGKHKTV MVWVYDKRPA NGSAFVYFGC VPVDSDSPEA RLLIPYVRRV
LDALQIKNGP SHGEVMMTNN GPCLVEMNCR AHGGDGNWRP LCRALNGGYS QVESTVDSYL
DSRQFMITPE KPPSPFKAHG QEAILVSFSR GVVKATPGFE EIQKLESFVY LETGVRVGTF
VDYTVDLFTG IGSVIVMHQD EDVLERDVRR IRQLESENLL FEYETGKVVF SSPSNIHDTG
SVTVASANRP DLY