Gene PHATRDRAFT_36007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36007 
Symbol 
ID7201348 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp322052 
End bp325416 
Gene Length3365 bp 
Protein Length1106 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180413 
Protein GI219119300 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.287397 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCCCG CCACCCGGCA AATGACGAGT GCAGCCGTCT ATGCCCACCT TTTGGACAAC 
GTACTTCTTC TTCCCCAAGG GCATCCTATC CGCCTCAGTT TTGAGCAACA AGGATATGAA
TCGGCTGATG ATCTTCTGTG TATTTTTGAG AATGAACTTG AGTCTCTTGG ATACACTCCT
TCTGTCCTTC CCGACGGCCT GGAAAACCCG CCAACTATAC CCCTTCTCAT GGCGCACCGA
CAGATCATAC GTCATTTCTT GCGCTGGCAG GCATCTTTGG AACGACAAAA GGGGACACCC
TTGAAGAACT CCGAACTTGT TGCACTTAAC AATGAAGATT TTGTCCTTTA CCGTCGCTCA
GCCCTTGGTC AAGTCTCGAC AGCAACTGCA CCGGTTAATG CTTCCCCAAC TGTCCAGAGC
CCCATAGGAA AGACACGTTC GGCTGTCGAG GACTTCAAGC GTGGGATCAA ACGTGACAAA
ACTCACTATC CCGTGCTTAA AGATGATCGG TACTGGGACA ACTTTTATCG GTCGTTTGTT
GTTACTGCCG TAACACATAA CGTTGACAAA GTTCTAGATC CGACGTACAT CCCTACCGAT
CCCTTGGAGA AATCCCTTTT TGAAGAGCAG AACAAGTTCG TATATTCTGC TCTAGAGCAT
ACTCTCCAGA CGGACATGGG CAAGAACATT GTACGCGAGC ATAGTTTCGA CTTCAATGCC
CAGGAAGTTT TCCGTAAGGT TGTGAAACAC TACACAGAGT CCGCTAGCGC GAAGATTAGT
TCGTCTACTA CCCTGGGATA CCTTACAACT GCAAAGTACG GATCGTCATG GACTGGCACA
GCAGAAGGTT TTATTCTTCA CTGGAAAAAT CACTTGCGCA TCTACAATGA CACTGTTCCT
GCTGGTGAAC AGCTTCCTCA GCAACTATGC CTTAGTCTTT TGGAGAATGC TGTTCATGAT
GTACCTGAGC TTCGACAGGT AAAAATCACT GCAACTCTTG ACTTAGCAAA GGGAGGTAAT
CCTATTAGCT ATGATGGTTA TCTCAGTCTA CTACTCGCAT CGGCATCGCT CTACGACAAC
GGCAATAATC TATCTAATTC TCGTAGTGGC AAGAACAAGC GCAACATCTA TGCTAATGAA
CTAGAGTACA ATCCGATGGA TTTTGAGAGT AAACCGGATG TAGACTATGA TATAGATGTG
TCGCCTACCG CAATCTACGA AGCCAATGCT CATGCCCGTA ACAGCAGTTC CCGGAATCGT
AGTCCGGCAG CTAATCGCGA GCGACCTTAC ATCCCTCGTG AAATGTGGAA CCTGCTCTCC
GACGATGCCA AAGCCATCCT CCAAGGCTTA ATAGCCCCCG GGAAGCAGGC CCCGTTGAAT
AATACGCCAC ACCAATCGTT GCAGGCCAAT ACGCACGATA CCATTGGCGC GGAACGAATC
ACAACGGACA CCTTCCATGA TTGCGCACCC GAAACTGAAT TGCTTGCCCA CCTGACTGAG
CGTGTTAGTC ACATGAGCGA CGGCGACATA CGTAAGGTAC TTGCCGCATC TCGTGATGGT
CCCGCCTATG ATGAGCCCAC ACCACTGCAA TCTAACGTAC TTCAATATCA AGTGTCTCGT
CACAACGTCA TTGAAACTAC GGCAGCCCTC GTCGACCGTG GAGCCAATGG AGGTCTTGCC
GGCAGTGATG TCATGGTCTT GCATAAAACA GGTCGTTCTG CAACCATCAC AGGTATCAAT
GATCATACCT TGTCCGATTT GGACATTGTC ACCGCTGCTG GCTACACTGA ATCCCAAAAT
GGCCCCATCA TTCTCATTAT GAACCAATAC GCCCATTTGG GACAGGGTAA AACTATCCAC
TCCAGTGCAC AGCTTGAACA CTATCGCAAC CATGTCGAAG ACCGTTCCCG TACTGTAGGA
GGTAACCAGC GAATTGTAAC ATTGGATGAC TACATCATCC CATTGCACAT TCGACAAGGA
CTCGCGTACA TGGATATGCG GCGTCCTACC GACAAGGAAC TTGCGTCCCT TCCACACGTT
GTCCTAACCT CCGACGTAGA CTGGGATCCC TCCGTACTTG ACCACGAAAT TGATCTCGCG
ACCTCTTGGT ATGATGACAT ATATGATTTG CCTCAATCAC CTTACGTTGA ACCACGTTTT
GACCATACAG GCAAATACCT CCATCGTCAC ATTTCCCTTT GCAACCATCG CGATGACGTT
GTTGACCGCG TATTATATTG CCAACGGCAC CTCGTCACGA AAAATGTGCA AGATTATGAG
GCCCTTCGTC CGTGTTTTGG ATGGGTCTCT GCTGAAACCG TTCGCAAGAC CATCATGGCG
ACCACGCAGC ATGCACGCGA AGTATATAAC GCTCCGTTAC GCAAACATTT TAAGTCTCGC
TTTCCCGCTC TAAATGTACA CCGTCGTAAT GACCCAGTTG CTACCGATAC CATTTGGTCC
GACACCCCTG CTGTCGATAA TGGTGCTAAA TTTGCACAAC TTTTCGTTGG TCGACGCTCC
CTTGTCACCG ACGCTTACCC CATGAAAACT GACAAAGAAT TCGTCAATAC CCTTGAGGAC
CATATCCGTT ACCGGGGTGC CATGGACAAA TTGATTAGCG ATCGTGCCCA GGTTGAAATC
AGCAAAAAGG TCACCGATAT TACACGCGCA TATAATATCG ACCAGTGGCA AAGTGAACCA
AACCATCAAC ACCAAAACTT TGCCGAACGT CGTATTGCCA CTATCGAGGC TAATACCAAC
AACATTCTCA ATCTTTCCGG TGCCCCTGAT TCCGCCTGGT TACTTTGCGT GACATATGTT
TGTTATGTTT TCAACCATTT GGCACATGAA TCCCTAGATA ACCGCACTCC CCTTGAAGTC
CTCACCGGCT CCACGCCTGA TATCAGTGTT CTCCTTCAGT TTCATTTTTG GGAACCGGTC
TATTATAAGC TCGAAAATGC GACATTTCCT TCTGGTGGTA CCGAACAACA AGGACGTTTT
GTTGGCATAG CCGACTCCGT CGGCGACGCT CTCACTTATA AGATACTTAC CCACACCACC
AACCGCATTC TTCATCGCTC TAGTGTCCGT TCTGCGACCA TTCCCGGACA AACCAACCTA
CGCCTTACGC CACAGGATGG GGAGAGTGGT CCTAAACCCA TCAACTTTAT CAAGTCGCGT
AGAACCGAAA ACAAAAATTC CTATGCCATT AAGGAGTTGC CTGGTTTCAC ACCTGATGAC
CTTATAGGTC GTACGTTCCT CACCGACACT CGGGATGATG GGGAGCGTTT GAAGGCACGA
ATCACGCGGA AAATATTGGA CCCAGACAAG CCCTCGGATG TAAAGTTCCT TGTCGAAATC
AATGA
 
Protein sequence
MVPATRQMTS AAVYAHLLDN VLLLPQGHPI RLSFEQQGYE SADDLLCIFE NELESLGYTP 
SVLPDGLENP PTIPLLMAHR QIIRHFLRWQ ASLERQKGTP LKNSELVALN NEDFVLYRRS
ALGQVSTATA PVNASPTVQS PIGKTRSAVE DFKRGIKRDK THYPVLKDDR YWDNFYRSFV
VTAVTHNVDK VLDPTYIPTD PLEKSLFEEQ NKFVYSALEH TLQTDMGKNI VREHSFDFNA
QEVFRKVVKH YTESASAKIS SSTTLGYLTT AKYGSSWTGT AEGFILHWKN HLRIYNDTVP
AGEQLPQQLC LSLLENAVHD VPELRQVKIT ATLDLAKGGN PISYDGYLSL LLASASLYDN
GNNLSNSRSG KNKRNIYANE LEYNPMDFES KPDVDYDIDV SPTAIYEANA HARNSSSRNR
SPAANRERPY IPREMWNLLS DDAKAILQGL IAPGKQAPLN NTPHQSLQAN THDTIGAERI
TTDTFHDCAP ETELLAHLTE RVSHMSDGDI RKVLAASRDG PAYDEPTPLQ SNVLQYQVSR
HNVIETTAAL VDRGANGGLA GSDVMVLHKT GRSATITGIN DHTLSDLDIV TAAGYTESQN
GPIILIMNQY AHLGQGKTIH SSAQLEHYRN HVEDRSRTVG GNQRIVTLDD YIIPLHIRQG
LAYMDMRRPT DKELASLPHV VLTSDVDWDP SVLDHEIDLA TSWYDDIYDL PQSPYVEPRF
DHTGKYLHRH ISLCNHRDDV VDRVLYCQRH LVTKNVQDYE ALRPCFGWVS AETVRKTIMA
TTQHAREVYN APLRKHFKSR FPALNVHRRN DPVATDTIWS DTPAVDNGAK FAQLFVGRRS
LVTDAYPMKT DKEFVNTLED HIRYRGAMDK LISDRAQVEI SKKVTDITRA YNIDQWQSEP
NHQHQNFAER RIATIEANTN NILNLSGAPD SAWLLCVTYV CYVFNHLAHE SLDNRTPLEV
LTGSTPDISV LLQFHFWEPV YYKLENATFP SGGTEQQGRF VGIADSVGDA LTYKILTHTT
NRILHRSSVR SATIPGQTNL RLTPQDGESG PKPINFIKSR RTENKNSYAI KELPGFTPDD
LIGRTNHAEN IGPRQALGCK VPCRNQ