Gene PHATRDRAFT_41318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41318 
Symbol 
ID7199193 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011697 
Strand
Start bp106734 
End bp109722 
Gene Length2989 bp 
Protein Length916 aa 
Translation table 
GC content62% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185282 
Protein GI219130250 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.305609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCCGA CCGCCGACTT CACCCTTTCC GACTTTCCTC ACAAAGTCCT CGATCCCATC 
GCCACCGACA CCACTGCTCC CTCGTACGCG TCGCTTCTCC TGGCCCAACG CCAACTCAGT
GCCAACGCGT CCGCCATTCC CAGCCTTAAT GGCGGCGGGG CCCATGGTCA CATGGCCCTC
ACGCTCACTG ACGAAGCTTA CGCGGAACTT TCCGACATCC CGTTCGTCAT CCCCGTTGCT
CCCCCTGCCG ACCCTGAACC CGGCACCACG CAACCTCAAA TCACCGAAAA CAACCGCCTC
CACAAACGCG CTGTAGCCAC CCACAGCCTC TACGTGGCGG TCAACAACGC TCTCCGTCGC
CAGATCCTCG ACGCTGTTCC TCGCGTGTAC GTTCGCGACC TGGAGCACCC CCAGTTTGCC
TACAGCCATG TTTCCTGTCG CGACCTTCTC GACCATCTCT GGCGCAACTT CGGTACCATC
ACCGCTTCCG ACTTAAAAAG CAACATCCAA TCTATGTACA CCCCTTGGAA CCCGGTTGAC
CCCATCGAAA CCATTTTTCA TCGCTTAAAT GATGCCATCG CGTACTCGAT AGCCGGCCGT
GACCCCATCA CCGAGGCCGC CGCCGTTCGC GCCGGCTACG ACGTGCTCGA GCACTCGGGC
CTGTTTCCAC GTGCCTGTGA AACCTGGCGC ACCGCCTCGC CCGATACCCA CACGCTTGCC
AATCTGCGCG CCCTTTTCAA AGTCGCCGAT ACCGACCGAA AGCGCACCGT CACCACCGGC
ACCCTCGGTT ACGCCAACGT CCTTACCACC GCGCCATCGG TTCTCCCTTC GCCCTTGCCC
GACGCGCTCA GCCTTCCTTT CTCAGCCCTC TCGGTGTCCA ATTCCTCTGC CACCCTCTCT
GAGAAAACTT ATTGCTGGAC CCATGGGTCC AGCAACAACC GTCGGCACAC TAGTGCCACG
TGCAAAAACA AGGCCCCCGG ACACCGCGAC GACGCCACGG CCACCAACAC CCTTGGCGGA
TCCACCAAGG TTTGGACTGC CCCCAAACCT CCCGAATAGG AAAGAGGGAC GGCTACGCCG
ACGATTAACA CTAGTAATAC CGATTATTTA AATCATATTA CTAGTCTTAA CTCGTCTGTA
GTCCCCTCCC CGCCTAGCAT ACACACCTCA GCCATCGCCG ACACCGGCTG CACAGGACAC
TACATCACCG TGTCCTGTCC CCACTCCCAC CCACAACCTG CCTCTCATCC CCTTGCCGTC
CGCGTTCCCA ACGGCGCTAT TCTCCGCTCG AGCCACACAG CCACTCTCGC TCTCCCTGGA
TTTTCCCCCA CCGCTTGCCA GGCGCACATC TTTCCCGACC TAGCTTCCCA TCCCCTCCTC
TCCATCGGCC AACTCTGCGA CGACGGCTGT ACGGCCACTT TCTCGGCCAC TCGCCTTGAC
ATTCATCGCG ACGCTACCCT GCTGCTCTCT GGCGCCCGCT CCCCCCACAC CGGCCTCTGG
CACCTCGATC TTACCCCTCC CCAGCCCCCT GCCACAGCCC ACGCTCTGGT TCCCAACACC
CCACTTGCCG ACCGCATCGC TTTTGTTCAC GCCTCGCTCT TCTCCCCAGC TCTCTCTACC
TGGTGCCAGG CCCTCGACTC CGGCCACCTT GCGACTTTTC CAGACGTTTC CTCCCGCCAA
GTCCGCAAGT ACCCACCTAG CTCCCCCGCG ATGATCAAGG GTCACCTCGA CCAACAACGC
GCGAACCTGC GCTCCACCAA GCTCTCCCCT GTCTGTTTCC CTCTCTCGAC GGAACCCCCT
GCTGCCGCTG CGCCCGACCT CGACCCTCCT GACGCCCACC CCGTCGCCCG CACACACCAT
GTCTTTGTTG CCCACCAAAG GGTTACCGGT CAAATCTACA CGGACCAGCC GGGTCGTTTC
CTCACTCCTT CCAGTGCAGG CCACAACGAC ATGCTTGTTC TTTATGATTA CGACAGCAAT
GCTATCCACG TCGAACTCAT GAAGAACAAG TCCGGCCCCG AGATTCTGGC TGCCTACCAA
CGTGCTCACG CTCTTTTCAC CCAGCGCGGC CTACGTCCCC AACTTCAGCG CCTCGATAAC
GAAGCCTCTA CCGCCCTCCA AGCCTTCATG ACCTTAGAGC ATGTCGACTT TCAGCTAGCA
CCCCCCCATC TGCACCGTCG TAATGCCGCC GAACGGGCCA TACGCACCTT CAAGAATCAC
TTTATTGCTG GCCTCTGCAC CACGAACCCG GATTTTCCCC TTCATCTTTG GGACCGCCTC
CTCCCACAAG CCCTTATCAC CCTAAACCTT CTTCGGCGCT CCCGCATCAA TCCCAAGTTG
TCCGCCCACG CACAACTTCA CGGGGCCTTC GACTACAACC GCACCCCGCT TGCTCCTCCT
GGCACGCGCG TCTTAGTTCA TGTCAAGCCC GCTGTTCGCG AAACCTGGGC CCCCCATGCT
GTTGAAGGTT GGTATCTCGG CCCCGCTCTC AACCATTATC GCTGCCATCG CGTCTGGATC
ACGGAAACAC GTGCCGAACG TGTTGCGGAC ACCCTTTCCT GGTTCCCGAC CCGCATTCCC
ATGCCCGCCG CTTCGTCCAC CGACCGCGCC CTGGCCGCCG CCCGTGACCT AGTACATGCC
CTCCAGAATC CTTCCCCTGC GTCTCCGTTC GCCCCCCTCG ATGCCAACCA GCACCAGGCC
CTTACCGACC TCGCCAATCT CTTTGCCACC GTGGCCGCCC CAGTCGACGA CGTCCCCGCA
CCCGCTCCAG TGCCTCCGGT CCGTCCCCCT GCCCCAGCAA CTCCCCTTGC TCAGGTCCGT
TTTGCCGTTC CTCTTGTCAC GGCCGAACAT GCCCCGGCAC TTCCGAGGGT GCCCATTCCG
GCCCCAGCAC TTCCGAGGGT GCCCACCCCG GCCACCTATC ACTCTCGCAC CGGCAACCCC
GGCCGTCGCC GCCGCACAGC ACGCAAACAA CCGGCAACCC CAACCCTAG
 
Protein sequence
MSPTADFTLS DFPHKVLDPI ATDTTAPSYA SLLLAQRQLS ANASAIPSLN GGGAHGHMAL 
TLTDEAYAEL SDIPFVIPVA PPADPEPGTT QPQITENNRL HKRAVATHSL YVAVNNALRR
QILDAVPRVY VRDLEHPQFA YSHVSCRDLL DHLWRNFGTI TASDLKSNIQ SMYTPWNPVD
PIETIFHRLN DAIAYSIAGR DPITEAAAVR AGYDVLEHSG LFPRACETWR TASPDTHTLA
NLRALFKVAD TDRKRTVTTG TLGYANVLTT APSVLPSPLP DALSLPFSAL SVSNSSATLS
EKTYCWTHGS SNNRRHTSAT LNSSVVPSPP SIHTSAIADT GCTGHYITVS CPHSHPQPAS
HPLAVRVPNG AILRSSHTAT LALPGFSPTA CQAHIFPDLA SHPLLSIGQL CDDGCTATFS
ATRLDIHRDA TLLLSGARSP HTGLWHLDLT PPQPPATAHA LVPNTPLADR IAFVHASLFS
PALSTWCQAL DSGHLATFPD VSSRQVRKYP PSSPAMIKGH LDQQRANLRS TKLSPVCFPL
STEPPAAAAP DLDPPDAHPV ARTHHVFVAH QRVTGQIYTD QPGRFLTPSS AGHNDMLVLY
DYDSNAIHVE LMKNKSGPEI LAAYQRAHAL FTQRGLRPQL QRLDNEASTA LQAFMTLEHV
DFQLAPPHLH RRNAAERAIR TFKNHFIAGL CTTNPDFPLH LWDRLLPQAL ITLNLLRRSR
INPKLSAHAQ LHGAFDYNRT PLAPPGTRVL VHVKPAVRET WAPHAVEGWY LGPALNHYRC
HRVWITETRA ERVADTLSWF PTRIPMPAAS STDRALAAAR DLVHALQNPS PASPFAPLDA
NQHQALTDLA NLFATVAAPV DDVPAPAPVP PVRPPAPATP LAQHFRGCPP RPPITLAPAT
PAVAAAQHAN NRQPQP