Gene PHATRDRAFT_43587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43587 
Symbol 
ID7197316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp890909 
End bp893860 
Gene Length2952 bp 
Protein Length977 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177710 
Protein GI219111917 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTTAA TGGTACAACA ACAGGGTCAA AATCGGAGAC GGATCATCTC GTCCACACAT 
CATCTAACTA TAGTATCGAA CACTTCAGAT ACGCCTCCCA CTCTCCTTCG ACAGACGGCT
TGCTTCATGA TTCCAACAAT TTCGGCAGGG TTACGATGGT CGTCGTGTGC CGTTGTCGTC
GTCGGAGTTG TCGCCCTTCT GAGCTTTTCC CCCTCGACGC TAGCGCTGGT AGTTCCACAG
GCTCCGGGAA GACCACGGTC TGGGACGACA AGTACAGGGT TCGGCAAAAG TAGTAGCCAA
TCGATTGTGC CGCGACCAAG CCACTGCCGT ACTCGCCGTA CCGCGACGTG GATGGTTTTG
ACTCCTCCAT CCCAACGGCA ACCAACGCCA TCCGTATCCC AATCATCACG ACCGTCGCGA
TCCACCACGT TATCGACGAC GTCCCCGCCG CAAGGTACAC CCGTTCAATC CGCATCGGAG
AATACTGATT CCACCAGTGG GTCTACGGGG ACTGGACAAG CTCGCCCAAA GAGCGCCACA
CGCAGTAGCT GCAGCAGTAG CAGTAGTACG CGATTACACG CCCGTAACAA TAATCTGCTC
AAGCGTAAAA ATTACCAGCG CAATTTCATT CCGGAATCAG AACGTGAAGT GAAAACACTG
CGTCGATCGC GCCAGGCCAA GTATGAGCAG CTGAAGTCGG GCTCTGATGT GGCACCCAAT
ATCTGGAGTT TCGAAAGTCT TTTTCCGGAT CCGGTCTGGG ACGAGACCTC CGTGCAACGA
GACTTGTACC AAATTAAAGA GAGGGACGAG AAACAAAAGC AACGGCTCGC CTTTCAAAAA
AATGCCACCG ATGGCATCAA CAGCAACAAC ATCCGACAGA AAAAGGATCC TGTGGTAGAT
TCTACAAAGC TGCGCGAGAA GCGCGATAGC CTCGTCAATC CAAAAATGCG ATCACCCTCA
TACGGTGGAA GCTCCGTCAT GCGATTGTGG CGCGAGCCCA AGCTCAGCTC TCTCGAATCA
CCTTCAATGG AAAGCAACCG TGCTCTTCCT ACCCGCAAGA ACACGATTCC GAACAGCATA
AGCAAAACTA GTTCGGAGAG TATTGCAAAC GTTCGTCCGA ACGAGAAACT GCACAACATG
TCCAACGTCC TCACCATGAC AACAAGCTCG AACAGCAACA ACAAAACCAA GCCAGCGAAA
GTCGACGCCG ATTTAACGCG TCTAGTTCGG GATCGTATTT TCGGCTATCG CCGAACCAAA
ACCGGTCAAC TCCAGTACGA CACATCGCTT ATGGGAGACG GAGCCGTACA GTTTCGCGAC
GGGGTCCGCC TCAGCAATCC TTTGCGAGTC AACGCGGATC GACTCAATTA CCTCGCCAAA
AAGGAATTAC AACACGGTCG GGTGGAAGAA GCCCAAGAAC TGTATACGAT TGCCCTGCAA
ATCGATCCCC GCGATGGGCG CGCTTACCTC GGGATGAGTC GCTGTGCAAG CCGGCGGCGG
GACTTTAAAC TCGCCAAAGT CTGGTTGCAA ACGGGCATTT CCAATGCCGT GTCCGTTAAC
GAAAACACCA TGCAAGCTGA TCGTGGCGCC AACCCGTTCT TACTGCAGGC GCTCGGCTGC
TTGGAAGAAA ATTCGGGACG ACTTTCCGAG GCGGAGGCCT TGTATATTGC GGCGGCCAAA
TCAAGACCTA CTCATGCAGC TGCATGGGTC AGTCTGGGGC AGTTAAGAAT CCGCAAATTG
GGACAATCCG CTAACGCTGG GCGAGTTTGT TTTCAATCCG CGGAACGAGA ATGGCAACGA
GCATCGCTAC CCCCGTCAGC ACACGTTTAC ACGGCCTGGG CGGCCTTGGA ATGCGAAGCA
AACGACATAC GGCGGGCCCG CCAACTATAC AAGGCTGCCT TAGATGTTGA CCCAAGAAGT
TCCGTGGCCT GGTTGCAGCT CGGTGTCATG GAAGCAGATG AGGAGAACTG GAACGAAGCT
GAAACTTGCT TTGAAACAGC GTTAAAATTT GATCGTCGGA ATTCGCGACT GCTGCAAGCA
TACGCACTCA TGGAAACGAA ACGGCCTAAC GGAAACAGTC GGAAGGCGAT TGGATTGCTA
GAGCGTGCCC TCAAGGCGAA TCCCAGAGAC GCCGGTGTAC TGCAAGCTTA CGCTTTGTAC
GTTGCCGAAC TGGGCGACGT GGACGCCGCT CGCGATTTGC TACGACGAGG GGCCGAAGCC
AACAAGCGCC ACGCCCCGGT CTGGCAGGCC TGGGCGGTAC TAGAAACGCG CCATGGAAAC
GTTCAGGAAG CCCGCTCAAT TTTTCAAGAG GGCATTTGGG CTTGCGCGCA ATTGACGGGT
GGCCAGTCGG GTGGCTACCG GTGCGCCCGA CTGTGGCAGG CCTGGGGCGT GTTAGAGGCC
AGAGAAGGCG ACGCTGCCGC GGCTAGAAGA TGTTTTTCGC GGGCCCTGGA TGCCGATAGT
CGTAACGTAG CGGCAGTCAC AGCCTGGGCC TTGATGGAGG AAGAGTTTGG CAACGTTCGG
GACGCCCGAG CTATTTATGA ACGATCGCTG CGGCTGTTCG CTGCTGGCAG TGGTGAGAAA
ACATCAATAT GGAGAAACTA CGAACTCATG GAACAGCGGC TTGGTCACGT GGCGGCCGCC
CAAAACGTCT ATCAGCGGTC CATGCGGGAA GCAATTACCG TCTCGGATGA AATCGCCGAC
AATATTGTGG GCCTGTCGGC TAAGAGTACA ACTCCCCTCC CGGACTTGAC AAACGTACTG
AGTAGATCGT CGGACGAAGT GGAAGTTTTA CGATGGGAAG GCCAATCAAA ATCGAGCTTG
GGTGGCGAAG TTTGGCTCAA CGACCGGGCT ATTGAAGGCA AGGTACCATT TGACATGAAG
ACGAACCAAC GACGGAACAA GAAAACCGAT AAAAAATACA ATCAAACCCC GTAGAAAAGG
TTGATAGATG TG
 
Protein sequence
MALMVQQQGQ NRRRIISSTH HLTIVSNTSD TPPTLLRQTA CFMIPTISAG LRWSSCAVVV 
VGVVALLSFS PSTLALVVPQ APGRPRSGTT STGFGKSSSQ SIVPRPSHCR TRRTATWMVL
TPPSQRQPTP SVSQSSRPSR STTLSTTSPP QGTPVQSASE NTDSTSGSTG TGQARPKSAT
RSSCSSSSST RLHARNNNLL KRKNYQRNFI PESEREVKTL RRSRQAKYEQ LKSGSDVAPN
IWSFESLFPD PVWDETSVQR DLYQIKERDE KQKQRLAFQK NATDGINSNN IRQKKDPVVD
STKLREKRDS LVNPKMRSPS YGGSSVMRLW REPKLSSLES PSMESNRALP TRKNTIPNSI
SKTSSESIAN VRPNEKLHNM SNVLTMTTSS NSNNKTKPAK VDADLTRLVR DRIFGYRRTK
TGQLQYDTSL MGDGAVQFRD GVRLSNPLRV NADRLNYLAK KELQHGRVEE AQELYTIALQ
IDPRDGRAYL GMSRCASRRR DFKLAKVWLQ TGISNAVSVN ENTMQADRGA NPFLLQALGC
LEENSGRLSE AEALYIAAAK SRPTHAAAWV SLGQLRIRKL GQSANAGRVC FQSAEREWQR
ASLPPSAHVY TAWAALECEA NDIRRARQLY KAALDVDPRS SVAWLQLGVM EADEENWNEA
ETCFETALKF DRRNSRLLQA YALMETKRPN GNSRKAIGLL ERALKANPRD AGVLQAYALY
VAELGDVDAA RDLLRRGAEA NKRHAPVWQA WAVLETRHGN VQEARSIFQE GIWACAQLTG
GQSGGYRCAR LWQAWGVLEA REGDAAAARR CFSRALDADS RNVAAVTAWA LMEEEFGNVR
DARAIYERSL RLFAAGSGEK TSIWRNYELM EQRLGHVAAA QNVYQRSMRE AITVSDEIAD
NIVGLSAKST TPLPDLTNVL SRSSDEVEVL RWEGQSKSSL GGEVWLNDRA IEGKVPFDMK
TNQRRNKKTD KKYNQTP