Gene PHATR_43943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_43943 
Symbol 
ID7204372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp495554 
End bp497953 
Gene Length2400 bp 
Protein Length799 aa 
Translation table 
GC content44% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186359 
Protein GI219113551 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCCAT GTGAGGTAAA AAAGAGTATT TCCTATATTG ACGGTGGTGA AGTAAAAATA 
GCACTTGATG CAGCAAAAAG TAATGCCCCA CCAAGCGTGA TTGGGCGGCT GCTGAACATA
CGAACTGGAG ATATCTTGTC AGGTTCATCT CTAAAAAATA TACGGATGCA AGCTGAAAAA
GGTGAACGGA ACAAATTTGG TGCAAACGAT AATTTCACGA CGCAAGCGGA TCAACTTCTA
GCTTATCTTG AGAGCACCCC GGATGTCAGC TTCTGTGCCA TATATGATGA ACCAGATTCC
CCTTTATTCA CTGTTTACAA GCAGAGGGCA AAAACTGGAC GTCGGCACCT ACACACCAGT
ACACGGAGTA TCTCTGGAGG AATTGCCCAA CAAGAAGTGC TCAATGAAAA GGTGCTAGAT
GCTATTGATC CAAGGGGAGA GCTTGATGAC TATATAGATA GGACGCGGAG GGCGTTCAAG
TTGAAGGGCA ACGAAAAAAT GCTTTTAGGA GTGGCCTGGA CCAACAACGA GAGTAGAAGA
ATCTTTGCTC GCTATTCCGA GATCATGGTA GCAGATGTGA CAGAAGGTAC CAACAATGCA
AAACGGCCGC TGTTTTTGTT TTCGGGAAAG ACATCAAATC AAAACACGTT TACAGCACTT
TGGGCCTTTC TACCGCAACA ATCTCGTTGG GCCTTCCGAT GGGTGTGGAC AAGATGCATT
CCACAGCTCT TACCTGAACA GGGGCTGAAC AGAATGCGTC TATTGATTAC TGATGGAGAC
CCGCGAGAGT ATGGTACTTT TTTAGATGCA ATACCTACTT GGTATAGCTT GTGTCGGCAC
AAACTATGCC ATTGGCATCT ACTCTATCGC GGCAGTCTTA TGAAAGCACA GACTGGAAAC
TGTGGAACAA AAGCAAAAAT TCTATTCCAT GTGGTCCTGA AGTGGATAGA AAGCTGGATG
ACAAAAATTG AGACGCAAGA GGAGTACAAT TTGTCAACTG GGCTTTTGAT TGACTGGCTG
AAATCTCCGG AGGCACTTGA TACAAATTTG GGCGGAATGG GTTGTGCCCT TGTTTCGCAG
ATAAATGCAT TTTTGACGTC CTCTCTGTTT CCGCACGAGC AACGCTGGGC TCGATACCAT
TTTCTGAACG TGAGAGCATT CAACACGTCC GCAAGTTCCT ACGGAGAAGC AGAGAACAGT
GCTCTAAAAC GACGGGGTGA TGGGGTCAAG CCAAACTTTT CGGTGCCAAA AGCAACACGG
GCAATAAACG AAGGGACTCA ATTGCGAACA GTGAAGAGGC AACAAAAAGC AGTTCATAAC
CTCAATGCTA CAAAGAAGAC AAAGGCAGCA AACTACACCA ATATATCCGA CCTAGTAGAT
TGCATACAGG AAACCATATC CCATGAATTC AATGCAGCCA AAAAATATGA CCTCTTTTGC
CCGGGTCCAA AAGAATTTTG GGTAAAGCGA GCATGGTACC AAATTCCAAG CGAGACCTAC
CAGGATTTCA ACGACAGCAA CTTTTGCCAA TTTATGATTC CACAGTTTGA GCGCACCCGC
ATCGTAAAAA TTACGGAAAT TGAAGGTGAA CTCTATCTGG AATGTAGTTG CGGCAAGTTC
CAACGACAAG CTTCTCCATG TGCTCACATC TACAAAGTAC TTAACCGACC ACCACAATCA
ACAGACGTTT CTGTGAGATG GACAAAAATT TGGGATGTTT ACCTGCATCG ACCTGGATAT
CATGATCTGT CGGACCAGTT AGAGGAATTG TATAAGAAGG AGCGGCCAGG GCCACATTTC
GAAAACACAA ATCAGTGGGA AGTTGGAAAG GGTGAGAGAG AGTACAACTA TTTTAAGAGA
TCACTTCCAA GCGAGCCCAC CATTATCCAG AAGTACAGCA GATGGGCTGA TTCTTTTTCA
CGACAACCTG GATGTTATGT GCATAAAAGC ACTGAACAGG AAACAGTTCC TGCAGCAAGC
GGTATGGTGC AAGAGTTGAC CAGCCTTTCC CAGGGGTATG CTATTGAAAC TCAATTGGAT
AGTGAAATGG ATGTTGGAGA TGTAACTGTC ATGCAGGTTG AAGAAATTGA TTCAAATCTC
TCAAAATCGG GTAAAAGTCC ATACACAAAC AATCTTCATT TTTACGAGGA AATCTCAAAA
CTTGCCAAAT TCAATTCAAA AGCTGCTGAC ATAATGACAA AAGGAATGCA GGAAACTTTG
GAATTGCTAC AGAAACATGT TGCAGAAGGG TCAGGTATGG TAGATTACAG TATTGGCCCA
GCTATTGGAA AAGAACCAGT AGGCCAAAGG CTCAGGCCAA GCTACAGTCC TTCAAAGAGC
AAGAATCTAA GAGACAGACA AAAAGAAACA AAGGCAAAAT TTTGGTGGCT GAACAAGTAA
 
Protein sequence
MDPCEVKKSI SYIDGGEVKI ALDAAKSNAP PSVIGRLLNI RTGDILSGSS LKNIRMQAEK 
GERNKFGAND NFTTQADQLL AYLESTPDVS FCAIYDEPDS PLFTVYKQRA KTGRRHLHTS
TRSISGGIAQ QEVLNEKVLD AIDPRGELDD YIDRTRRAFK LKGNEKMLLG VAWTNNESRR
IFARYSEIMV ADVTEGTNNA KRPLFLFSGK TSNQNTFTAL WAFLPQQSRW AFRWVWTRCI
PQLLPEQGLN RMRLLITDGD PREYGTFLDA IPTWYSLCRH KLCHWHLLYR GSLMKAQTGN
CGTKAKILFH VVLKWIESWM TKIETQEEYN LSTGLLIDWL KSPEALDTNL GGMGCALVSQ
INAFLTSSLF PHEQRWARYH FLNVRAFNTS ASSYGEAENS ALKRRGDGVK PNFSVPKATR
AINEGTQLRT VKRQQKAVHN LNATKKTKAA NYTNISDLVD CIQETISHEF NAAKKYDLFC
PGPKEFWVKR AWYQIPSETY QDFNDSNFCQ FMIPQFERTR IVKITEIEGE LYLECSCGKF
QRQASPCAHI YKVLNRPPQS TDVSVRWTKI WDVYLHRPGY HDLSDQLEEL YKKERPGPHF
ENTNQWEVGK GEREYNYFKR SLPSEPTIIQ KYSRWADSFS RQPGCYVHKS TEQETVPAAS
GMVQELTSLS QGYAIETQLD SEMDVGDVTV MQVEEIDSNL SKSGKSPYTN NLHFYEEISK
LAKFNSKAAD IMTKGMQETL ELLQKHVAEG SGMVDYSIGP AIGKEPVGQR LRPSYSPSKS
KNLRDRQKET KAKFWWLNK