Gene PHATRDRAFT_45404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45404 
Symbol 
ID7200529 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp68431 
End bp70338 
Gene Length1908 bp 
Protein Length635 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179784 
Protein GI219118000 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCT TCGCAGCAGT ATCATGCACC AGTACAGTCG TGTCGCGGCA CGTACTGCGA 
CCCTTCCTCA AGCGGCGGAG GCAACTCGGC GACCCCCCTT CGACAATTCT CGACAGCGTT
CGATCGACTC GCAAAGGGAC GACAACATGG TTGCATCAAC AATCGCGCGC GTCCACTACG
GCAAGATCCG CAGAAGATCA GCCGCTGGTG TCTCTCCGCA ACGCCAGACT TTCGTACCGT
CCCGAAGATA CCTCGGAGGC TCACGTTTCG CAACCCATAT CGCTCGACAT TTGGCATCCC
TCCCGAGGCG GTCACCTACT CCTCGGTCGC AACGGTACGG GCAAGTCACT CATTACGCAG
ACACTGGCCA CGAACGGTAC AGGGACGTTG GTGGACGGGG AATACGTCGT GACCGCTCCG
CAATGGCACA GTCGTACCGT CACCCACGTC ACCTTTCGCT CCCATCAAGA CGTCTTGCAG
ACCTCAGCTC ACCTTACCAG TTACAAGGTT ATTGCCGAAG GAGGACAAGT AAGCAAGGCC
GCGCAATTCC TCATTGTACG ATTCGGTCTT TATCCGCTTC TGCATAGGGA AATTTCGACG
CTTTCCACGG GAGAAATCCG CAAAGTTCTG CTCGTCCGAG CCTTGGCCAC GCGGCCACGG
TTATTGATTC TGGACAACGC CTTTGACGGA TTGGACGTGG CCAGTCGCGA AAACTTGCTC
GACTTGGTCC GTCAAACACT CCGAGGATTC AAGCAAGACA TTCTCGTCCA AGGCATCGAC
GCCAAGAATG CCGCTCGTAC GCAGATCTGC CTCGTCACGC AGCGTCCCGA AGAAGTGGCG
GACGAATTCA CCAACGTGGC GTTTCTCGAT CCTCCCCATA CGGATCGGGT GTCGCCGGAA
ACTTCGGCAC AGGGTGGCGA TTTGCGTACT ATGGTACGCA ACGGCCAAAG AGCGACACAA
ATCTTTGCGC AAAGCTTGGG AACGACTTCG CCACTAGAGG AAGGTGACCT CGACGACTCC
CCTTGGGACA GTCGAAAAGA CGAATACTGG AATGCACCGG GGTTACCGAC TTTGACAGAA
ATGTCCATAT GGTGGAACCA TGGACGTAAA GATGACGATG ACGGCACTTC AAGTACAAAC
ACAACACTCC CACTGGTGGA CGCCCAGGGT CTACGAATAC AGAAGGGATC CACCGTCGTG
CTACAAGAGC TAGATTGGAA AGTCTGGCCA TCGCAACACT GGTTGGTGGC CGGCGGCAAC
GGAGCCGGCA AGTCAACCCT CAGTCGGCTG TTGGCTTATT GTGAAACCGA TAGTGATACG
GAGGGATATT TGCGCGTACT CCATGGAAAA AGGAATCTAC CACAGATCGA TATTGATGAT
GGCCAGCAAA CAGTAGTGGG ATCGCAGTTT GTACACCGAA GGCCTGGGGT AGGGTGGGTC
TCGACTGAAT CACATTTGCA GCGTGTTCAT GATCAACGTA CGGCACGAGA GATTCTGCTG
GAAGAAGCTT CTTCTGATTC ATACATTGTC CAGACGGTTA CGGAGTGGTT TAACTTGACG
CACGATCCTA AGCTACTCGA ACAACACTTT GCTGACTTAT CACAGGGGCA ACAGAAGCTT
GTTTTATTGG CCGCAGCAAT CTCGTCACGT CCACGTATTC TTGTGTTGGA TGAGCCCTGC
CAAGGTCTCG ACATCGTCCA CCGACGACTT CTGTTGGGAT TGGTGGAGCG ACTGTGCCAA
GCGACCGACA CGAATGACAC CGACACCAGT AGTCGAAGCA TTACCTTGAT TTACATTACA
CACCATATGG AGGAAGTTCT GCCGTCAATC AATCAAGTCG TGCATCTGAA AGACGGACAA
GCAGTCTATC AAGGGTCAAG GAAGCTTTAC AACCCGGACT TGCTTTAA
 
Protein sequence
MSTFAAVSCT STVVSRHVLR PFLKRRRQLG DPPSTILDSV RSTRKGTTTW LHQQSRASTT 
ARSAEDQPLV SLRNARLSYR PEDTSEAHVS QPISLDIWHP SRGGHLLLGR NGTGKSLITQ
TLATNGTGTL VDGEYVVTAP QWHSRTVTHV TFRSHQDVLQ TSAHLTSYKV IAEGGQVSKA
AQFLIVRFGL YPLLHREIST LSTGEIRKVL LVRALATRPR LLILDNAFDG LDVASRENLL
DLVRQTLRGF KQDILVQGID AKNAARTQIC LVTQRPEEVA DEFTNVAFLD PPHTDRVSPE
TSAQGGDLRT MVRNGQRATQ IFAQSLGTTS PLEEGDLDDS PWDSRKDEYW NAPGLPTLTE
MSIWWNHGRK DDDDGTSSTN TTLPLVDAQG LRIQKGSTVV LQELDWKVWP SQHWLVAGGN
GAGKSTLSRL LAYCETDSDT EGYLRVLHGK RNLPQIDIDD GQQTVVGSQF VHRRPGVGWV
STESHLQRVH DQRTAREILL EEASSDSYIV QTVTEWFNLT HDPKLLEQHF ADLSQGQQKL
VLLAAAISSR PRILVLDEPC QGLDIVHRRL LLGLVERLCQ ATDTNDTDTS SRSITLIYIT
HHMEEVLPSI NQVVHLKDGQ AVYQGSRKLY NPDLL