Gene PHATRDRAFT_41562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41562 
Symbol 
ID7199400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011699 
Strand
Start bp175042 
End bp176607 
Gene Length1566 bp 
Protein Length521 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185495 
Protein GI219130697 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0981287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAAT TTAAAGTTAC AGCTTCCATT CTGCTCAGTG CTTTGCTTTT GCAAGGATCG 
CAGGCTAAGT TTTTCTCGGA GGACTCTGAG GTCCTCACTG TCAATCGTGT ACCAAAGATT
AGCAAAAGTG GCAACAAAAG CCAACCTCTA AGGGAACCGG AATCGGTGCC TTCGTCGTCT
TTTTCGGGCG GTTTTATTAC AGCCGAAAAG GAAGTTTCGA CTGCCTTCCG AGACCGTACC
AGTACCGCTT TTGGCCGAGT CTCCTGGGGA AACAATACTC TCAAACGGAG CAAAAGGACA
AAGAGTGCCA AAGGCTCCGA CCCAGGAAGC TCAGATTCAA CTCCGGGCAG CGAGATTCTA
ACTGTCATTC GTCAACGGCA AAAACGAGGG GGTAAGGGTT CGAGCAAGGA TGACAACGCT
TTACGAACGA TTGAGCCAAA AGAGAGCCCC ACTCCGTCAC CGGTGGACCT TTCCACTTTT
TCTCCCGGTA TGATCGACGA AACGCCCCAA CCCAGCCCCG AAGAAACTTT TGCCCCCACA
TTGACTGGAA CGACGCCTGT ACCAAGCATA ATTCAGACAA ATCCTGGAGT ACTGGCAACA
CCATTTCCTA CTGACGAAGA TGCCTTGCCG ACGCCTTTTC CTACATTCTT TCCAACTGCG
AACGAAAACC CTTTCCCTAC CATTCCACAA AGTACATTCT TGCCCACTCC CACACCTCTA
TGCTTCGAGT CATCGCTGGA ATTGCAATTC GCGGTAGACG AGTATTTGCT AGACAGCAGT
CCCGACACGG AAGTGGCGTT CTTCTACGGG CATCCGATGG AAGAGTGGTG CGTCTCCAGC
ATCGTGGACT TCAGCAACCT TTTCTCTGCC TTTCGGAATT CGGACACCAG TACCTTCAAT
GAGCCTCTGA ATGGTTGGGA TATGTCGAGT GCCGAAACAC TCGAAAACAT GTTCGCGGGG
GCCGAAAGCT TCGACCAACC CCTGTTTGAT TGGGATACGT CCAACGTGTC TACTATGACT
CGAGCGTTTA GTGGAGCAGA AAGTTTCAAC AGCGACATAC GAGCTTGGGA TACCTCAAAC
GTCCTAGATA TGCAAGCAAT GTTTGCGGGA GCTATCAGCT TTAATGGCAA TATTGCTTCT
TGGGACATTC GAAATGTGGA GAATTTGTCT TTCATGTTCG CTGAAGCAAC GAGCTTTGCT
GGGGATTTAT CGCAATGGGA ACCACTCAGT GCCATTTCAA TGGTGCAAAT GTTTCTCGGT
GCTAGCTCAT TCAACAGCGA TATTTCGAGA TGGGACGTAT CGGCAGTCGA ATTATTCTCG
AGTATGTTCA ACGAAGCGAT TTCCTTCAAC CAAGACATAT CTGGATTCGA TTTGTCGAGT
GCGACCAATT TGGACCGTAT GATGTTCATG GCAGAGTCCT TTAGCCAAGA TGTATGCAAC
TGGGGTTCTA CGCTTGACCC ATTTTTGGCT CCGTTCGAAG TTTTTCAAGG CACGGATTGC
CCAAACGTCT CGGATCCCAG CCTTGATAAC ACCCCTCCAG GTCCCCTTTG CTTTCAATGT
TCGTAG
 
Protein sequence
MAQFKVTASI LLSALLLQGS QAKFFSEDSE VLTVNRVPKI SKSGNKSQPL REPESVPSSS 
FSGGFITAEK EVSTAFRDRT STAFGRVSWG NNTLKRSKRT KSAKGSDPGS SDSTPGSEIL
TVIRQRQKRG GKGSSKDDNA LRTIEPKESP TPSPVDLSTF SPGMIDETPQ PSPEETFAPT
LTGTTPVPSI IQTNPGVLAT PFPTDEDALP TPFPTFFPTA NENPFPTIPQ STFLPTPTPL
CFESSLELQF AVDEYLLDSS PDTEVAFFYG HPMEEWCVSS IVDFSNLFSA FRNSDTSTFN
EPLNGWDMSS AETLENMFAG AESFDQPLFD WDTSNVSTMT RAFSGAESFN SDIRAWDTSN
VLDMQAMFAG AISFNGNIAS WDIRNVENLS FMFAEATSFA GDLSQWEPLS AISMVQMFLG
ASSFNSDISR WDVSAVELFS SMFNEAISFN QDISGFDLSS ATNLDRMMFM AESFSQDVCN
WGSTLDPFLA PFEVFQGTDC PNVSDPSLDN TPPGPLCFQC S