Gene PHATRDRAFT_44438 details


Gene Information

Locus tag: PHATRDRAFT_44438
Symbol:
ID: 7197678
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 547950
End bp: 549331
Gene Length: 1382 bp
Protein Length: 458 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002178250
Protein GI: 219114909
COG category:
COG ID:
TIGRFAM ID:
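
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes inclusive start/end coordinates and that the 5 bp preceding the first ATG of the listed gene sequence are not translated; both are inferences from this record rather than statements made by it, and the function name `span_length` is hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


gene_length = span_length(547950, 549331)

# Assumption: translation begins at the first ATG, which sits 5 bp into
# the gene sequence given in this record.
leader_bp = 5
codons = (gene_length - leader_bp) // 3  # count includes the stop codon
protein_length = codons - 1              # drop the stop codon

print(gene_length, protein_length)  # 1382 458
```

Under these assumptions the figures agree with the Gene Length and Protein Length fields.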


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.426284
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTAACATGTC CGTCGTGCAG CCTACAAAAC GAGCCAAAGC TGCCGTTGGC CTCATCCTTG 
TGCTTGTTAT CGGAAGCGAT GGCTTAGATC TGTCTTTAAG ACCGCACTCG AAGCCTTCCA
GCGTATCCTA CCCTTGCCGC TTTGTCGTCG GTCCGACTCG CCTCGCGAAG CCCCATCGGC
ATCCGCGATT ACCGATGAGA GGATATCGAT ATCCGACAGT GCGGGGGGCA GGCGACTCTC
TTAGTAGTGA GCCTTCGCAG AATCGCCCTT CTTGGGCTTG GGTTTGGATG CCGACGTGGT
TGTTTACCAT GAATCCGCTA GCTCAGTTTT TGACTACCAT GGGATTTTAT TTACTGCATA
TTCTCGTTCT GTCGCAGCGG CAGCTGGTCT TTCCGATCCA GTTGATACCC AACGAGAAAG
GTCAATTTGC TTCGATTGGT TATGACTCTA TTGCTGGGAT TCTTGTCGCC GTGTGTTACA
CAATATTGCG GAAAGCATCT ACGCAATCCT CATTGCAATC ATCGTCGGAG GCCCCTTTTC
CCGCACTCTT CAAGAGTCCG ACGTCTGATG CTCCCTGGAA GCTGCCAAGC AACCACATAA
GACACCGAGT CTCAAGCTTC CTGACAATCA TTCTGCTAGT ACAGGCCTAC TTCTTTACGG
GTCGCTTTAG TTTGTTTTGG GAAGATACAC TGTACACAAT GTCGGGACTC GGATGGCCCT
TGACGGCTCC CATGCACCGC AGTCTTTGCG TATTGTTCGG GCACTTGAGT TGGCTAATAA
CGGGAACCTT GCTTTTGCGA TTTGTACCTC GACCACCGCG ATTTTTTGGG CCCAAAGCCG
TCTACAACAC CGACGATGAT GATGCGGTTT CAAGCAAGAG TCCAGCAAAA CCGGCCTACC
GGTGGTTTCG GTCCAGTATT CGCCGCAACT GGGTATGGTG GGTCGTAGGT GGCTACTTTG
TCAGCAGTTG GCTTTTTAAC ATTACTGACG TCATTAATCA GTTTGTCTTG CCGACGGCTG
TCTTGGAAGA TGCGCAAGAA TCGGTAGTAT CCCAGTTGGT CAATCCAGAA CACAACGATA
TCGCGGCTAG TGTCGCTGGG TACATAGCGC CATGCCTGAC TGCCCCTTGG TGGGAAGAAG
TTTTGTATCG TGGGTTTCTC CTTGCGGGAC TGTCACAGCT ACTGGGATAT CCCTGGGCAG
TCTTTGTACA GGGTCTTATC TTTTCGGCCC ACCACATGTC TTTGACAGCC GCTCTGCCGC
TAGCTGTCCT AGGATGGACC TGGGCGATTC TATACACCAA ATGCCGCAAT CTTTTTACAG
TAATTTTTGT ACACGCCCTC TGGAACTCAC GGGTATTCCT GGGATCATGG CTCGGATTAT
AG
 
Protein sequence
MSVVQPTKRA KAAVGLILVL VIGSDGLDLS LRPHSKPSSV SYPCRFVVGP TRLAKPHRHP 
RLPMRGYRYP TVRGAGDSLS SEPSQNRPSW AWVWMPTWLF TMNPLAQFLT TMGFYLLHIL
VLSQRQLVFP IQLIPNEKGQ FASIGYDSIA GILVAVCYTI LRKASTQSSL QSSSEAPFPA
LFKSPTSDAP WKLPSNHIRH RVSSFLTIIL LVQAYFFTGR FSLFWEDTLY TMSGLGWPLT
APMHRSLCVL FGHLSWLITG TLLLRFVPRP PRFFGPKAVY NTDDDDAVSS KSPAKPAYRW
FRSSIRRNWV WWVVGGYFVS SWLFNITDVI NQFVLPTAVL EDAQESVVSQ LVNPEHNDIA
ASVAGYIAPC LTAPWWEEVL YRGFLLAGLS QLLGYPWAVF VQGLIFSAHH MSLTAALPLA
VLGWTWAILY TKCRNLFTVI FVHALWNSRV FLGSWLGL
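
The relationship between the two sequences can be checked by translating the gene sequence from its first ATG. The following is a minimal sketch, not the database's own method: it assumes the standard genetic code (the Translation table field of this record is empty), and the helper name `translate_from_first_atg` is hypothetical. Only the first line of the gene sequence is used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of input."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


first_line = "CTAACATGTC CGTCGTGCAG CCTACAAAAC GAGCCAAAGC TGCCGTTGGC CTCATCCTTG"
print(translate_from_first_atg(first_line))  # MSVVQPTKRAKAAVGLIL
```

The output matches the first 18 residues of the protein sequence above; passing the full gene sequence to the same function would extend the comparison to the whole protein.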