Gene PHATR_44106 details


Gene Information

Locus tag: PHATR_44106
Symbol:
ID: 7203867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 1002025
End bp: 1004028
Gene Length: 2004 bp
Protein Length: 577 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002186444
Protein GI: 219113721
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.287897
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTCAGACCCG CTAGTTTGTA CGATGCAAAT CAGCTTCTAG AGAGACGATA CATTGCCGGA 
TTACCAACAC GTACAAAGTG TTAATTGTGT CCATGCCTGA GTGTATCTAG TAGATATAGA
TACTTTTGGT AGGAAGCTGC CTGCTGCGAC CGCAGAAACG CCTCACTGTC AATCGATATA
TCCACAGAAT CGAATGTCTA GTACATCGTT AACCGCTCGG GAAAATTGTG ACACAAACGA
TGTGAACGCT GTAGAGTTTG TAAAAGTGGC CATCGTTGGA GCGGGAGCTT CTGGGCTGCA
GTGTGCTCAC ACATTAATTC GAGACTTTGG CTTCGCCCCG TCCGATATTG TAATTTTGGA
AGCGCGTGAA AGAGTAGGAG GTCGCCTTTA CACCACGATG GAAACAAGAA GAGGCCTGGA
TGGAACTTCG TTGCATTTTG CCATGGATCA CGGTGCGGCA TGGGTACACG GAACTGGCCT
CGATTGGGAA GCTCCACTGA GTAAAGAAGA TCGCTCCTTC CCTATGAGGA ATCCCATGAT
GGCGCTCTTG GAAAAAGCTA CACCTTCAGG CGAGTCCGTA TATGAAAGGC ATTTGAACCC
GATCTTTCTA GGCAATCCTT GGATGCGACC CCAAAGTATA GCGCACGGCG CCAATCAGAT
CGTTTTGTAT GTAAATGGAC AAGAGCTCGC TAAAGATTCA CCATTGATCT CGTTAGCGCT
TAAACGCCAT TACGCTCTTT TGGACCGTGT TTCGGATGTT GGTAACACCA TGTTTGAACA
AGGAGAAGGC ATGGAGACGA CAATTCAAAG CGTGAAAGAA ACAATTTCAA AGATTCAAGA
CGAGCCAAAT TTTCGATCGG AACTAGAACG TTTGTCCGAG GATGACATGG AACAGGTACT
TGCTTTAACC CCTTTTTATC TGCACATGAT CGAGTGCTGG TACGGAAAGG AGACTTCGGA
TTTACAGCTC TGCGAGTTTG TCGATGACAA ACTGAATGAC GATAACGCCG ATGAGACATA
CACTGCGGAG GGCGACTTTT ATGGACCACA CTGTACCTTG AAGAAGGGTA TGAGTTCGAT
TTTGGAACCT TTACTACGAG ATGGCGTGAA CAAGCGGATA CGATTGAAAG AGGAAGTCAT
TAAGATATCC AACGAGACTA ACACCGTCCT TCTAAACACG GTCTTAGGGA CGCAAATCAG
GGCGAATGCG TGCGTACTAA CCCTCCCAGC TGGTTGTTTG AAAGAGACTG AAGGTAGGTA
CAAATTCTTT GAACCTGCAA TGAGCGCGAG CAAGCTTGAA GCAATCAGTC ACATGAGCAT
GGGCAGCTAC AAAAAAGTTT TCTTAACTTT TGATCGTATA TTCTGGCCGA AGGAAGAGGC
GTTTCTGGGG ATGATCCGTA AAAGCTCTTT CCAGACGTCA GATGAGCCGC CTGGTAACTG
CATGCTTTTC GACAATTTAT GGGCGCGAAA TGATATTCCT TGCATTGAAG CTGTCCTGTC
TGGATCTGCC GGAAGCTGGG CCGTCGGAAA AAACGACGAG ATTATTCGAG ACCACGTTCT
TTCATTTATG AAGGATGCCA TGGGTATCGC TGACGAAATT TCGTCATATT GTCAAGACTG
TCAAGTCACC CGCTGGGAAG AAGACCCTTA TAGTCGAGGC GCGTATTCAT CGATGTCACT
TGGAGCGTTG AATCGGCACG TGGAAGAATT GAGAAATCCG GAATGGGAAG GACGCCTCAT
ATTCTCTGGG GAAGCTACAG TCACAGAGTT TGCAGGCAGC GTACATGCGG CGCTCTTTAG
CGGACGCAAT TCTGCCGAGA AAGTCAACGA ATATTGTACA CTCGTAGAAG CGAAATTATG
TTGCTCTCAG CTAGATGATG CGGCTGATAA GATTGGATTC CTAAAGCCGT CGAAACTCAA
TTGGTAGCAA CTCGTTTGAA AAGAAAGGTG CCTTTCCAGT ACTTCCATTA GTGATAAATG
TATACTAGCT GGCAGGTTTA TTTC
 
Protein sequence
MSSTSLTARE NCDTNDVNAV EFVKVAIVGA GASGLQCAHT LIRDFGFAPS DIVILEARER 
VGGRLYTTME TRRGLDGTSL HFAMDHGAAW VHGTGLDWEA PLSKEDRSFP MRNPMMALLE
KATPSGESVY ERHLNPIFLG NPWMRPQSIA HGANQIVLYV NGQELAKDSP LISLALKRHY
ALLDRVSDVG NTMFEQGEGM ETTIQSVKET ISKIQDEPNF RSELERLSED DMEQVLALTP
FYLHMIECWY GKETSDLQLC EFVDDKLNDD NADETYTAEG DFYGPHCTLK KGMSSILEPL
LRDGVNKRIR LKEEVIKISN ETNTVLLNTV LGTQIRANAC VLTLPAGCLK ETEGRYKFFE
PAMSASKLEA ISHMSMGSYK KVFLTFDRIF WPKEEAFLGM IRKSSFQTSD EPPGNCMLFD
NLWARNDIPC IEAVLSGSAG SWAVGKNDEI IRDHVLSFMK DAMGIADEIS SYCQDCQVTR
WEEDPYSRGA YSSMSLGALN RHVEELRNPE WEGRLIFSGE ATVTEFAGSV HAALFSGRNS
AEKVNEYCTL VEAKLCCSQL DDAADKIGFL KPSKLNW