Gene PHATRDRAFT_49658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49658 
Symbol 
ID7198302 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp331820 
End bp333209 
Gene Length1390 bp 
Protein Length339 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184346 
Protein GI219128283 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GACGAAACAC GCTAAAGCTT CACGCCGAAT AATTCTTCCA TGGTCAAGAA AATTACGCCG 
TTCTTTGTTT CTCCAGCTAT CCGAATTTTT ATTCTGGTTT CGTGTGCCAC TGCGTTTCAA
CCGAAGGCTT TTCATGCCAT TGACATCAAC CCGACCAATC ACTTGTTTGG CAGGAGTAAC
GGGACGCCGA ATCGCAACGA GTGCGCCATG TCGTTCAACT CCGCAGACCC CAAAAAGGAA
TTGTCGGTAG GCTTTATCGG CTGCGGGACG ATTGCCAGTG CCATTGCGAC CGGCTTGGCC
CTGCAAGACA AGGTATCGGT CACAAACATC GCCGTTACCA AACGATCGGA AGCCAAGTCT
TCTGCCTTGC AAAAGTCCTT CGGTGACCTC GTTTCGATAC ACGAGGATGC GCAAGAGTTG
GTGGATCAGT CGGATGTTGT ATTCGTAACT GTTCTACCTG AGCAGGCATC GCAAGTTCTG
CAAGAAGTTA CCTTCGACAG CACTCGACAT TCGCTGGTTT CTCTGGTGTC CACCAGTACA
CTGGACGACT TGATCAGTGA CTCTGGATTG CCCGCCGAAA ATGTATCCAA GATGATATGT
AAGTCAACAT CCACAAAAAT CGCTTCGAAA TTGTCGAACC TGGTCGCCTG ACACGTAAAT
GTCTATTTCG ACCGATTCCT ACCGTCAGGT CTCCCCGCCG TGGCTAAACT CAAGGGCGTT
TCACTTGTAG TACCGAAACA AAACCACAAT CCCATTCTAC TACAAATGCT GGAAAGCTTA
GGGGGCTACG TTGAGTGCGA AACACTACAC CAGATGAACG CAATGATGGT CCCCACCGGC
ATGATGGGGA GCTTTTACGG TCTGTTACGG AACAATCGTG ACTGGCTTGT GCAGCAGGGT
GTGCAGGCCA GCGATGCTTC CTACTTTGTT GCGAAACAGT ACATGAGCAT GATGGAAGAT
GCCGTCGAAT CTTGCGTGGA TCCGTCACGT TTTGACGATT TGGTAGAAGA ACAGACCCCT
GGAGGTTTGA ACGAACAGGG TCTGGCGAAT CTATCGCAAC AAGGAGTCTT TCAATCGTAC
AATCAAATTA TGGATGCTCT TTTGTCTCGC TTAGAAGGTC GATCGGACGG ATCGTTAACT
GAGAAGTAAA GCCTGTAACA CCGGCTGCTC CCTCATATTC GATTCGTATA CCGAGGGCAT
TCATCCGGTG AGAATGCTTG AAATATGCAA ACGACCTACG CCTTGCTTGT TTATACTTTT
GGGAGACACA TAAACACTGC AGCAGCTGAC TGTGAAATCT CGATTGCCTG ATCTTTGATT
GTGAATGAGG TGCAAGGCAA TTGCTTTGAT AGTTAACAGA AAGGATTTAG CTTTTTCTAT
GAGCCTAGCT
 
Protein sequence
MVKKITPFFV SPAIRIFILV SCATAFQPKA FHAIDINPTN HLFGRSNGTP NRNECAMSFN 
SADPKKELSV GFIGCGTIAS AIATGLALQD KVSVTNIAVT KRSEAKSSAL QKSFGDLVSI
HEDAQELVDQ SDVVFVTVLP EQASQVLQEV TFDSTRHSLV SLVSTSTLDD LISDSGLPAE
NVSKMICLPA VAKLKGVSLV VPKQNHNPIL LQMLESLGGY VECETLHQMN AMMVPTGMMG
SFYGLLRNNR DWLVQQGVQA SDASYFVAKQ YMSMMEDAVE SCVDPSRFDD LVEEQTPGGL
NEQGLANLSQ QGVFQSYNQI MDALLSRLEG RSDGSLTEK