Gene PHATRDRAFT_50155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50155 
Symbol 
ID7198854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011695 
Strand
Start bp212901 
End bp214651 
Gene Length1751 bp 
Protein Length525 aa 
Translation table 
GC content50% 
IMG OID 
Productchitinase chitin binding glycoside hydrolase family 1 
Protein accessionXP_002184991 
Protein GI219129639 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.487357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAGAAGGGT TAACAAAAAC GATTCATTCG GATCCCATGA GTGTCAAAGT AAGACCGAAG 
ACCGTCCTTT AACAATGACT TTCAAATGAC ATTTCCAGAA GCTCAATCGA CAAGTCGATG
CGCTCACAAA ACACGAATCT TGGCAAACCA AATCGCTTCC GAAATCCAAT ATTATGACGA
TTGCCATCTA CCAGTCGATT TTCTCGTTAA TTCTCGTTGG AGGATTTCTT TCTTCAACCT
CACTTGCTGC TACCTGCAGT GGGGGCACGA TAGGCGACGG TATCTGTGTC GACAACACCT
TGTGCTGTTC ACGATTTGGC TGGTGCGGAT TAGGTCCCGA TTGGTGTGAT GAAGGCAATA
CAGCATCCCC TCCTACTCCT GCTGCAGCTC CAACCATAAC AGGCTCACCG CCCTCACCGG
TTCCACCTAC AGTTCCCGTC ATACTAAATG ACGACTCTCG AATGATTGCC TACATGGGAA
ACTGGCATAC CTGTCCGACA GCCGCGCAAC TCACTCAGTA CACGCACGTG GTCTTAGCAT
TTGCAGTTTC GTATACTTGG GCTCCTGGTA AGAACCAATG TGATACACAG TGTAACATTG
CCACACCGCC TGTTTGCAAC AATGCACCGA ACGACTCTTT GATCAGCGAC CTACACGCCG
CTGGCAAGAA GATCATTCTC AGTTTCGGCG GTGCTGGTAT GGGAGGAAGC TGGTCGTCCT
CACAGGATGA CTGCTGGGAC TACTGTTTCG GGAAAGAAGA GAAGGTTGTT TCGCGCCTGA
CTGAGATTAT TGATGACATG AACTTGGACG GTATCGATAT TGACTACGAG TATTTTTACG
AAGACAATCA GAATGGCTCG GGATTTACTA AGGGAGCACA AGCCCAGAAG TTCCTTACAG
AAATTACTGT CGGCCTGCGC AATAGTCTTC CAGCTGGATC GATTGTCACC CATGCTCCTA
TGGATTCTGA TTTGGTTCCG GGAAAAGCAT TCTATAAGTT GTTGAAGGAT ATCAGTGGAA
CTTTGGATTT CATCATGCCA CAGTATTACA ATGGCCTGGT GCGTCCAGCC TTGGATGGAG
TTGATGGATC TGGATTTGGG CAAGAAACAT CTATATCCCT TTACTCTCAG TTGGCGAACG
ATTATTTTGG TGGTGACGCC ACCAAAATTG TATTCGGGTT CTGCATCAAG GATTGTAGTG
GGACTAACAG CAATGCCAAC GCGGTTCAGG CGGCAGCGGT CATGAGCGAA TTAAGCAGCG
TGTACGACTG CAACGGTGGA GCCTTCTTTT GGGTTGTCAA TGATGATACT AATGGGTCTT
GGTCCTCCCA AGTGAACCAA GTTGTGCAGC TCAATGCTGG GTGTGCGGCG TCAAACTCTC
CCAGTACTGT CGCTCCCTCG TCTTTTGTTG CCACGACCCA TCCTAGTGCT GCTCCGACCA
AACAACCGAA CCAAAGTCCC ATGGCTGCAC CCACGAAGGG AACGCCGAAC CCGTCTTTCG
CACCCACCAG CAAACCTTCG CTCCGCGCCA CTACTGCACC CAGCAACGCA GATGGTTTCG
AAGAAACCAA ATGTTGCCCG CCGGACTTCA CAGGACTGCG GGCCTTGGAC AATTGTTCCA
AGTTTTTCCA CTGTGTGGGT GGTGCCGTTG TGGGTCCAGA GACAAGTTGT GGCAGTGGGT
TGTTGTTTGA CCAGAAGTTC ATGTACTGTA ACTGGAGTGA CCAGGTGGCT TGTGTTCCCA
ACGTTTGTTA A
 
Protein sequence
MTIAIYQSIF SLILVGGFLS STSLAATCSG GTIGDGICVD NTLCCSRFGW CGLGPDWCDE 
GNTASPPTPA AAPTITGSPP SPVPPTVPVI LNDDSRMIAY MGNWHTCPTA AQLTQYTHVV
LAFAVSYTWA PGKNQCDTQC NIATPPVCNN APNDSLISDL HAAGKKIILS FGGAGMGGSW
SSSQDDCWDY CFGKEEKVVS RLTEIIDDMN LDGIDIDYEY FYEDNQNGSG FTKGAQAQKF
LTEITVGLRN SLPAGSIVTH APMDSDLVPG KAFYKLLKDI SGTLDFIMPQ YYNGLVRPAL
DGVDGSGFGQ ETSISLYSQL ANDYFGGDAT KIVFGFCIKD CSGTNSNANA VQAAAVMSEL
SSVYDCNGGA FFWVVNDDTN GSWSSQVNQV VQLNAGCAAS NSPSTVAPSS FVATTHPSAA
PTKQPNQSPM AAPTKGTPNP SFAPTSKPSL RATTAPSNAD GFEETKCCPP DFTGLRALDN
CSKFFHCVGG AVVGPETSCG SGLLFDQKFM YCNWSDQVAC VPNVC