Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50155 |
Symbol | |
ID | 7198854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 212901 |
End bp | 214651 |
Gene Length | 1751 bp |
Protein Length | 525 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | chitinase chitin binding glycoside hydrolase family 1 |
Protein accession | XP_002184991 |
Protein GI | 219129639 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.487357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGAAGGGT TAACAAAAAC GATTCATTCG GATCCCATGA GTGTCAAAGT AAGACCGAAG ACCGTCCTTT AACAATGACT TTCAAATGAC ATTTCCAGAA GCTCAATCGA CAAGTCGATG CGCTCACAAA ACACGAATCT TGGCAAACCA AATCGCTTCC GAAATCCAAT ATTATGACGA TTGCCATCTA CCAGTCGATT TTCTCGTTAA TTCTCGTTGG AGGATTTCTT TCTTCAACCT CACTTGCTGC TACCTGCAGT GGGGGCACGA TAGGCGACGG TATCTGTGTC GACAACACCT TGTGCTGTTC ACGATTTGGC TGGTGCGGAT TAGGTCCCGA TTGGTGTGAT GAAGGCAATA CAGCATCCCC TCCTACTCCT GCTGCAGCTC CAACCATAAC AGGCTCACCG CCCTCACCGG TTCCACCTAC AGTTCCCGTC ATACTAAATG ACGACTCTCG AATGATTGCC TACATGGGAA ACTGGCATAC CTGTCCGACA GCCGCGCAAC TCACTCAGTA CACGCACGTG GTCTTAGCAT TTGCAGTTTC GTATACTTGG GCTCCTGGTA AGAACCAATG TGATACACAG TGTAACATTG CCACACCGCC TGTTTGCAAC AATGCACCGA ACGACTCTTT GATCAGCGAC CTACACGCCG CTGGCAAGAA GATCATTCTC AGTTTCGGCG GTGCTGGTAT GGGAGGAAGC TGGTCGTCCT CACAGGATGA CTGCTGGGAC TACTGTTTCG GGAAAGAAGA GAAGGTTGTT TCGCGCCTGA CTGAGATTAT TGATGACATG AACTTGGACG GTATCGATAT TGACTACGAG TATTTTTACG AAGACAATCA GAATGGCTCG GGATTTACTA AGGGAGCACA AGCCCAGAAG TTCCTTACAG AAATTACTGT CGGCCTGCGC AATAGTCTTC CAGCTGGATC GATTGTCACC CATGCTCCTA TGGATTCTGA TTTGGTTCCG GGAAAAGCAT TCTATAAGTT GTTGAAGGAT ATCAGTGGAA CTTTGGATTT CATCATGCCA CAGTATTACA ATGGCCTGGT GCGTCCAGCC TTGGATGGAG TTGATGGATC TGGATTTGGG CAAGAAACAT CTATATCCCT TTACTCTCAG TTGGCGAACG ATTATTTTGG TGGTGACGCC ACCAAAATTG TATTCGGGTT CTGCATCAAG GATTGTAGTG GGACTAACAG CAATGCCAAC GCGGTTCAGG CGGCAGCGGT CATGAGCGAA TTAAGCAGCG TGTACGACTG CAACGGTGGA GCCTTCTTTT GGGTTGTCAA TGATGATACT AATGGGTCTT GGTCCTCCCA AGTGAACCAA GTTGTGCAGC TCAATGCTGG GTGTGCGGCG TCAAACTCTC CCAGTACTGT CGCTCCCTCG TCTTTTGTTG CCACGACCCA TCCTAGTGCT GCTCCGACCA AACAACCGAA CCAAAGTCCC ATGGCTGCAC CCACGAAGGG AACGCCGAAC CCGTCTTTCG CACCCACCAG CAAACCTTCG CTCCGCGCCA CTACTGCACC CAGCAACGCA GATGGTTTCG AAGAAACCAA ATGTTGCCCG CCGGACTTCA CAGGACTGCG GGCCTTGGAC AATTGTTCCA AGTTTTTCCA CTGTGTGGGT GGTGCCGTTG TGGGTCCAGA GACAAGTTGT GGCAGTGGGT TGTTGTTTGA CCAGAAGTTC ATGTACTGTA ACTGGAGTGA CCAGGTGGCT TGTGTTCCCA ACGTTTGTTA A
|
Protein sequence | MTIAIYQSIF SLILVGGFLS STSLAATCSG GTIGDGICVD NTLCCSRFGW CGLGPDWCDE GNTASPPTPA AAPTITGSPP SPVPPTVPVI LNDDSRMIAY MGNWHTCPTA AQLTQYTHVV LAFAVSYTWA PGKNQCDTQC NIATPPVCNN APNDSLISDL HAAGKKIILS FGGAGMGGSW SSSQDDCWDY CFGKEEKVVS RLTEIIDDMN LDGIDIDYEY FYEDNQNGSG FTKGAQAQKF LTEITVGLRN SLPAGSIVTH APMDSDLVPG KAFYKLLKDI SGTLDFIMPQ YYNGLVRPAL DGVDGSGFGQ ETSISLYSQL ANDYFGGDAT KIVFGFCIKD CSGTNSNANA VQAAAVMSEL SSVYDCNGGA FFWVVNDDTN GSWSSQVNQV VQLNAGCAAS NSPSTVAPSS FVATTHPSAA PTKQPNQSPM AAPTKGTPNP SFAPTSKPSL RATTAPSNAD GFEETKCCPP DFTGLRALDN CSKFFHCVGG AVVGPETSCG SGLLFDQKFM YCNWSDQVAC VPNVC
|
| |