Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43611 |
Symbol | |
ID | 7197333 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 974383 |
End bp | 976357 |
Gene Length | 1975 bp |
Protein Length | 474 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177728 |
Protein GI | 219111953 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGGTACGGA CACGCCGTGT AACTTCCGGT CGCTGCCCGG CACAACCCAC AACGGTATCC AACCACACTC TCACATATCT GCAGCCCAAG CACCCCTCAC TCCCAACCGT TGCTCTTTGC ATTAGCGCCG TTTTATTGCT ATTGCTCGTG TCTACCAGCA TGTCCGAAAC CAAACCCCTC GTTCCTACCA CGGACGATGC AGTCGACCAT CTCCGTCGCT TGAAGGCTTA CCAACCGTGA GTATCGGCAT GTGCGATTCG CTGTACTCAA GATCTTTGAG GCGTGAGACC GTGGTCGAAT CTTCGCCCAA GTTTCTGTCG TCCTGTGTTC CGTTTGGTGT TTGTCGCTCA CACGTTCCCG ATCATCCTCT TCCACAGATG GGCCTTTTTT ATCACCTTTG GCGGCTACTT CATGTCGCAT TTCTCCCGCA AGTGCTATTC CACGGTCAAA CAGCAGTTGC AGACGCAGGC AGGCTACTCG CCACTCGTCT TGTCCGAGAT GGACACGATC TTTATGGCGA CCTACGCCGC CGGAAACATT ATCAACGGAA AGCTCGGTGA TACCTTCAAT CCGACTACCA TTCTGGCTAT TGGCTTGCTC GGATCCGGAG CCTGCCTTTT CCTCATCAAC GTTGCCATTT GGTTTGACTT TGAAGGCTTT TCACAGACCC TCGGTAATTT CTTCATTCTC GCCGTATACT TCTTGTTCGG ATTCTTCCAA GCAACGGGTG GTCCCGTAGG AACCGCCGTC ATGGGTAACT GGTTCTGTGA TACCGGGTCC GTCAAGAATC GTGGGACCAT CTTCGGATTC TGGACCTGTC ACCAGTATAT GGGAGATATC ACCGCTGCGC TCTGTACCGC ATGGGTGCTC GGTATTGGTT TGCCCTACTG GTGGGCCTTA CTCATACCCG CGATTGCAAA CGTTGCGTGG GCGTTTTTGA CGGCCCAGCT CGTGGCCGAC CCCTACACTG TCGGTATCAT TACCCCCGAA GTCCGCATCC GCCAAGCCAA ACACGAAGCC AAGCGCAAGG AAATGGCCGA ACTGGGTGAG AGCGTGGCGG CGGATGAAGG TCCCCAGCCT ATCACCTACT TGGCGGCCTT AAAAATCCCC ATGGTTGCCC AATACGCCGT TGCCTTTGGA TTTTTCAAAC TCACCAACTA TGTCCTATTC TTCTGGTTAC CCTACTTTTT GGGCAAGGCG TTCGACCCCG TCACGGCCAA CTTGATCGCG GCACTCTACT CGGTAGGAAT GATGCCCGGT GGCATCATCG TCGGATACGT TTCGGATTTG TTTGGAGGAC GTCGGGCTGT GGTTATTGGT GTCTTTATGT GCATGCTCAT TGTCTTTTTG GGAGTGTTTG CTGTCTACTC TGAGGCAGGC CTGTCGCCTG GGGCCTTACT TGTCATGCTG GGATGCATGG GTATTCTTGT GGGAGGGTAC GTGTTATTGT GTAGTGTTGC CGCATTGTCA GCCACGCTGC GCAAGCAGAC TCATTTATGC CAGATACTTC ACTCACGACA CCTTTCCCTT TCTTCCACTT TCTTTTCCAC TACAGACCAA ACAATATTAT CACTTCGGCC GTCGCTGCCG ACTTGGCCTC TCATCCTTCG GTGCTCGGTA ACAACAAGTC TCTCGGAACC GTCACGGGAC TGATTAATGG ATGTGGTTCC ATTACTGCTT CGATCGGTCT CCTGGCGGTC GGTCCTTTAC AAGAATCTTA CGGTTGGGGC TCGGTGTGGC TCTACTTGAT TTTTTGTACC GCCACGGGTA CACTCTTGAT GGCAACCAAG ATCTATTCCG AATTGTTCCC TTCCGCTGCG AACGCTACAG CCGTGATTGT CTAAGCACAG TCGTAATCCG GAGGGCAGTT ACCACGGAAG GGAAAAGCTC GCGATAAGAC CTGTTCCGAC ACCTCTTCGT CCTTTTCCTT ACTAATTAAA AATGTAATAA ACAGATTTGC TCCTTTGAAT TTAAT
|
Protein sequence | MSETKPLVPT TDDAVDHLRR LKAYQPWAFF ITFGGYFMSH FSRKCYSTVK QQLQTQAGYS PLVLSEMDTI FMATYAAGNI INGKLGDTFN PTTILAIGLL GSGACLFLIN VAIWFDFEGF SQTLGNFFIL AVYFLFGFFQ ATGGPVGTAV MGNWFCDTGS VKNRGTIFGF WTCHQYMGDI TAALCTAWVL GIGLPYWWAL LIPAIANVAW AFLTAQLVAD PYTVGIITPE VRIRQAKHEA KRKEMAELGE SVAADEGPQP ITYLAALKIP MVAQYAVAFG FFKLTNYVLF FWLPYFLGKA FDPVTANLIA ALYSVGMMPG GIIVGYVSDL FGGRRAVVIG VFMCMLIVFL GVFAVYSEAG LSPGALLVML GCMGILVGGP NNIITSAVAA DLASHPSVLG NNKSLGTVTG LINGCGSITA SIGLLAVGPL QESYGWGSVW LYLIFCTATG TLLMATKIYS ELFPSAANAT AVIV
|
| |