Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_16814 |
Symbol | |
ID | 7199069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | + |
Start bp | 326410 |
End bp | 328398 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185173 |
Protein GI | 219130020 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.728956 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TATGATGACT TCGATCACGT AGGCTACGAT ACCCATGGAA ACAAGGTCGT CAAGTCGGCG AGTGCCGCCG GAGCTGGTGA TCGTATCGAT CAAGCCATTG AGCTGCAGGA CAACATGGCC AAGGGGAAGT TTGTCGTACA CGATATGTTG AACGATCGCA ATGTGCAACT CTCAACTCGG CAGCTTGAAT TGATACGTCG TGTTCAAGGT GGTGCATTTG CTCATCCTGA GCACGACTCC ACTCCGGAGT ACATCGACTA TTTTTCTGGC GTTAACAAGG AAATTTCTGG TTTGGGCTCT GACCAGTATG AACGGAAGGC CCGTTTTCAA CCCAGCAAAT GGGAGAAGAT GCAGGTCGAT AAGCTATTGG AAAAGTTGGA AAATGGCGAT ATAAATATGG ATTATTTGAC AGGGAAAATC CGCGACATGA ACGAAGTACA TAGGAAGAAG GAGGGTCCAG AAAAGCCATT TATGCTGTGG ACCGGCGAAG AGGAAGACCT GTTAAATATG AGAAAAGGAC CACAGCATAT CGCTGCTCCA AAGCTTCCGC CCCCAGGTCA TGCGGAGAGT TATGTTCCAC CCAATGAGTA TCTACCTACC GAGGAGGATC TTAAGAAATG GAAAGAGATG GAACCAAAAG ACCGCCCCCA TGGCCATCTG ATTCCCAAAA AATTCCCAAA CCTTCGTTCC GTTGGAGCCT ACGAGCATTC CGTACGAGAA TCTTTTGAGC GCTGTCTTGA TCTATATCTC TGTCCACGTG TTATGAAGCG GCGACTGAAT ATTGACCCCG AAAGCTTGGT TCCACGTCTA CCCCGCGCAA ACGACCTGAA GCCCTTCCCG ACAGCCAAGT GCATCGAATA TATTACGCCT TTTGAGGGCA ACGAAACTCC GACTATTCGT TGCGTTACTG CGAGTCCTGA TGGTCAATTC ATGGCTTCTG GGGCTTCTGA CGGCTGCGTA CGGATGTGGG AAGTCGAAAC CGGACGATTG TTGAGAACCT GGGATCTTCA TTTTTCAGTC GTAAAAGCTT CGACTACCGG CGACGACGAA AGTGCCATGA GAACGAAGCC TGTCGTATGC TTGGAATGGA ATCCAAACCG ATCTCACCAT TGCTTACTGG CTGTTATCGC AGAGTATGCT GTCGTGATTG CAACAGGAAC AGCTAACGAA AAGGATATGG AAATCACTGA AGCGGTGCTT GCTGCTGCCT CAAAAGGAGG TAACATGAGT AGCGCCAAGG CTAGCAAATC TGTTCAATGG ATCCTTGTTG AGTCGGGTTT AGAAGAGAAG CCCGTAAGCG TTTATGGTTC CTTGAGTGGT CCGATCTGTG CATTGAAAAC AAATCGCGAG ATGACCAACG TTCGTTGGCA CGCAAAAGGT GATTACTTCG TTTCTGTAAA TCCAAAGGCA GGATCAGCGG CGGTTTTGAT ACACCAACTG AGTAAAGGAA ATTCGCAGCA GCCTTTCAGT AAAGCAAAGG GTGAAACGCA ACTCGCTTGT TTTCACCCGA ATAAGCCTTT TTTGTTTGTT GCTAGCCAAC AACATGTCCG CATTTATCAT TTGGTAAAAC AGAACATGGT TAAGCGCTTG GCCTCCGGTT GCCGGTGGAT CTCTTCGATC GATGTGCATC CATCAGGAGA TCACTTGATA GTTGGGAGCT TGGACCGTCG CATGATTTGG TTTGATCTTG ATTTGAGTTC GACTCCATAC AAGACTCTGA AGTATCACGA ACGAGCCCTA CGTTCGTGCC GATACCACAA TCGCTACCCA CTCATGGCCA GTGCTTCCGA CGATGGCGCA GTGCACGTCT TTCACAGTAT GGTCTACAGC GATCTAATGC GAAATCCTCT CGTTGTACCA GTAAAAATTC TTCGCGGTCA TGCTGTGACA AAGAAACTTG GTGTGCTCTC AATCGTCTTT CATCCGACGC AGCCCTGGAT ATTTACAGCT GGTGCCGACG GTAGGATCTT CCTTTACCAA GATATTTAA
|
Protein sequence | YDDFDHVGYD THGNKVVKSA SAAGAGDRID QAIELQDNMA KGKFVVHDML NDRNVQLSTR QLELIRRVQG GAFAHPEHDS TPEYIDYFSG VNKEISGLGS DQYERKARFQ PSKWEKMQVD KLLEKLENGD INMDYLTGKI RDMNEVHRKK EGPEKPFMLW TGEEEDLLNM RKGPQHIAAP KLPPPGHAES YVPPNEYLPT EEDLKKWKEM EPKDRPHGHL IPKKFPNLRS VGAYEHSVRE SFERCLDLYL CPRVMKRRLN IDPESLVPRL PRANDLKPFP TAKCIEYITP FEGNETPTIR CVTASPDGQF MASGASDGCV RMWEVETGRL LRTWDLHFSV VKASTTGDDE SAMRTKPVVC LEWNPNRSHH CLLAVIAEYA VVIATGTANE KDMEITEAVL AAASKGGNMS SAKASKSVQW ILVESGLEEK PVSVYGSLSG PICALKTNRE MTNVRWHAKG DYFVSVNPKA GSAAVLIHQL SKGNSQQPFS KAKGETQLAC FHPNKPFLFV ASQQHVRIYH LVKQNMVKRL ASGCRWISSI DVHPSGDHLI VGSLDRRMIW FDLDLSSTPY KTLKYHERAL RSCRYHNRYP LMASASDDGA VHVFHSMVYS DLMRNPLVVP VKILRGHAVT KKLGVLSIVF HPTQPWIFTA GADGRIFLYQ DI
|
| |