Gene Information |
Locus tag | PHATRDRAFT_30145 |
Symbol | |
ID | 7195620 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 100251 |
End bp | 102510 |
Gene Length | 2260 bp |
Protein Length | 471 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | hypothetical protein |
Protein accession | XP_002183928 |
Protein GI | 219127408 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.542768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGCGGGCTC CGTTTGCGTG TTTGGTTCGG TCGAGGAAGG AATTTATGCG AGTGCATCGT TCTTCGTTGT CGATCCAGAT CCGCGGCCAT CACTTTTGTA CCATTCGTGA CAACGTATTC CTTGTTTCCT CTCCAAGGTG ACACTGAAAT ATCTTTCGTT CTCTCTTGTT GCTCCCAATT GATTCACAGT CACTGTTATC ATCATTAACC ATGACGCTAT TGGTATCTCG AACGGTGCGG TCTTTGCTCC GCAAGGTACG TAAGCTAGCA GCAACGACGA CGACGAATAT CAACGGTGAC GGAACGGAAA GTAGCGCTCG CATTCTCCAG CCTTTTTGAG TTCGAGGACG AACCAACCCT TCCGCAATGC TTCGGCAAGG AAGGTTGTCG TCACAAGCTG ACTTGCTGTC GCCTTGCGAA GCAGCTTGGG AACCGACACA GATGGAGGCT GGGAAGCTAC CGTTTGCCAT TGTCCGCACT TTATCATACT TCTATTTTCC ATTACGATCT ATGAACCATA CTCACACGAC ACTTCCGACA CGTTGCTTCT GTTTTTGTTG GTCACTCGTC TTGTTGCGAG ACAGACGCCC TTTTCTACGA CTGGTACGAC CACACCCATC GGCGCGAGTA TCCGCACCGT CACGACACTG CAAGAGACGC TGGCGGCGCA AGTGCCGGAA AAGCAAAAGA AAATGGCCGA ACTTAAAAAG AGTTACGGAT CCGAAGTGTA AGTACAACTT CGTTTAGCGA TGCAGTTGTA GAGATACTGG CGCAGTCCGT TACCCCACGG CTGCCACCGG AAAATAGGTA GTACCGCGTC CACTGCGGGG CCCTTCTTGT TTCGTCCGCG CCCTCCTCTT ATCTGTTGGT ACACACACAG GCCCGACTCA ACCCCGCTGT TCTCTTTCTC AGGATCGGCC AAGTGACGGT GGACCAGTGC ATCGGTGGAG CTCGTGGTGT CAAGTGCATG CTCTGGGAGA CGTCCAACCT GGACCCCGAC GAGGGTATTC GCTTCCGCGG TCTCACCATT CCGGAATGCC AAGCCGTCCT CCCGACTTAT TCGGGCAAGG CTGGCGACGG CGAGCCCCTC CTCGAAAGTC TTTTCTGGCT GCTCCTCACG TCCGAGGTTC CCACCAAGGA ACAAGCCGAC TCCTTGACGG CCGATTTACA CTCGCGCGCC AAACTCCCCT CACACGTGGT TCCCCTGCTC AATTCGTTGC CCAAAGACAT GCATCCCATG ACGCAATTCA CTATTGGGCT CGCCGCCTGC CAGACGGAAT CCGTCTTCGC CAAAGCCTAC CAGGACGGCG TTCCCAAGAA TCTATACCAC GAGTACGCCT TGGAAGACAT TCTTTCCGTC GTCGCCAAGT TGCCAGAAAT TGCCGCCACG ATTTATCGCA ACGTCTACCA CGACGGAGTT GTGAGTCAGG ATACGTCGTT GGATTTTGCG GGAAACTTTT GCCGGATGCT CGGCTACAAC GATCCGTCGT TTGATGAGCT CATGCGTCTG TACCTGTGCA TCCACACGGA CCACGAGGGC GGCAACGCGT CGGCGCACGC CACGCACTTG GTCGGTTCGA CCTTGAGTGA TCCGTTCTTG TCGTACGCTG CCGGATTGAA CGCGTTGGCC GGTCCGCTCC ACGGCCTCGC CAACCAAGAA GTCCTGAAAT GGATCCAGGA GCTCAAGGAC AAGTTCGAGT CCGAAGGCAA ACCGGTGAAC GCGGAAACCA TTACCGAATT TGCTTGGGAC ACGCTGAACG CGAAAAAGGT GATCCCGGGC TACGGTCACG CCGTCTTGCG 
CAAGACCGAC CCCCGATACA CGTGCCAACG TGAATTTGCG TTGAAACACA TGCCCGACGA CGAACTCTTC AAGGTGGTCG ACACGATTTA TGAAGTCATG CCCGACATTT TGAAAGAACA CGGCAAGGTG GCGAACCCGT ATCCCAACGT GGACAGTCAT TCGGGCGTTT TGTTGTGGCA CTATGGGTTC ACCCAATACC AGTACTACAC TGTTTTGTTC GGCGTGAGTC GTGCGGTGGG TGGGCTATCG CAACTGTACT GGGACCGGGC TCTGGGCTTG CCCCTCGAAC GACCGAAATC CGTCACGCCG GAATGGATCT GGAGTCAGGT GCAAAAGTAG TGGTCATGGA CGTCGTTTCG TATTGGAACG CACAAATGGA TGGAGAGGAA TGAAATGCAC TTCTTTTGTG TCGGTGTATC AGTCCATTAG AAATAACTTT AGCCTACGCA GTTTGCAGTC
|
Protein sequence | MTLLVSRTVR SLLRKTPFST TGTTTPIGAS IRTVTTLQET LAAQVPEKQK KMAELKKSYG SEVIGQVTVD QCIGGARGVK CMLWETSNLD PDEGIRFRGL TIPECQAVLP TYSGKAGDGE PLLESLFWLL LTSEVPTKEQ ADSLTADLHS RAKLPSHVVP LLNSLPKDMH PMTQFTIGLA ACQTESVFAK AYQDGVPKNL YHEYALEDIL SVVAKLPEIA ATIYRNVYHD GVVSQDTSLD FAGNFCRMLG YNDPSFDELM RLYLCIHTDH EGGNASAHAT HLVGSTLSDP FLSYAAGLNA LAGPLHGLAN QEVLKWIQEL KDKFESEGKP VNAETITEFA WDTLNAKKVI PGYGHAVLRK TDPRYTCQRE FALKHMPDDE LFKVVDTIYE VMPDILKEHG KVANPYPNVD SHSGVLLWHY GFTQYQYYTV LFGVSRAVGG LSQLYWDRAL GLPLERPKSV TPEWIWSQVQ K
|
| |
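The derived fields in the record above (Gene Length, GC content) can be checked against the coordinates and the sequence. The sketch below is a minimal illustration, not the database's own method. The helper names `inclusive_length` and `gc_percent` are hypothetical. It assumes that Start bp and End bp are both inclusive, which matches the reported 2260 bp, and that GC content is the share of G and C bases in the sequence.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both included."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string.

    Whitespace is ignored, so the space-separated 10-base blocks used in
    the record can be pasted in unchanged. Assumption: GC content is the
    share of G and C among all bases; the record does not state its formula.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record (Start bp, End bp).
gene_length = inclusive_length(100251, 102510)
print(gene_length)  # 2260, matching the Gene Length field

# First 10-base block of the gene sequence, as a small check.
print(gc_percent("CTGCGGGCTC"))  # 80.0
```

Passing the full Gene sequence string to `gc_percent` gives a value to compare with the reported 54%. A small difference could come from rounding or from a different calculation on the database side.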