Gene Information |
Locus tag | PHATRDRAFT_43489 |
Symbol | |
ID | 7197185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 595318 |
End bp | 599067 |
Gene Length | 3750 bp |
Protein Length | 425 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177648 |
Protein GI | 219111793 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0573784 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTCAAAAA AGTTCTCTAT CAAGGAGAAA ATTGGGGTAA TGAGTTCTGG GCACGCAGTG TGACTCTTCT CTATCACACA CGCATCGCCT TTTATAGTCA AGCCGAGATT TCTTCTCTTC CCAGCCAGCG CTCCTGGCAA TGTTGCTCCG TCGAAAGCAA CCATGTTGGG TCCGGTCTCA GATCTTTGTA TAGTAGAAAA TGATCCCTAC CCTTGGTAGC TCCCGGATGT AGATTCAATG AGGAGGTAAA GTCCAAAAAG ACATCCCGAT CGGGTGCAAT TTGGTAGCCC GCTTTGTGGT ATAATTGCAA GGCACTGTGA TTCATCGTGT CCACGTGCAG ATAGAGCGTT TCGACGTGAC GATCCCTCGC CAAACGGTCC ATGGCGTCCA ATAAACGCAA TCCAATGCCT TGCCGTCGCG CGGACGGGTG GACCGCCACT TCCGTCAAGT ACAGTAGTCC GTACCGTGGC CGGCGGGAGC CTAACTGTGT ATTGTAGAAT TCGTGAAAGC TACATTCGGC ACTACCCAGA ATAACGCGCG ACGTGGTGCT TGGTTCGGTA TGCGCGGTCG CTTTGGTCGC AACGATGCAG GTGGCGCCCC GCATCCGCCG TCGGACCATG GCTTGACAGG ACCGGGAGCA GAACTGGGCC TGTAAGTCGG CCGAAAAGTC GGAAAAGACG GATAGACGCA AGTTGGCAAT ATCCACGTCG TCCAACGGAG TAGCCAACTG CACCGAGACG GCTCCATCCT GCGCAAGGAT GATGGTGCGG GGAGATTCGA TTGGTTCGCC TAAAAGTCCC ATCTCGGAGT GAGTGAGCGA AGAGAACGCG GTAGCGAGTC GATCGTGCTG CGTACTGAAT GAGGTGGTAT CCAATTTCCT GTGCGGGGAT CACTGTTTCT ACGACTTCCC AATAAGTCAT CAATAACAGA GTGAATAATA GATTTTGTGG TGGAGGCTGC CGTGTTGGCT AACGCTGATG ACCGGAGCGA TGCTTTGGTG GGCGGGTCCG TATGATCCTT TCGGGGAGCA GCTTTGCGCA AAATGTCCAA AACCTTGTGG GTGGCTCGTT CTGCTTGCCG GGCATGGGTG GCACTGCCAA TCCGACGAAC TCCCACACTG GTGGGTTTTC CTGTCGCATT AGCGTTCCTT CGTTGTGGCC TCGTACCAGC GGCACGGTTG GCCAACAGTG CGGCTCGTAT CCTGGCAGCC TTGCGTTGAT CGTCGGGAGC CGCTTCGGTC CGTGTGAGCG AGTCGTCGCC GCGTCCCGCG GGTTGGGCGG ATCCGGATTC TGGTGGTCGG TCCTCCTCCT CCTCGCGAGA CGCAGAATTC TTGGGAACGT CGTCGACATC ACTTTTCGAG GCGGACGGGG TGCATGATCG TGCGTCGTCT GGTCGTATCG GATGATCTCG TGATTCCGAC CAAGATCGTT TGGTGGTGAC GGTGGGTACT GGTGTACCAG TGGAGACGCA AGACGGTGTG GCAGGTACCA TGGTAAGGCG CGCAAAGGGA GAAGAGAACG CCCGGTCCCG AGATGGACCA TCAACGAAAA CAGACGATTG GTCAGCGCCG GAGACGGGTT GGGTAGAGGG ATTGGTTCGA AAGGGCATTC CTGGAGAGGA ATGGAACGCC CCACAGACCT TCCCGGAATG TATCGACACG AGACAGCAGA GTAGAAGAAC AGCAATGACG TGCCATCGGC GCACCATTGC GGACGGCAGA CCGTACCGAG AGTTTGTCCG GCTAAGGGCG TTGGTAATGC TCGTTGCGGT ACACACCCGT GAACTCGCAA AGTTCGGAAG 
CGCAGCGCTC GACGGAATTC GGCTGAGAAT GAGCGGCGGT AGGAGACGAT TCTTGGACTG CGATCTTCAA CTGCCTAGCT AATGTAGTAA TGAGCAGGAG TCGTCGGGAT GGCAAAGAAC AGGAACGGAA TTGTGGAAAC TTCCTGCAAC TGTAAGCAAT CTCTTTTTCG ATACTCCTGT CGCAGCACCG GGTCGACCCA GACGACCCTG ACGACCCCTG TAGTTTCCAG GTAGGGCACC AGCCAGCTCA CACTCACAGT CAACGGAATA TCGGTACCCT CCTATTGTCT GTTTGTCTGT TTGTCTGTCT GTACGTCTGA CCAGCAGTAC CGAACGGGAC TTTTTCCAAC TGGCTCTTCA CCAACGTTGC AATCACCAAC CACAAAAGGT GTTCCAGGTG CCAGTATCTT TTGGAACGGC CAGTACGACG TACAAACACA GGGAGCAGAC GTCGGCAGCG CTGGATTCGA ATTTGGTTCC TCACGCTCCC CTTTCAAGTG CTCTTTCATC GGACACTAAC ACTAACAAAT TCTCTACCCT CGGCAACTAC CTACCTCCTT TCCTAAGTTT GTTATTGGTG TGTGGTACTA TTGTTGCTCT TGCTGTCATT ACCATTTGTT GTGGATGCCC ATTGTGGAAG CTGCGAGTCA CTTTTTAGAA AATTGATTCC CCATGACCAG CCCTTCCGTT GCGATCACCC GAGCCGCTAC ATCGACCCCT TTGGTGTCCC GAGCCTTGGT CGTTCTGCCG TGTTGGTTGT CCCTATCCGT TCCTTTTCAA AGGCGGCCAC AAGCGGGACT TACCCAAAGG GTCGGTCGAA CCATTTCCAC AACGACTCGT TACACGGTAA AAACGCGGCA ACATCACCCA TCGAGTCTCC CTTGGTACTA CCATTCCACC CGATTCTTCT CGGCCAATCT TTCTTCCTCC CACGAAACCC ACAGTGATGA CGTTCGGAAT GGATCGAATG AAGACAACCA AGATGACGAA GAAGAAGAAG AAGAAGAAGA AATGGAAGAC AACGCCGATG AGCAAACAAA GGCAGCCCTG CAAATCCGCA ACGAGATTAT TTGGCAGAAA AGATTCATGG CCCTGGAAGC GTTCGTGGCG ACCCAGACCC GCGACGAACA AGGGACTCTA CCCTACCCAG AGGACCGCTC CATGCGTACC TGGTTAGACA AGCAGCGGCA TTTGTTTCAC CTCAAGATGC AGGGCGAAAG CTCGTCGCTC ACGGATGCTC GTTCAGCCCA ACTCGAGTCA CTGGGCTATC CACTCAGTCC CCGGGACGAT TGGTGGGAAA AACGTTACGA AGATGTACGA GCGTTTGTCC AAAAACACAA TCGCTTTCCT TACGATATGG ATGACAGTTA CATGACTGAA GAAGAAAAAC GATTGCTTTG GTGGTGTCGT CTGCAAAAGA AGCAGTACAA GGCATGGAAA GAGCAAGACG ACGATTCGTT GACTGGAATG AACGAAGCAC GAGAAGCCAA ACTGAACGAG ATTGGCTTTT GTTGGGATGC TCATCAAGCG TCCTGGTTGG CTCGATACGA AGAACTAAAA GCTTATCACG CCCATCACGG AGATTGTCTC GTGCCGAAGG ACTATCCGAC GAATCCCCCT CTGAGCAAAT GGGTGAGCGA TCAAAGGAAC AACATGGCTC GTTCCCGCAA AGGAATAATA AAAGTTAATC CGGAGCGACT TCAACTCCTA AAAGAATTGG ATTTCGAATG GAATGCACTA GAAGAATTTT GGAATCGGAA GTACAAAGAG TATGCTGAGT ACGTGCGATT ACATGGGCCA GGTAGCATGC 
CTCGCCAAAA ACACAATCCT CATTTACGGA ACTGGCTTAC TTATCAACGA AGGCAGTATC AGTTGTTGTT GAATGGGCAG AAGAGCTGCA TGACACAGAA ACGCAAGGAT CTTTTGGACG CATTGGGTTT TATCGTTTGA
|
Protein sequence | MTSPSVAITR AATSTPLVSR ALVVLPCWLS LSVPFQRRPQ AGLTQRVGRT ISTTTRYTVK TRQHHPSSLP WYYHSTRFFS ANLSSSHETH SDDVRNGSNE DNQDDEEEEE EEEMEDNADE QTKAALQIRN EIIWQKRFMA LEAFVATQTR DEQGTLPYPE DRSMRTWLDK QRHLFHLKMQ GESSSLTDAR SAQLESLGYP LSPRDDWWEK RYEDVRAFVQ KHNRFPYDMD DSYMTEEEKR LLWWCRLQKK QYKAWKEQDD DSLTGMNEAR EAKLNEIGFC WDAHQASWLA RYEELKAYHA HHGDCLVPKD YPTNPPLSKW VSDQRNNMAR SRKGIIKVNP ERLQLLKELD FEWNALEEFW NRKYKEYAEY VRLHGPGSMP RQKHNPHLRN WLTYQRRQYQ LLLNGQKSCM TQKRKDLLDA LGFIV
|
| |
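The coordinate and length fields in this record can be cross-checked with a few lines of Python. This is a minimal sketch and not part of the source record. It assumes that Start bp and End bp are 1-based and inclusive. The helper names `span_length`, `gc_percent`, and `coding_length` are hypothetical, chosen only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace used to group the sequence."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


if __name__ == "__main__":
    # Values taken from the Gene Information table of this record.
    print(span_length(595318, 599067))
    print(coding_length(425))
```

With the values listed here, `span_length(595318, 599067)` gives 3750, which matches the Gene Length field, and `coding_length(425)` gives 1278, the number of bases needed for a 425 aa protein plus a stop codon. Passing the Gene sequence field to `gc_percent` is one way to compare against the listed GC content; the grouping spaces in the sequence are stripped before counting.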