Gene Information |
Locus tag | PHATRDRAFT_42569 |
Symbol | |
ID | 7195952 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 459027 |
End bp | 461095 |
Gene Length | 2069 bp |
Protein Length | 427 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177091 |
Protein GI | 219110679 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.191925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCAAGCCA GGCCCGACAC GACGAATGTG ACGGAAAATA ACAAAAACAT AAGGCCAAAG GGTTTCAAAG CGTTGCTTCA CCCTCGACTC CTGCCGACCG TAATCTAGCA TATCTTTGCA GTAAAGTGCA TGCAATCTAC AAGCCTCCTT AGATGTCAGA CCTTGTCCAA GAAAATATCA ACTCGTCCTT CACTCTCTGA ACGACGTGTC CTGCTGGCGT ATTTCCTTTT TCTGCTTGTG CTGCTGGCGC TTTTCGTGCG ATCTGATGTC GAACGGAAAA CCTAAAACGC CAACCGCGGC ATTGAATGAT CAAGCTGCTG AAGAAAGTGA GGGACCTTCT ACAAATTCAG AAAACGAGAC ATTGACTGCG TTTGATGTGC TGGGAAACGG GGCGACTAAC AGTGAAAGTG ACGGGCGGAA CGACAGATCT GCCGAAAGTA AAGCTTTGAA AACGGAACAA AAGTGGGAAG AACATTTTAA AAAACTGGTG GCTTTCAAGA AACGGTTCGG ACACTGCCTT GTCCCCAATC GTTACAACGA CGACTTTCAT CTTGGTAGTT GGGGTATGTG TGTATGGTGA TTGTGATTCA GACTTTTCGT CTTGGAACTT ATTTATCTCG CGTTTTTTCC AATGATATTC AGTTTCCACA CAGCGCCGTT ATTATAAGAT CTTATTATCT GGAACCAGTG CATCTACGCC AATGACTGCG GAAAGAGCTA AAAGGCTGAC GCAGTTGGGA TTTTCTTGGG CAACAAAGGA TCCACGCCAT GTACCGTGGG ACGATCGGTA CCAGGAACTC GTGGTTTTTG TCGTAAGTAA TTATGAGTAT TTCTTACGCG GTGACGTTGT GCATGGCATA CATTAAAAAC TTCCCCGTCT ACTCCTTATG TAGAGAGAAT ACGGCCATAC ACAGGTTCCC ATTGGTTGGC AAAAGAACAC AAAGCTGGCA AACTGGGTTT CCACACAGGT AAGCAGGTTG TGACATTGCC TTGCCTAGTC ACACGTCGAT ACGCATTCTT CAAACACAAT TGAGATCTCG CGTTTCACAG AGACAAGAGT TCAAGCTGTT GCACAAAGGA CGTTCCTCAA GATTGACCCA AGATCGCATT GACAAACTGA ATGCAATAGA CTTTGTCTGG GAAGCTCAGC GAGGGGGTCC GCGCCGAGGT CCAAAGGCTT GTACGGTCGG GAAGGTATCA GAAAAAGCAA ATCCGGTTCC GGGTGTGGGG CCACGTTCAA ATGCGTTAAT TAGTCTCTCT ACAAAATTAG AAGCAGAATC GAGAGGGTGC ACTACACAGA CAATGAAGTT GTGTGATGTT GGGGTAGGTA TTCAACAGGT ACCATATCTG GGTCGCGGTG AATCCAGGCC GGCTCCAGCG CAGCCGGTAG TGCTCGGAAC GTTGTCTGTT GCCCAGTTGT TGGAGCTGCA ACAAGCGGTA GAAGTAGCAC AAAGACCGTG GCAGGTTCTA GCACATGGAG GATTTCACCA AGGGCAGCTC CCACAACCGC TGCTGTCTTC GCAAACTGTC AGTGCGCAGT TTGGATTCCA GCCAATTCCG AAACCCTACG AGAGTCAACT CATGCCTAAT GAACCACGGA ATTTGCAGCT ACCTCGTACA TCCAGCAGCA ACTTTCCGAT TTCTGCTTTG ACGCAACTGC ATCAGTCATC CAGCGGAATG AGTTTACCAC CTAACATTGT CAACACGAGC CAGGTGGCGG ACCAAGTTTT GATTCAGAGA CTTTTGCTCG ATCAGCAGCA GTCAGAGTAC TCCTACCATT CCGATCAATG AAACCGTGCA 
AACGCTGCTT CGGGACTTGC AATCGCTGAC GCGTTCGTCG GCGACGAATG GTGCGCCAGG AAACTAGTGC CAATCCAATT CTTTGTGTAG TTCGCCATTT TCTGGTCTTG CTCGAAAGAG GGAGTCTGTG TCACCAAAAA TCCTATTCCA GCATCAACTT TCAATTACAG TTAACTGTAA GACCTTCCCG AAAAAGGATA CGGTCCGTGA AGGGGCCAGG TCCTCGTAGA GGTGACATAA TACAACCTGA CGAAGCATTC TGTAGCCTC
|
Protein sequence | MSNGKPKTPT AALNDQAAEE SEGPSTNSEN ETLTAFDVLG NGATNSESDG RNDRSAESKA LKTEQKWEEH FKKLVAFKKR FGHCLVPNRY NDDFHLGSWV STQRRYYKIL LSGTSASTPM TAERAKRLTQ LGFSWATKDP RHVPWDDRYQ ELVVFVREYG HTQVPIGWQK NTKLANWVST QRQEFKLLHK GRSSRLTQDR IDKLNAIDFV WEAQRGGPRR GPKACTVGKV SEKANPVPGV GPRSNALISL STKLEAESRG CTTQTMKLCD VGVGIQQVPY LGRGESRPAP AQPVVLGTLS VAQLLELQQA VEVAQRPWQV LAHGGFHQGQ LPQPLLSSQT VSAQFGFQPI PKPYESQLMP NEPRNLQLPR TSSSNFPISA LTQLHQSSSG MSLPPNIVNT SQVADQVLIQ RLLLDQQQSE YSYHSDQ
|
| |
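
The length fields in this record can be cross-checked against the coordinates and the protein sequence. The following is a minimal sketch, not part of the original record: the function names (`span_length`, `residue_count`, `cds_length`) are hypothetical, and it assumes that the Start bp and End bp values are 1-based and inclusive and that a coding sequence includes one stop codon.

```python
# Values copied from the record above.
START_BP = 459027
END_BP = 461095

# Protein sequence as displayed in the record (blocks of ten residues).
PROTEIN = (
    "MSNGKPKTPT AALNDQAAEE SEGPSTNSEN ETLTAFDVLG NGATNSESDG RNDRSAESKA "
    "LKTEQKWEEH FKKLVAFKKR FGHCLVPNRY NDDFHLGSWV STQRRYYKIL LSGTSASTPM "
    "TAERAKRLTQ LGFSWATKDP RHVPWDDRYQ ELVVFVREYG HTQVPIGWQK NTKLANWVST "
    "QRQEFKLLHK GRSSRLTQDR IDKLNAIDFV WEAQRGGPRR GPKACTVGKV SEKANPVPGV "
    "GPRSNALISL STKLEAESRG CTTQTMKLCD VGVGIQQVPY LGRGESRPAP AQPVVLGTLS "
    "VAQLLELQQA VEVAQRPWQV LAHGGFHQGQ LPQPLLSSQT VSAQFGFQPI PKPYESQLMP "
    "NEPRNLQLPR TSSSNFPISA LTQLHQSSSG MSLPPNIVNT SQVADQVLIQ RLLLDQQQSE "
    "YSYHSDQ"
)


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residue_count(grouped_seq: str) -> int:
    """Count residues after removing the spaces between display blocks."""
    return len("".join(grouped_seq.split()))


def cds_length(n_residues: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode n_residues (assumption: one stop codon)."""
    return 3 * (n_residues + (1 if include_stop else 0))


if __name__ == "__main__":
    print("Gene length (bp):", span_length(START_BP, END_BP))
    print("Protein length (aa):", residue_count(PROTEIN))
    print("Coding length (nt):", cds_length(residue_count(PROTEIN)))
```

Under these assumptions the coordinates give the listed gene length of 2069 bp and the sequence gives the listed 427 aa. The coding length works out to 1284 nt, shorter than the gene span, which is consistent with the record marking the gene as spliced.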