Gene Information |
Locus tag | PHATR_43943 |
Symbol | |
ID | 7204372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 495554 |
End bp | 497953 |
Gene Length | 2400 bp |
Protein Length | 799 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186359 |
Protein GI | 219113551 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCCAT GTGAGGTAAA AAAGAGTATT TCCTATATTG ACGGTGGTGA AGTAAAAATA GCACTTGATG CAGCAAAAAG TAATGCCCCA CCAAGCGTGA TTGGGCGGCT GCTGAACATA CGAACTGGAG ATATCTTGTC AGGTTCATCT CTAAAAAATA TACGGATGCA AGCTGAAAAA GGTGAACGGA ACAAATTTGG TGCAAACGAT AATTTCACGA CGCAAGCGGA TCAACTTCTA GCTTATCTTG AGAGCACCCC GGATGTCAGC TTCTGTGCCA TATATGATGA ACCAGATTCC CCTTTATTCA CTGTTTACAA GCAGAGGGCA AAAACTGGAC GTCGGCACCT ACACACCAGT ACACGGAGTA TCTCTGGAGG AATTGCCCAA CAAGAAGTGC TCAATGAAAA GGTGCTAGAT GCTATTGATC CAAGGGGAGA GCTTGATGAC TATATAGATA GGACGCGGAG GGCGTTCAAG TTGAAGGGCA ACGAAAAAAT GCTTTTAGGA GTGGCCTGGA CCAACAACGA GAGTAGAAGA ATCTTTGCTC GCTATTCCGA GATCATGGTA GCAGATGTGA CAGAAGGTAC CAACAATGCA AAACGGCCGC TGTTTTTGTT TTCGGGAAAG ACATCAAATC AAAACACGTT TACAGCACTT TGGGCCTTTC TACCGCAACA ATCTCGTTGG GCCTTCCGAT GGGTGTGGAC AAGATGCATT CCACAGCTCT TACCTGAACA GGGGCTGAAC AGAATGCGTC TATTGATTAC TGATGGAGAC CCGCGAGAGT ATGGTACTTT TTTAGATGCA ATACCTACTT GGTATAGCTT GTGTCGGCAC AAACTATGCC ATTGGCATCT ACTCTATCGC GGCAGTCTTA TGAAAGCACA GACTGGAAAC TGTGGAACAA AAGCAAAAAT TCTATTCCAT GTGGTCCTGA AGTGGATAGA AAGCTGGATG ACAAAAATTG AGACGCAAGA GGAGTACAAT TTGTCAACTG GGCTTTTGAT TGACTGGCTG AAATCTCCGG AGGCACTTGA TACAAATTTG GGCGGAATGG GTTGTGCCCT TGTTTCGCAG ATAAATGCAT TTTTGACGTC CTCTCTGTTT CCGCACGAGC AACGCTGGGC TCGATACCAT TTTCTGAACG TGAGAGCATT CAACACGTCC GCAAGTTCCT ACGGAGAAGC AGAGAACAGT GCTCTAAAAC GACGGGGTGA TGGGGTCAAG CCAAACTTTT CGGTGCCAAA AGCAACACGG GCAATAAACG AAGGGACTCA ATTGCGAACA GTGAAGAGGC AACAAAAAGC AGTTCATAAC CTCAATGCTA CAAAGAAGAC AAAGGCAGCA AACTACACCA ATATATCCGA CCTAGTAGAT TGCATACAGG AAACCATATC CCATGAATTC AATGCAGCCA AAAAATATGA CCTCTTTTGC CCGGGTCCAA AAGAATTTTG GGTAAAGCGA GCATGGTACC AAATTCCAAG CGAGACCTAC CAGGATTTCA ACGACAGCAA CTTTTGCCAA TTTATGATTC CACAGTTTGA GCGCACCCGC ATCGTAAAAA TTACGGAAAT TGAAGGTGAA CTCTATCTGG AATGTAGTTG CGGCAAGTTC CAACGACAAG CTTCTCCATG TGCTCACATC TACAAAGTAC TTAACCGACC ACCACAATCA ACAGACGTTT CTGTGAGATG GACAAAAATT TGGGATGTTT ACCTGCATCG ACCTGGATAT CATGATCTGT CGGACCAGTT AGAGGAATTG TATAAGAAGG AGCGGCCAGG GCCACATTTC 
GAAAACACAA ATCAGTGGGA AGTTGGAAAG GGTGAGAGAG AGTACAACTA TTTTAAGAGA TCACTTCCAA GCGAGCCCAC CATTATCCAG AAGTACAGCA GATGGGCTGA TTCTTTTTCA CGACAACCTG GATGTTATGT GCATAAAAGC ACTGAACAGG AAACAGTTCC TGCAGCAAGC GGTATGGTGC AAGAGTTGAC CAGCCTTTCC CAGGGGTATG CTATTGAAAC TCAATTGGAT AGTGAAATGG ATGTTGGAGA TGTAACTGTC ATGCAGGTTG AAGAAATTGA TTCAAATCTC TCAAAATCGG GTAAAAGTCC ATACACAAAC AATCTTCATT TTTACGAGGA AATCTCAAAA CTTGCCAAAT TCAATTCAAA AGCTGCTGAC ATAATGACAA AAGGAATGCA GGAAACTTTG GAATTGCTAC AGAAACATGT TGCAGAAGGG TCAGGTATGG TAGATTACAG TATTGGCCCA GCTATTGGAA AAGAACCAGT AGGCCAAAGG CTCAGGCCAA GCTACAGTCC TTCAAAGAGC AAGAATCTAA GAGACAGACA AAAAGAAACA AAGGCAAAAT TTTGGTGGCT GAACAAGTAA
|
Protein sequence | MDPCEVKKSI SYIDGGEVKI ALDAAKSNAP PSVIGRLLNI RTGDILSGSS LKNIRMQAEK GERNKFGAND NFTTQADQLL AYLESTPDVS FCAIYDEPDS PLFTVYKQRA KTGRRHLHTS TRSISGGIAQ QEVLNEKVLD AIDPRGELDD YIDRTRRAFK LKGNEKMLLG VAWTNNESRR IFARYSEIMV ADVTEGTNNA KRPLFLFSGK TSNQNTFTAL WAFLPQQSRW AFRWVWTRCI PQLLPEQGLN RMRLLITDGD PREYGTFLDA IPTWYSLCRH KLCHWHLLYR GSLMKAQTGN CGTKAKILFH VVLKWIESWM TKIETQEEYN LSTGLLIDWL KSPEALDTNL GGMGCALVSQ INAFLTSSLF PHEQRWARYH FLNVRAFNTS ASSYGEAENS ALKRRGDGVK PNFSVPKATR AINEGTQLRT VKRQQKAVHN LNATKKTKAA NYTNISDLVD CIQETISHEF NAAKKYDLFC PGPKEFWVKR AWYQIPSETY QDFNDSNFCQ FMIPQFERTR IVKITEIEGE LYLECSCGKF QRQASPCAHI YKVLNRPPQS TDVSVRWTKI WDVYLHRPGY HDLSDQLEEL YKKERPGPHF ENTNQWEVGK GEREYNYFKR SLPSEPTIIQ KYSRWADSFS RQPGCYVHKS TEQETVPAAS GMVQELTSLS QGYAIETQLD SEMDVGDVTV MQVEEIDSNL SKSGKSPYTN NLHFYEEISK LAKFNSKAAD IMTKGMQETL ELLQKHVAEG SGMVDYSIGP AIGKEPVGQR LRPSYSPSKS KNLRDRQKET KAKFWWLNK
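The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon (the gene sequence above ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for locus PHATR_43943.
length_bp = gene_length(495554, 497953)
print(length_bp)                  # 2400
print(protein_length(length_bp))  # 799
```

Under these assumptions both results agree with the Gene Length (2400 bp) and Protein Length (799 aa) fields listed in the record.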