Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39571 |
Symbol | |
ID | 7195241 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | + |
Start bp | 160215 |
End bp | 161581 |
Gene Length | 1367 bp |
Protein Length | 404 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183561 |
Protein GI | 219126642 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCGAA TTCTACTTTT TGCTATGATT CTTCTGGATG CAGCCAGCCT TGCTCAGCCT AATGACGAGG CGCAAATCCA TAAAAGCGTA TTTGAGAAGC GAGAGCTGAT GAATAATTAC ATAGAACTGA GCATGAGTAT GCCAGTCATT GGGAAGGGCA AGGGTAAAGG TAAAGGAAAG GGAATCGACT ACAGCCTATC ATCGAAATCC TCAAAGGCCA GCTGGTCCAA ATCGTCCAAG TCAAAGGGAA AAGGAAGCAG CAAAAAGAGC GATAAAAGTG GAAAGGGTGG TGGCAAAGAA ATTTCGCTCG TTCCCACAGT TCCGACCCCT ACTACGGCTC CAATTGGTAA GTTGCGCTTG TCTTACATTA AAAAGGAAAA ACCCAAATTG AAAGATCCTA ACCGACTTAA CATTCAGTGG CACCTCCTAC TATCACTACA ACCTCCCCGT TTGGTAGTAC AATGAGTGAG TCACGTAAGT ACAGCGAAGC AGAACCTATT TGCCCGAAAT TTAATAAACT AGATTTTTCA CTAACTGTAC CTTCCCGTCC ATCAGCAAAT TCGACGCCTG CGCCCACGTT GACACCAGTA TCTATTTCTC CATTCACGAT TCTGTATACC ATCGAACAAT TCCGCCTCCC TCTCGCAAGC GAGTATGTTT CGGTCGCAAA TCTCACAGCG AACTATTTGA ATGAGTACTT TCGAGCAAAT TTCCAAGAGA CGAGTCTTGT AGACTTTGTG ATTGCCGACA CGATGATGAC TGACAACAAC TTTCAGTTTG GACAGCCTGT CGAGATTGAT TACAGAACAC AGCTTACGTT TGCGTCGGCT TCCTTTATTC CTTCTACAGA GGAATTCGAC GAGTTGTTGG CCAGTGCGTT CCGCGACGAC AACTTGGCAA TCTACATTTC TCTTTTGAAC AGTTTGCCAA TCAGCAACAT CTTCCAAACA ACGTCGCTTG TTACTTTTGA AGGATCTACT TCCGCTACCG TACCCGCGAC TAGTGCAGAC TCCGCCGCAA GAGCAGCCGG AATCGCGGTA GCAGCCGGTG CTGGAGCCCT TATCTTTATC ATCGCTGGTG TAGTTATGTA TCGCCGAAAG GAAAGAGAGG AAGTTGGCAA GCGCCTCGAT GAAGATGGGC AGATGACAGT TGCTGGTGAC ACGTGTGGAG GCTCGTCAAT GGATTCCCAA TCGGTGGTTA ATCAAACACA TGCAATGAAC GACGCAGATG GCTCCTCGGT ATCGGAATTG GGCGACTTTC AGGTCGCTCC ATCAAATCAT CCCATTCTCG AAGAAGGAGC TGAAGAGGAA ACGGACGACG AGTGCGACTT TGAAGAAAGA GGCCAACTTT CGGAAGTCCA ACTGTAA
|
Protein sequence | MMRILLFAMI LLDAASLAQP NDEAQIHKSV FEKRELMNNY IELSMSMPVI GKGKGKGKGK GIDYSLSSKS SKASWSKSSK SKGKGSSKKS DKSGKGGGKE ISLVPTVPTP TTAPIVAPPT ITTTSPFGST MSESPNSTPA PTLTPVSISP FTILYTIEQF RLPLASEYVS VANLTANYLN EYFRANFQET SLVDFVIADT MMTDNNFQFG QPVEIDYRTQ LTFASASFIP STEEFDELLA SAFRDDNLAI YISLLNSLPI SNIFQTTSLV TFEGSTSATV PATSADSAAR AAGIAVAAGA GALIFIIAGV VMYRRKEREE VGKRLDEDGQ MTVAGDTCGG SSMDSQSVVN QTHAMNDADG SSVSELGDFQ VAPSNHPILE EGAEEETDDE CDFEERGQLS EVQL
|
| |