Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21485 |
Symbol | |
ID | 7202315 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 147844 |
End bp | 149636 |
Gene Length | 1793 bp |
Protein Length | 430 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181668 |
Protein GI | 219122678 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.323695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGAGCACGAC GTCTCCTTTG CTGTTTAAAG ACTAATTGCA AAGCGCTTCT GGGCGAATCT GCATCTTCTC CGATTACCTC TGCCAACCCT TTGTGACATT GCAGCAAGCC ACCGCTTGTA CCATTCAAAA AAGCCTTTCC TGTCGAATTT CCCGTAGTCT AGAATCATGT CAACTGGAGA ATCCGCGAGT CGAGCGACTT GGGATTTGAC CTCCCAGATT TCGCCCTTTC TGGATCGACA CATGATTTTC CCCCTCCTCG AGTACCTCGA TACGCTCATT AAGGACCAGG ACGGCAAACA CTTGGTGGAT TACAATCCCC AAGATGTGGC GGCGGCTCGA CTCGAAATTC TGCGTCCGAC GCACATGGTC GACTACGCCA TTGATGTGCA CCAAACGCTT CATCCTGGTC AGCCCGTACC GGACGAAATG GAAGCTCAAA AGAAGACCGT GTACGAAGAA TTGGAAGCCC TACGCGCAGC GTGTGGCGCC TTTGATAAAC TTTGCCAAGA CGAAACGGAA CGTGTACGTT GGTTGGTTGG ATTAGTTGAG TAGTTGATTG CGAGACCTTT GGTGGATGTG ACTCACATAG AATATCCTTT GAATGTTGTC TTTGCAGAGC AAACTCATGG CTACCGGGCA ATGGAACTTT GGAAGTTTGA ACCAATCACA CCAAATTACC CCCGAAATGG TGGAAGCCTA TCGTAAATTG GCGCGTTTCC AGTTTGAGTG TGGAGATTAC CAATCGGCTC GAGCCATGCT GAGTTACTAC ATTGCACTTT TCGCCAAGGC TCCACTTACG GCGGTGGATG AAGCTGACGA TGACATGATT ACGGCTGGTG CGGTACAGCA ACAAGCGTCA CAAAACGACA AGGATCTCGG CAACGCCAAC ATGTACTATT TAACAGCCGT CGACGAAAGC ATGTTGCAAG TTCTGTGGGG ACGTTTGGCT TGCGAAGTGT TGGTGGAAGA TTGGGACGGC GCCAATGTTG CGGTAGACGC GGTCAAAACA GCAATGGAAT CGCTCGTTAC GTCTCGCACT CTCAACGCCT TGCAGGCCCT CCAACAACGC ACCTGGCTAC TCCATTGGTC ACTCTTTGTC TACTGGAACG GGAATCGCTT GGAAAACTTG GTCGACTTGT GTTTCCAAGA AAAGTACAAG CAGGCTATTA CGACCAACGC TCCACATCTG TTGCGCTACT TGACCGCCGC CGTGCTCTTG TGCAAACGCC GAGTGGTGAA CGTCACCAAT AAGAAAGAAG CGCAGGCCGT ATCGCCGCGT CGGGTGCTGC GTAGTCTCAT CTATGTGATG CAGGACTGTG ACTACGTGGA TCCGATTGTG GATTTTGTGG ACTGCTTGTG CGTCAAATTC GATTTCGACA AGGCCCAAAC CAAACTGGCG GAATGCGAGC GCGTGCTGGG GACGGACTTT TTCTTGTGCC AGCAGACGGA CCTTTTTATG GAAGAAGCCC GGGTCTTTGT ATTCGAGAAC TATTGTCGTA TCCACAACAA AATTGATCTC CAGACGCTGG GTGATAAGTT GGCCATGGAT CAAATCCAGG CCGAACGCTG GATTGTTGAT TTGATCCGTA ATGCCGATTT AGACGCCAAG ATCGATTCTG AGGAAGGGTG TGTCGTGATG GGTGGCAATC CTCAGAGTAT TTACGAACAA GTGATGGACC GAACAAGGGA TTTGAATGTC CGATCTGCAA CCTTGGCGCA AAACTTGAAC AACTTGATGA ACCAAACAAG GAAGGAACGG ACCAAGAAGG AACGATCGAG CATGGAGGAA TAG
|
Protein sequence | MSTGESASRA TWDLTSQISP FLDRHMIFPL LEYLDTLIKD QDGKHLVDYN PQDVAAARLE ILRPTHMVDY AIDVHQTLHP GQPVPDEMEA QKKTVYEELE ALRAACGAFD KLCQDETERS KLMATGQWNF GSLNQSHQIT PEMVEAYRKL ARFQFECGDY QSARAMLTVD ESMLQVLWGR LACEVLVEDW DGANVAALQQ RTWLLHWSLF VYWNGNRLEN LVDLCFQEKY KQAITTNAPH LLRYLTAAVL LCKRRAVSPR RVLRSLIYVM QDCDYVDPIV DFVDCLCVKF DFDKAQTKLA ECERVLGTDF FLCQQTDLFM EEARVFVFEN YCRIHNKIDL QTLGDKLAMD QIQAERWIVD LIRNADLDAK IDSEEGCVVM GGNPQSIYEQ VMDRTRDLNV RSATLAQNLN NLMNQTRKER TKKERSSMEE
|
| |