Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37440 |
Symbol | |
ID | 7202359 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 154002 |
End bp | 155126 |
Gene Length | 1125 bp |
Protein Length | 271 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181669 |
Protein GI | 219122681 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTAATA AAGCCAAGGA ATGGTATCCA AAGATTGGAG GAACCTACAG GCTGATGGAT GACGTACGTA CCGTCAGTGA AAACTTATAT TTACGACCAG TGTCACAAAT GCTAGAGAGC TCTAGCTAGA GTAGACACAC AATTCTTATG TGACTTTATT AGTACCCTCG TGTCAAAACT CATCCGCGTT TTTGTGGACC AAATACTTTT GTTGCTGCTG CCTATATTAC TTTTCCTCAA CAGCGAGAGG TCTTGCTATA GCCATGAGGC CCGTTTTGAA GAAGCGAAAA TTCAAGGAGG TCATCGATTT GTGTTCCGAG GACGAGAGCG TTGAGACCGG AAAGCGCTCG GCATGTTCGA CCGTAAAGTC GGGGGTGCGC CCAACTGTTC CAATCGACGA CGGTGACTTG ATCGAGATGT CTTACAAAAA GGATAGAATA AATCACAGAC CGCTTGTCTA TATCGATACA GAAGGGGGGG ATGACAGTAT CGTGTCAGAA GACTGCTACT ATCCGTTGAT GGAATGGAAT AAGGGGGCGG CACGGACCTG CGCCGGCGGC GATGAAACCA TCAAACGACC CAGCTTTTAT CTCCTTCATA TCCAACAACG CGACAAATGG AGCTGTGGCT TTCGCAATTT ACAAATGTTG CTAGTTACTT TGGTTCCCTC TCTACCACCC GAGCATATTT ATTTTGATAA TGGCAGCTTG AATGGACAAG CTTTTCGGGT GCCTTCTTTG CAACAGCTCC AAGCTTCTTT GGAACGGTCG TGGCGTGCCG GATATGATCC CGATGGTGCA CAGCACTACC AGAATCGTAT TTTGGGTAAA AACTCGAAAA TTGGAGCAGT CGAAGTTTCG ACAACCTTGT AAGTTTGCTG ATCGAGTGGC CGAGGCTACG ACTACAATTG ATGCTGACCA AGAGATTTTC ACCAATTTAT AAAATCAACA GATCATTCAT GAGTATTGAC TCGGTCGTGG TACAATTTAT AACTTGCCCA GAGTCTCGGG GCAAGTTGGG GCCTTTTTGC TCTACGTACT TTGGAAAGGA AGCGGACTGC TGTCCCTTTT GCCGGACCGT CGCGATCCCA TCGTGTTTAA CTATTGCCAA GCAGGTTGTC AATAG
|
Protein sequence | MVNKAKEWYP KIGGTYRLMD DVPRGLAIAM RPVLKKRKFK EVIDLCSEDE SVETGKRSAC STVKSGVRPT VPIDDGDLIE MSYKKDRINH RPLVYIDTEG GDDSIVSEDC YYPLMEWNKG AARTCAGGDE TIKRPSFYLL HIQQRDKWSC GFRNLQMLLV TLVPSLPPEH IYFDNGSLNG QAFRVPSLQQ LQASLERSWR AGYDPDGAQH YQNRILGKNS KIGAVEVSTT LVSGQVGAFL LYVLWKGSGL LSLLPDRRDP IVFNYCQAGC Q
|
| |