Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38694 |
Symbol | |
ID | 7203398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 707819 |
End bp | 708865 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182588 |
Protein GI | 219124601 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCCT TCGCGGGAGG GTTCGGAAAC GACACGTCAA AACGCGCAGC CAAGCAAAAC AAAAATTCCA GACGTCCGAA ACGAGGATTG AGTGACTTGG CTCCTCCAGT TGCAAGTACC GAATCCATGT CAGTCCAGCT CGATAAATGG GGGCTTCCTC CGCCAACCGC GGAAGACATT TTCCCTGCTT TGCCTCCCGG AACTGAGCTG ATACCTGCTA GGAAATCGAG CACAGAACAC ACTTTGACCG AAATCCGGAA GGCTCTGAAA GATCACTTCC CTCTGACGCT CGATCGTTTT GATGACAACG GGGTAGAAAC GAGCACACTC GACGGCCGCG AACCCATGCG ATTGCGAATG TTGCATCAGT CACCTCCAGT GATTGCTATT GATCACTTCT TAACCCCCGA TGAATGCCTA AACGTTGAGC ACGCTGCGAT GCCGCCTTCT TCAACGCCAA AACACACAGA AGGGGAATAC CACAACGAGG CCGTTCGGGT AAATTCGGCC ACTTTTTCGC CTTTGGCTCA GTCTAAACGC ACTTCTACGT CCTGGTTTTG CTATTATTCT CAAGTACCAA CGTTGCTGGC GAAAGCACAT CATGTGTTGG GCATCTTGTT CCCACAAATG GAAGAACCGC AAATTGTTCG GTACAAAACG GGCGAGGAGT TCTCGTGGCA CTACGACGAA GTTCCAACTC AACAGCTTGG GAACGGCGGA CAGCGACTGG CTACGCTTCT GGTATATCTT AACACAGTTG AAAGAGGCGG TGGTACAGTT TTTAGAGACT TACGGACCCC GGACGGATCT CTTTTGACGA TGCAACCAGT ACAAGGGTCG GCTCTTATGT TTTTTCCCGC CTATGCCGAC GGCCGACCCG ACGATCGAAC CTTACATAAG GGCGAAATGG CCGCGGATGA GAAAAGAATA GTGCAAATGT GGATCCATGA GCGACCTTAC ACGGCTGCCG TACCGATGGG GAACTCTCAG GAAGCTGCTA TAGAAGCAAT TGTACGAGCC AGTCGAGATC TGTGCTACTC TCCTTAA
|
Protein sequence | MTAFAGGFGN DTSKRAAKQN KNSRRPKRGL SDLAPPVAST ESMSVQLDKW GLPPPTAEDI FPALPPGTEL IPARKSSTEH TLTEIRKALK DHFPLTLDRF DDNGVETSTL DGREPMRLRM LHQSPPVIAI DHFLTPDECL NVEHAAMPPS STPKHTEGEY HNEAVRVNSA TFSPLAQSKR TSTSWFCYYS QVPTLLAKAH HVLGILFPQM EEPQIVRYKT GEEFSWHYDE VPTQQLGNGG QRLATLLVYL NTVERGGGTV FRDLRTPDGS LLTMQPVQGS ALMFFPAYAD GRPDDRTLHK GEMAADEKRI VQMWIHERPY TAAVPMGNSQ EAAIEAIVRA SRDLCYSP
|
| |