Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_29793 |
Symbol | |
ID | 7195191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 51749 |
End bp | 53557 |
Gene Length | 1809 bp |
Protein Length | 424 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183416 |
Protein GI | 219126337 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.832112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCCACAGCT GCCATAGCCC AAGGAACGAA CCGTCGCCCT GCTGTGACTT CCTCCCATTC ATACGGGAAG CCCTGCTGTA TTCGGGAAAA GAATCTTCTG CACGACACGA GACACCGCCG GCACGTGCGA TGCGGAACAC CGCAGTCGAA CCTTTCGTGT TGAATTTGTG ATGGGCACTT TATCTTGTCG AGAACGCATG ATAAGTACCG CTACACTCTG GGTCGCTTCC TTACTGGTCC TGTTCCCCAT GAGACGGCGA CTGGTGGGAG GGTTCATGGC CGAGTACAGG ACGCCCCGAT TCACAAAAGC ACGAACAAGG CTGCGATGGA ACCGAAACAC GTTGGGATCC GTAAATACTA GGATCCCCAC TGCCTCGAGC GGGACGGCAC GCCAAATCGT ACGCTTGTTC GCACAGTCTC AGCAGACAGA CAAAATACAC ATTCACGATG AAGACGAGCA ACAACAGAAA TGTTCCTTAC CGACCGTGGC TTCCATTTTA ACAACCTCCT ATCTTGATTT AGTTAGTGTA GAAGTACCCG AGCCCGAAGA GTCTGTCATA CATCTCTTGT CCACGGCCCT GGATCTGCCA TGGGAAACCG GGTATCGAGA CTTGCGCAAG ATTTTAATGC GGCCTCAGTC CCAAAGTATC CCGTCTTCGA ACAATTTGTT GGCCAATCAA GTACTAACCG CCGTGCAATT CAAAACCTAT CAGGCCTTGC TCGTCCGCCG GAAAACGATG GAACCCATTC AATATTTAAC TGGCCAGTGG GATTTTCTGG ATTACGTCTT GACTGTCCGA CATCCATTGC TCTGTCCCCG CCCCGAGACG GAAGAACTGG TGGAGCTTGT GCGGGAAGAT CTTGCTACGC TAGCTGCCAA AAACAACAGT GACCGTTGCC GTCTACGGAT ACTGGACGTG GGCTGTGGGA CTGGATGCAT TGGAGTTTCT TTGGCTGCTA AACTTCCGAA TAGCTTCGTC GAGGCAATCG ATGTCGAACA CGTGGCTGTC GCAACCGCTA CGGAAAACGC CGAGCGAGTA CTCGGCGCAC AATATCAAGC TCGCTTTAAC GCGCAACTTT GCGAGGCCGA AGTGTTCGAC GTGGCTACGG TCCAAGATCG TTTCGATGCG GTCGTTAGCA ACCCGCCCTA TATACCCCGC GCTGACATGG GAACGTTGGA AACTACGGTT GTTGACTTTG AAAGCGAGAC CGCCCTCTGC GGAGGAGAGG ATGGACTGGA TGTTGTACGT AGTATTGTGA AGAAGCTACC GTTCTGGTGT GTCGAAAATG CCGTTTGCTG GATGGAGGTT GATCCGACCC ATCCAGCTTT ACTTCGAAAA TGGCTGGAAA GCGATTGCTC ACTGGGCGTC GTCTTTGTAC ACACGTACCG GGACCTCTAC GGCAACGATC GCTTCGTGAA ACTAAGGGTA ACTCGAACGA AGTAAGGCTG TGGTACCTTG CTCCATTCAA CTATGACAGT GCAGTGCATG GAATGGCTTG CTTTAGAAAA TCTCTGGTGT GAGTATAGAG CAAACCATGT TCTATCGTCT GATCCGAATG GCCCCCTGTC AACTACTTTA CTGCTCGATG TGGGATCTAT CTTGAAGGCT GGTAGTTGGA ATACGTCTAT TGATTAAATA GCTGCTTACT GTTAAAAAGA ATAGAAGCGA AAATGGTTGT GGGATGATGA GGTTGCCATT TAAAGGCCAC TTCATGGCGG CAGAATGAGG CTCAATTTTT TGGTGCTTGA AGGATCTTGG TCAAGCTGCT GCTCATCCTC TGTTCGGAGT ATTTCCATAC TGGCCTCTC
|
Protein sequence | MGTLSCRERM ISTATLWVAS LLVLFPMRRR LVGGFMAEYR TPRFTKARTR LRWNRNTLGS VNTRIPTASS GTARQIVRLF AQSQQTDKIH IHDEDEQQQK CSLPTVASIL TTSYLDLVSV EVPEPEESVI HLLSTALDLP WETGYRDLRK ILMRPQSQSI PSSNNLLANQ VLTAVQFKTY QALLVRRKTM EPIQYLTGQW DFLDYVLTVR HPLLCPRPET EELVELVRED LATLAAKNNS DRCRLRILDV GCGTGCIGVS LAAKLPNSFV EAIDVEHVAV ATATENAERV LGAQYQARFN AQLCEAEVFD VATVQDRFDA VVSNPPYIPR ADMGTLETTV VDFESETALC GGEDGLDVVR SIVKKLPFWC VENAVCWMEV DPTHPALLRK WLESDCSLGV VFVHTYRDLY GNDRFVKLRV TRTK
|
| |