Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43352 |
Symbol | |
ID | 7197102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 203132 |
End bp | 205436 |
Gene Length | 2305 bp |
Protein Length | 347 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177577 |
Protein GI | 219111651 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0470294 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCAACTAGT ACACTGGAAC GAACCGCTTA CCGTTTCGTA CATTCCGTTC TCAGCATTAT CATGGTCAAG CATACAGTAT TTGGTGTGGA ACGGGACTTG GATATTCCCC ACCAAGTCTT TCCGAACGAA CAAGAACTCA CCGACATGGA AGCGGAACTG CCGGGTTGTC AGCACGGATG GTTCGAGTCG GTGCACGAAA GCGCACAGCT TCACTACCGC AAATTCTTAC CCGAAAACAA CAAACCACCC AAAGCGGTGG TTGTGTTCAT GCACGGTATT GCCACGCACA GTGGCAAAGG ATTCACCCTG AACGGACGGA AACTATGCAT GGCCTTGCTG TCGGACGCTT GCAACGCCCA AGACTGGGCG GTGTACGCTT ACGACTTGTA CGGACACGGT TTCTCCGAAG GCAAACGCTT TTTTATTCCT AATTCGTGGG AAACCAACCG ACAAGATTTG GTCAATTTTT GTAATCTAGC GGCCGAAGAC TACCCCGACG TCCCGCTTTT CATCGTCGGA GAATCGTACG GATGTACCCT GACCATTCTC GCCGCCAAAC AATTTCAGGA ACACCCCGAG ACGGGTCCCA AGAATTTTAA TTCGATTGTT CTCACAGCAC CCGCCATTAT CGGTGATCTT CCGCCTTATC CCGTGTACTT TACGCTACGC TACATTATGG CCCCGCTCTT TCCTTCGTGG CGTCCCTTTT TCATGCCCAA CCCTGTTTCG GCCGATCGCA TTTGGAGAGA TCCCGAAGTT TTGGCCAAGT GCTCCACCCC CCGCCAACGG TCCATGCAAA TTGACGGTTC CGGGTTGCCC TTCCGCCTCG GTACCGCCGT GAATCTCGTA ACGGCTTTGG AAGCGGTCCG TACCAAGGCC ATTCCCGGAT TCCGACTCCC CTACTGCATT ATTCACGGGA CAGAAGATTA CGGCGTGCCC ATTGCTGGCT CCGAGTACAT GTGGAAAACC GCTGTAACGC CCGCGACGTG CCGCGTCTTT CGACGTCAGC CCGGCGCCTA CCACGATCTA TTGAGTGACC CCACCGCCGA ACATACCATG CAAACGATAC TGGATTTTAT CCGAGCCCAA CTGTCCAAAA CATAGTTTTG TATGCACCCC GTTTCACACG TATAGGTAAA GAGACTGCGC TTTGTTGCAT AGGGAATGAT GACCTTTTGT TGCTCTCTGG CTCGGCCCTA TGTATTGGCC AGTAGACCCA TCACCTCTGC TATGTGAGAC GAATCTCCGC ATCTATACTA CTGCTACCAT TGTACACGTT GCGCCTAAAT CTGTGTTGGG ATTAAAATTA ATTTAGTTTA AACGATGATA TGCCGCCTTA CAAGACAACG AGTCTAGACG TTGAGCACTT TGCTACACGG CAATTTAAGG TTGGCTGGTC GAACTGGAGG TGACCACAGA GAAAGTTCCT GCCGTGGTGG CGTAAGCAGA CGACCCTGCT CCGGTGGTGC TGACATCAGA CCCTCCATCG CCTATGGTGC TCACATTGCC TGCGGAACCA GCGTTGTTGT CACCGGTAAA TGCTCCCTCA ACCAGGCCGA AACCAATGGC TGTCCCATTA TTCATGCTGT TGGCCGTTGA GTCAAAACTA CCGTTGTCAA TAGCGAGATC GACCGACCCT TGGGAACCTT TATCGACGCC GCCAACAACA ATTTTTTCCG TAGTCACCGC GTTGTCGAAA TCAACACTTC CAGAAAAATT GCTGCTACCT TCATTTAACG CGCCGGCCAA AGAGTCGGCG AAGACCTCAC CATTAAGACT TCCACCGAGA ACCCCGACCT CGTCAATTCC ATCAAAGTCG AAATCAATGT AGCTGGACGA GTTGGCGGTG GCCTCAACCG TACCGTTTGC CATCAATGTT TCAATCGAAT CAGGAGGGGT GGGCAAAGAT ACTTCCAAAC CTGAAGCCAT ATCTGATACT GTTTTGGTAG AAGCGTCGGC TTTCGAGGCC TTCCCAGAAG CCCTGTTATT CATCATTCGA TTGCGATTAC GATTGCGAGT TCCTCGGGAC ATCAATTTGC GTAAGCTTGG TTCTTCCTGG TCAACAAGTG GAGCTCGCAA GTTCGCTGCA TGGCTAATGG TAGCAGCGCA AGCGATGAGG ACACAAGCGA GCATCACTTC CGCGGATCTG CATGGTGAAG TGCAGCCGAT TAATAAGCGG AAGGGAAAGA TGAGTGGGGG ATCAAATCAA TGTAGTGTTC TCCCGTGTAT CGTTGTACGC ACCTTGTTCG CACTTGTTTG ATCATGATGA AATGTTTGAC AGTGAGAGTT TGGGTTGTGC TGGTT
|
Protein sequence | MVKHTVFGVE RDLDIPHQVF PNEQELTDME AELPGCQHGW FESVHESAQL HYRKFLPENN KPPKAVVVFM HGIATHSGKG FTLNGRKLCM ALLSDACNAQ DWAVYAYDLY GHGFSEGKRF FIPNSWETNR QDLVNFCNLA AEDYPDVPLF IVGESYGCTL TILAAKQFQE HPETGPKNFN SIVLTAPAII GDLPPYPVYF TLRYIMAPLF PSWRPFFMPN PVSADRIWRD PEVLAKCSTP RQRSMQIDGS GLPFRLGTAV NLVTALEAVR TKAIPGFRLP YCIIHGTEDY GVPIAGSEYM WKTAVTPATC RVFRRQPGAY HDLLSDPTAE HTMQTILDFI RAQLSKT
|
| |