Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45899 |
Symbol | |
ID | 7200991 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 611446 |
End bp | 612636 |
Gene Length | 1191 bp |
Protein Length | 331 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180081 |
Protein GI | 219118625 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACGG CAGCAACATC CGTCACCAAA TCCACCATTC CCAAGTTTCG CTACTTGACC GCATGGTTTT GTCCCTAGTA AGTACGGTGG ATCAAGGAAT AAGTTCGATT GGGAACGCCC GGCGAACACA AAAATGGTAG CCTTGTCCCT GCAATGGCCA ACAAACGTGC TTTCTCACAC AAAATGTGGG GCGCTGTTTG TTTTTTGGTG GTTCCTGCAA TCGATAGTGC GCATCGTGCG ACGTTGGCTT TGGAACACCA TCAGGGTCGT GTGGACTACG AATCGGTCGA AGCCTTGGGT TGGTATCAAG GTGACGACGA AAAGAACGTC ACTGGAACGG GTAAGGAATG GTACTACCAC TGGAAGGCGG ACGAACTGCT ACGGGTGAAC CCTTCGGCCT TGGTTCCCAC CCTCATTCCC ATCGACGCGA CTACCGATAC GCCAATCGAA GAAAGGGCGG TATTCGAATC ACTCGTGGCG ATCGATTACA TTGATGCCGT AAGTGGAGCG ACCGGAACAG ATCGATTGGT ATCCGTAGAC CCGTACCAGG CGGCGCGGTC GCGTATCTGG ACCGATCGAG TCAATCGTGA CTGCTGTTCG CCGTACTAGT AAGTCCGTAC CGACGCTTCG GAAGAACATC AGCGTTGTCC TTCTGGTTGC TCAACTTAGT TTTCGCGTGT TTTCGTTCAT TTTCGTTCAG TGGCGTGCTG GTACGCAAAG AAGACGACGA GCGACGAGAA CATTTTGCAA CCCTGGTCAA AGGACTGACG TCGTTTTCCC GGGAACTCGA AAAGACGGAC GGCCCCGTCT TTTTACCTGA CGGGCAGCTT TCCAACGTAG ATTTGGCACT CATTCCCTGG GCGTTTCGGT ATTACGTGTT GCAGCATTAC CGTGGTCCCG ATTTTGCCAT TCCACAGACC CCCGCACTGC AGCCATATCA CGCTTGGTTC GATCACGTTA TGAATCTGGA GCGTGTCCAG CGGACCCTGC CCGACAAGGA CCGGTACCTC GCGCATATTT TCAAATACGC CGATGGCAGT GCACGATCGC GGGTGGCCAA CGCGGTCCGT CGTGGAGTGG CAGCACACGA ATTAGACGAT GACAAGGACG AATCACACCC GCACACGAAC GAAACGAATA CCAAACCATA ATTAATCTCC ATAACTTGGG CTCTTGCCTA TGATTTTTAT T
|
Protein sequence | MATAATSVTK STIPKFRYLT AWFCPYLVPA MANKRAFSHK MWGAVCFLVV PAIDSAHRAT LALEHHQGRV DYESVEALGW YQGDDEKNVT GTGKEWYYHW KADELLRVNP SALVPTLIPI DATTDTPIEE RAVFESLVAI DYIDAVSGAT GTDRLVSVDP YQAARSRIWT DRVNRDCCSP YYGVLVRKED DERREHFATL VKGLTSFSRE LEKTDGPVFL PDGQLSNVDL ALIPWAFRYY VLQHYRGPDF AIPQTPALQP YHAWFDHVMN LERVQRTLPD KDRYLAHIFK YADGSARSRV ANAVRRGVAA HELDDDKDES HPHTNETNTK P
|
| |