Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49299 |
Symbol | |
ID | 7195473 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 506365 |
End bp | 507764 |
Gene Length | 1400 bp |
Protein Length | 381 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183887 |
Protein GI | 219127323 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATGCGGAG AGCAGCCCGC GACTACTGGT GGGAACCAAA CCGAGTTTTC CAAGTGAATA TGGGAAACCG CGAGAAAGAA GTGAGAAGCG ATCATCTCAC AAGTTGCCTC CCAATTGTGT GCGAAACGCA AACACATCAA AGAACGGCAC CCTCTTTTGG AACCAGCAAG TACTCCCACG TTCGGGTATC TCAACACAAC TTTGAATGGA AAAGATGCTC TTTACCCTCG CACTGCTCGC CTGGGCTACA ATAGCAGTCT CCGCATGCCA CAGCGACAAT AGAAATATTT GACTCTTCGG ATCAGCACCA CCGCAGTCTA CAGACCTTGA ACACTAGTTT GTACGCCCTG AATGATTTCC TCTTTCGGGA AAGCGAGTGC CAGTGGGAGT TTTATGTGTA TTCCGACTTG TGCCGAGGTT TTCCGTATCG ATTTGAGATA TTTACAGCTC CAACTAATAT CACCACAATA GCGGACCCGA CCCGGACCTG TTTCAGCGAA GGAGGAGAGT CCTTCGATTT TATGGCCAGC GGGAGCGGAA TCTCATCAAT CTCGTCTACC TCCGCTCCTC CTGTCAGAGG CTACTTTCTT GGATTCAATT TCGACGAAAA CGTCACCGAG GCCAGCCCCT TCTACTACAC AATCAATGGC ACAAACAACA AGGAAATCAA GTACATTTTT TGCGTGAAAC TCACTCTCAC CCAAAATATC TCTGGTACCG TGACAGATAT CAATTTTAAA CAGGTAGCTA TTCAAATCAA CGTGTCCCTC GACGGTACCC TCGGTGACTC AAGCGCGAAT GCGTTTGAAG TCGATGCTGG TGTGGTCAGC ACCGACACGA ACAACGACAT TGCATTCACG TCAAGTGCGG ATCTGTGCAG TACCTTCAAT GGCAATCCAA CGCAGGGACA AGCCATTCCT ATTTGCATTG TTTCGGACAA CCACCCGTTG GCTCGAATCG TGTCGGTAGA GGATCTTATG TTTTCTTCGG GCAGTTTCAC CCAGTTGATC CTCGCGGACG GCAGTGCCGC GACGGGAGCC GAAGGGCTGT ACGGAGTGCC AGATCCCTTG AACGCTGAGC ATTGCGCCGT CAACAAGTGC ATTCAGTACA ATGTGATGCT GTACGCCGTA TTTGCAACGA CGGGCGCAAA CTTGGATATC ACGATTCGAG GCAACGTGGT GCTAGCGATT GGCGACGGTA CGCGTATGCT GCGCGCCCAA TTCGAGCCCA CCCGCGCATT GCAGGACGTG TTCCGTGAAC GATCCTTTCG CTCGAAGATT GTGTTGCCGG CTCTGTCGAC GACCGAGTCG GGTGCGGTCT CGACGCTCGG CGCGAAAAGT GTTGCGGCGA TGTTCTCTTT CGCTGCGTTT GTGGCGACCT GTTGGTTGTT TGATCTGTAA
|
Protein sequence | MEKMLFTLAL LAWATIAHHR SLQTLNTSLY ALNDFLFRES ECQWEFYVYS DLCRGFPYRF EIFTAPTNIT TIADPTRTCF SEGGESFDFM ASGSGISSIS STSAPPVRGY FLGFNFDENV TEASPFYYTI NGTNNKEIKY IFCVKLTLTQ NISGTVTDIN FKQVAIQINV SLDGTLGDSS ANAFEVDAGV VSTDTNNDIA FTSSADLCST FNGNPTQGQA IPICIVSDNH PLARIVSVED LMFSSGSFTQ LILADGSAAT GAEGLYGVPD PLNAEHCAVN KCIQYNVMLY AVFATTGANL DITIRGNVVL AIGDGTRMLR AQFEPTRALQ DVFRERSFRS KIVLPALSTT ESGAVSTLGA KSVAAMFSFA AFVATCWLFD L
|
| |