Gene Information |
Locus tag | PHATRDRAFT_20066 |
Symbol | |
ID | 7200391 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 738996 |
End bp | 740518 |
Gene Length | 1523 bp |
Protein Length | 347 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179705 |
Protein GI | 219117835 |
COG category | |
COG ID | |
TIGRFAM ID | |
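Note on the length fields: the Gene Length value matches the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That reading is an assumption; the record does not state it. The sketch below checks the arithmetic and estimates how much of the gene span is not protein-coding. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


gene_len = span_length(738996, 740518)   # Start bp, End bp from the record
cds_len = coding_length(347)             # Protein Length from the record
non_coding = gene_len - cds_len          # introns plus any untranslated flanking bases

print(gene_len, cds_len, non_coding)     # prints: 1523 1044 479
```

The 479 bp remainder is consistent with the record marking the gene as spliced. This arithmetic alone cannot tell how that remainder divides between introns and flanking bases.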
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTATGCGAC ACCCGTTTCG CGGCTTGCGG GAAAGGTTTG AAACTTGGGC CCTCGACCAT CGGAAAAGCT ACCACACGGA GATCGAGAAG CAGAAGCGGT TTGAAATTTG GGCCGAAAAT CATCGCCGCA CATTAGAAAA AAACGAACGC CATGGCCCGT GCCGTTTAAC TGAACAACCA GTCTTTGGTT CCAATCGCTT TCAAGACTTG ACGGACGAGG AGTTTCAATC CAGCTACCTC ACCGGATACT CCAGCTCGAA TGCTAGGCGT CTTTCGTTTT CCAAAGATAG TGGAGTGCTT GACCCATCAA AAAATATGAA ACGCCATCCC GAAGTACACC GACGTGTTGC TATGCAGCGC GAAGAAACAG GTAATATGGC ACGGGGATCC AAGTGGCAGA ATTGCGAATG GTGGAACGTG TCATGTGGTC TACGCTTTAT CTTTGAGAAC TACTTTTACG GATTCGGTCA CACAATGGAA CCTAAATACG ACGAAAGCTC GTACCCGAAA GGTAATTCAA ATGAGATCAC CCTGTGCTTT TCCAATTCTC GTTATCTAAT ATACCACGTC TCCTTCTGAA GCTGTGGATT GGCGCGGTAT TGGTGCGGTC ACTAGTGTGC ATTCGCAAGG AGACTGTGGT GCCTGCTGGG CCATTACGGC TGTAGAAACA GTAGAGTCAG CTGTCTTTTT GGCTACTGGA ACTCTATATG ATTTATCGGA AGCAGAAGTC ACCCTTTGTC AAGAGAATTG TGACATGTGC TACGGTGGAT GGCCGCAAGA CGCATTTGAC TATATCATGG ATCATGATGG CCTGCCGTTG GAAAGTGATT TGTCGTACAA CGGAAGCCTC CTGCTTAAAC TATCACAAGC CAAGGTTGGT GAGAGCGACG AAATGAAGTA AGTTGGTAAA GACGAGGGCA TAAACAGCGA CTGGTATTCG TTCTTACCTA CTGCTTATTT CTTTGTGCAG CGAAAGTTCA GTTGAATCAT ACATGAACGA TTGGTGCCCA GCTAACGGGT CCAATACATT GAAGCGGTAC GGACAGATTG AAGGCTATGG TTATGCCACA TCTCGATGTG TTTGTTACAC GGATGGATCT GGTTGCGACT GCGATTCACA GGATGAAGGT GTGGCGGTCA GAAATCTCGC TACCTATGGT CCAGCAGTCG TCTGCGTAGA TGCTTCGACT TGGAAAGATT ACAGCGGTGG CATCATAACT TCCGAATCTG GTTGCAGTCA AAAGTTTCTA GACGTAAATC ATTGCGTTCA AGCTGTCGGC TATGCCTATA CCTCTTCCGG AGGGGGCAGC GAAAGCGAGA ATGGAAGCCA CGATAGTGGT AGTCAAGACG ACTCCGGATC CCGTCAAGGC TACTGGATTG TTCGGAATCA ATGGAGCTCG TACTGGGGTA TGTCGGGCTA TGCATGGGTT GCCATGGGAG AAAACACATG TGGGATCCTA AACGATTTCG TTCAAGCTTA CGCATAAAAG CCTAACTTTG ATTAACATTC GAAAGCAGAT TGC
|
Protein sequence | MRHPFRGLRE RFETWALDHR KSYHTEIEKQ KRFEIWAENH RRTLEKNERH GPCRLTEQPV FGSNRFQDLT DEEFQSSYLT GYSSSNARRL SFSKDSGVLD PSKNMKRHPE VHRPVDWRGI GAVTSVHSQG DCGACWAITA VETVESAVFL ATGTLYDLSE AEVTLCQENC DMCYGGWPQD AFDYIMDHDG LPLESDLSYN GSLLLKLSQA KRYGQIEGYG YATSRCDEGV AVRNLATYGP AVVCVDASTW KDYSGGIITS ESGCSQKFLD VNHCVQAVGY AYTSSGGGSE SENGSHDSGS QDDSGSRQGY WIVRNQWSSY WGMSGYAWVA MGENTCGILN DFVQAYA
|
| |
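Note on the sequence fields: both sequences are displayed in space-separated blocks of 10 characters, so the spaces must be removed before measuring length or base composition. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used as a small example.
example = "GGTATGCGAC ACCCGTTTCG"
seq = clean_sequence(example)

print(len(seq), gc_fraction(seq))  # prints: 20 0.6
```

Applied to the full Gene sequence and Protein sequence values, `len(clean_sequence(...))` can be compared against the Gene Length and Protein Length fields as a consistency check.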