Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38755 |
Symbol | |
ID | 7203744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 128421 |
End bp | 129452 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182903 |
Protein GI | 219125261 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.172 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGCAC CCATGGCAGG GGTTTCGGGG GGTAAACTCG CCGCTGCGAC CTGTCAGGCG GGTGCGCTAG GCTTCATTGC GGCCGGTCAT TTAATGGAGC TGGAATCGTT GGAACAAGAA TTAACTGCTT TTCGCCAAGA AGCTCCCACC TCGCCCCTCT GCATAGGATT CATCGGCTAT TCAACCTTCG GTACCGACGA AGGCTGGGAA AGGTACGAGC GTGTTCTTCG AAAGCACAAA CCCGCTGTTG TCCAGTTCTT TGCACCGGCG ATCCATACTC AACAATCTAC GGGGCGATCA AATGTGGATG TGGCTCACCA TCATGCAGCA CTGGTGCTTG CGCAAGTTGG CAGTGTGCAG GACGGACTCG CCGCTGCAAA CGCTAGTGTA AATGGTCTTA TAGCACAGGG TAGTGAGGCT GGCGGACACG GACTTCGGCG TGAAATGGGA TCTGCCGGGT CTACTCTCGC ACGTGATCTC ATTCGAAAAG TGGCTCAAGA TATCCCGGTC CTTTTGGCTG GCGGTATAGT CGACGGGTAC GGTGTGGCTT CAGCACTGGC ATTGGGATGT GATGGTGTTG TCCTTGGAAC TCGACTCTGG GCGAGCGAGG AAGCGCTCGG TCACGAATCT CTCAAACGCG CTCTGGTTGA CGCGGAATCA ACGGATAGCG TGCAACGGAC GACTGTGTTT GATCAAATAC AGAACACTTC GTCTTCCATT CCGTGGCCCG AGCCATTCGA CTCGCTGGGT GCACTGCGAA ATGAGACGAC AGCAAAATGG GATGGACGTA TGAATGAGCT TTCCGAGGAA CTCTCGACTG GCAGCCAAAG CACACTTTGT ACGATATATC GTGAAGCTCA GCAGGAAGGC AATGGGCAAA TTGCGGCGGT CTTATGTGGT GAAGGAGTGG GGGCTATCGA TTCCATCAAA TCAGCTTACG ATATCGTGAA GAAAATCAAC GAGGAATCAG TTGGTATCGT ACGGAGAATG CCGAAAATGC TTTTGGACAA TTGTGGAGAG AATCGCACCT GA
|
Protein sequence | MGAPMAGVSG GKLAAATCQA GALGFIAAGH LMELESLEQE LTAFRQEAPT SPLCIGFIGY STFGTDEGWE RYERVLRKHK PAVVQFFAPA IHTQQSTGRS NVDVAHHHAA LVLAQVGSVQ DGLAAANASV NGLIAQGSEA GGHGLRREMG SAGSTLARDL IRKVAQDIPV LLAGGIVDGY GVASALALGC DGVVLGTRLW ASEEALGHES LKRALVDAES TDSVQRTTVF DQIQNTSSSI PWPEPFDSLG ALRNETTAKW DGRMNELSEE LSTGSQSTLC TIYREAQQEG NGQIAAVLCG EGVGAIDSIK SAYDIVKKIN EESVGIVRRM PKMLLDNCGE NRT
|
| |