Gene Information |
Locus tag | PHATRDRAFT_37272 |
Symbol | |
ID | 7202043 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 583442 |
End bp | 584723 |
Gene Length | 1282 bp |
Protein Length | 351 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181403 |
Protein GI | 219122125 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAATC GAGTCGATAA TGGCACTACC ACTAAACCAC CCCCAAAGGC ACCCCGCTCC GCCTTTATGT GTTTCACCGA CAACAAAAAA GAAGCCCTTA TGGAACAGCA TCAAGTGAAG GAAAATGCAG ATGGTAGGTA ACAAACAGGA AATTTGCGTC TATACCAGAA TCAAGAAGCT TCGTTGGCCG GAAAATCCCT TCTTTTCTAA CAAAGAACTC ATATGTAGTT TTGAAGCTCG TGGCGACTGC TTGGAAGAAA CTCAGCGGCC GCGAACGAGC GTATTGGGAC GAAGAAGCTA GAAGCGACAA GCTGAGGTAA GGAATTCGCG ACCTGACTGT AGCAGATGTC GAAATCGCTT TCTGATTTTG TTGAGATTGT TCCTTGTATT TGCGTTTACA GGTTTGTCCG GGAAAAGGCG GAATACAAGG GTGTTTGGAC TATTCCCAAA CGTCGGGCCA AAAAGCATCC CCTGGCGCCC AAGAGGCCCA TGTCAGCCTT CCTCAAATAC TCGCAAACCC GTCGGGCTAA GGTCAAGGAA GAGAATCCCG ATATGAGGCA AGTCACCGCC GGAGAGGGCA GGATCCGTCT ATCAAGATTG CTGGAATTAA CCTCACAACC TTCCTCTTTA CAGCAACACG GACGTGTCCC GACTCCTGGG AGAAATGTGG CGTAATGCAA GTAAAACAGA ACGAGCCCCG TACGTAGAAG TTGAAGAAGA GGAACGGGCA CAGTACAAAG AAGAGGTAAA GCGGTGGCGC CAGAGTCAGG CACGGATGGA TGCCGATACA AGAACCAGTC ACGATGCAGT CTTGACCTGC AGCAACATCG GTGACTTTCC TGCTCCGATG ACTCCAGTGC CGTCCTATTT TGAAGATCCT CAAGCTTATC ATAATTTTGA ACCGCTTCGA ATTCAATCGG TCGATGATGC TATAAACAAG GCGGATCAGC GGATGTCCAG CAGCCGTCAT CATTCGCCTA CGTTAGCTGT CACTCAGTCT AGTTCTACGG GAGGAGACAG ACCCTTGTCG AGGAATGAAA CGTGGCGAGA TTCTTCGGAG CAGTCTCCGA TCCACCGACA AGACCAGCAC ATTTATGGGC AGTCGTTTCG TCCAGCTCTC CCTGTGCAGA AATCGGGGGC ACGCACTCCG TTCCGCCCAA GCAATAGAGA AGAAACATTG ATGACGAAGC GCGACTTCAA GATACCGAGT CAAGGGGGAT TTCGGGCGTT TGGGAACAAT TATCAACAAC CGTTCCGCCC CTTGTATGAT CATGGTGAGT AG
Protein sequence | MENRVDNGTT TKPPPKAPRS AFMCFTDNKK EALMEQHQVK ENADVLKLVA TAWKKLSGRE RAYWDEEARS DKLRFVREKA EYKGVWTIPK RRAKKHPLAP KRPMSAFLKY SQTRRAKVKE ENPDMRQVTA GEGRIRLSRL LELTSQPSSL QQHGQRAPYV EVEEEERAQY KEEVKRWRQS QARMDADTRT SHDAVLTCSN IGDFPAPMTP VPSYFEDPQA YHNFEPLRIQ SVDDAINKAD QRMSSSRHHS PTLAVTQSSS TGGDRPLSRN ETWRDSSEQS PIHRQDQHIY GQSFRPALPV QKSGARTPFR PSNREETLMT KRDFKIPSQG GFRAFGNNYQ QPFRPLYDHG E
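The length fields in this record can be cross-checked against the coordinates and sequences. The following is a minimal sketch, not the database's own method. The function names are hypothetical. It assumes that Start bp and End bp are 1-based inclusive coordinates, that GC content is the simple fraction of G and C bases, and that the coding length counts one stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


if __name__ == "__main__":
    # Values taken from the Gene Information table.
    print(gene_length(583442, 584723))  # 1282
    print(coding_length(351))           # 1056
```

Under these assumptions, the 1282 bp gene holds 1056 coding bases, which leaves 226 bp of non-coding sequence. That is consistent with the gene being marked as spliced. Passing the Gene sequence field to `gc_fraction` gives a value to compare with the reported GC content.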