Gene Information |
Locus tag | PHATRDRAFT_44665 |
Symbol | |
ID | 7197651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1203508 |
End bp | 1205076 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178384 |
Protein GI | 219115177 |
COG category | |
COG ID | |
TIGRFAM ID | |
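The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1203508
end_bp = 1205076
gene_length_bp = 1569
protein_length_aa = 522

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```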
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCTGTGC AGATTTTCAA CGAAACAAAA GCGTCGACGC GGCTCCATTT TGTCGCCACG TTTGGTCTAG CCAAAGCCCT TGCCAACGCC ATGAGTGGTA GTCTGGCTGA TACGCTTGGC CGGAAACCAG TACTGATTTT GGGCTGCTTG GTAGGCTTGC CCGTAATGCC CTACGTTATT GTGGCGAATT CTTGGAGCGG CGTTACACTC ATGAATGTTC TCTTTGGACT GTCACAAGGG TTGCTAGGCT CTTCGCTTTT CTTTTTATTA ATTGATCTAA TGGGATCACG TCGTAGGGGA ATCGCTGTGG GTATGGGAGA ATCAACAATT TACGTTTCGA CTGCCATTGT AAACGTTCTT GCAGGGCGAC TTGCTTCAGT GTACGGGTTT CGACCTGTTC CTTTTTATGT GGCAACCTCA ATAAGCGTGT TGGGACTTTT GTCGACCATA CCCCTGCAAG ATACTTTGGA TCAAGTGAGG GCCGAACAAA CAGAATCCGA ACGGATCAAT CGGAAGAAGT ATGCTCGGTT ACTGAGTACC CAGCCCAGTA AGCTGGACGA AGAGCGAATA GACACTCCAA ATTGCACTGA TCGTAGCCGT CTCGATACCA CCAATTGCGG CATCGGCGAG ACTTATGGTA GTGTAGATGA AGGTTACGTT TCCCCGTGGC GGATGTCGAC GCACACGCTG GATACGACTG ATAAGGAAGC GGTGAAGGAA AATGACGGTG ACATGGAGAC TCAACCGATG ACCGCATTTT CATTTGCTAC TTCTTGCTAT GTGTCAACCA GCCGCAGGGA TCAGCGGACC AGTTCTGGCC CTCTACGTTC TGTTAAGATG TTGAAGTCGC CAGAGACGAG CTTAAGCTCA CTTCTTTTAA AGAATCGGAG CTACGTCGCT CTTTGCTTCG GAGGCATGTC GTTGAACTTC AAAGACGGTT TTTGTTGGGG TTCATTCCCG GTCTTCTTCA AACATGAACA TGGCTTGAGT GACGTGCGAA CTGATTGGTT GATTGCTATC TATCCTTTAT GTTGGGGCAG TGCCCAAGCT TTTACTGGGG CTTTATCTGA TCGTTTTGGT CGCAAATCAT TCCTGGTAGC AGGGGTAGGA TGCTGTGCGG TTTCAATGGT CATCTATGTA CTTCCTTCGT ATTGCTGGGG TGTAGCGGCA GGTTCCAAGC ACTTTCATGT TTGGGTGGCC GCGGATGTGT TGTTAGGATT TGGGACCGCA TTGGCATATC CGGCACTACA AGCTGGTGCG GCCGATGAGG TAGATCCTGC CTACCGCGGA CTCGCACTCG GATTCTACCG CTTCAGTAGA GATATGGGCT ACGTACTTGG AGCGATCGTT TGTGGCCCTC TTACGGATGC GATTGGCTAC GAGGACACAT TTCTGGTGAA TGGAACCGTA CTTTGCCTTG CTTTTATATT ACTGCTTGTG TTCTATTCAG ACGATCAGTC TGAACAAACC TTTGAATTCA GTACTGCCAC AGATGCAACT TTTAAGCCTT CCCCTTTTAC CGCAGGGAAA TCACGAATAC TAACCGCCGG GATTTCAGAT TCATGGTGA
Protein sequence | MAVQIFNETK ASTRLHFVAT FGLAKALANA MSGSLADTLG RKPVLILGCL VGLPVMPYVI VANSWSGVTL MNVLFGLSQG LLGSSLFFLL IDLMGSRRRG IAVGMGESTI YVSTAIVNVL AGRLASVYGF RPVPFYVATS ISVLGLLSTI PLQDTLDQVR AEQTESERIN RKKYARLLST QPSKLDEERI DTPNCTDRSR LDTTNCGIGE TYGSVDEGYV SPWRMSTHTL DTTDKEAVKE NDGDMETQPM TAFSFATSCY VSTSRRDQRT SSGPLRSVKM LKSPETSLSS LLLKNRSYVA LCFGGMSLNF KDGFCWGSFP VFFKHEHGLS DVRTDWLIAI YPLCWGSAQA FTGALSDRFG RKSFLVAGVG CCAVSMVIYV LPSYCWGVAA GSKHFHVWVA ADVLLGFGTA LAYPALQAGA ADEVDPAYRG LALGFYRFSR DMGYVLGAIV CGPLTDAIGY EDTFLVNGTV LCLAFILLLV FYSDDQSEQT FEFSTATDAT FKPSPFTAGK SRILTAGISD SW
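The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons. The sketch below assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is blank. `translate` is a hypothetical helper written for this illustration, and only the first 30 and last 9 nucleotides of the gene sequence are used.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Drop the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Excerpts copied from the Gene sequence field above.
first_30 = "ATGGCTGTGC AGATTTTCAA CGAAACAAAA"
last_9 = "TCATGGTGA"

print(translate(first_30))  # MAVQIFNETK
print(translate(last_9))    # SW*
```

Under that assumption, the two excerpts reproduce the first ten residues and the final two residues of the protein sequence, followed by the stop codon.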