Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33827 |
Symbol | |
ID | 7198060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 445000 |
End bp | 446238 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178512 |
Protein GI | 219115433 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.373932 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAC GAACGGAAGA GTCTCCGCGA GTCCACGCAC CGTACCCTTC TTGGCATAAC CCAGTCGCCG GATCCGTGGC TGGTGCAGGT TCCCGGATGG CGACGGCACC ACTCGATCTC ATTCGAATCC GTCGGCAGCT CAATGTTGTA TCGTACCCAC GCGAAAGCCT CTGGGGATCC TGGAAGTCGA TCGTCAAGAA TGAAGGGGGA GTATCCGCGC TTTTTCGAGG AAACGTCGCG GCCATATTTC TGTGGATCAG TTACTCGGCA GTGCAGTTTT CCTTGTACAC TCAAACACGA GACTGGCTGA TTCAACACGC GCCAGCAGCA GATCCGGAGG ATTCCGATTC CGAGCCTGCA AAGTACTACA GGTCTGGTTC CGCGTTTGTT GCTGGTGCAA CAGCTGGAGT GTGCGCTACT ATAGCAACCT ACCCCTTCGA CGTATGCCGG ACAACATTTG CAGCCCGTGG AATCCAGACC ACGGGTAGTG CATTGACGCC GTCCAAACCG CCACCCATTA CCCCCAAGGC CACCATTCAC CACATGCCGT TTTCGTCGCT CGTGGAACCC ATGATTCATG ATCGCGGACG GTTTGCAAGC TCACCACCAA AACCCATCTC GACGCCTCCT CTGCAGCCAC ACCCTTCCTT TACTGTAACA CCACCGACGC GATTGTACGA TTTTGTGTGG TACCTTTATC GCCAAAAAGG CATTGCTGGG TTTTATGCTG GTGCTGGACC AGCCGTGCTT CAGATCATTC CCTACATGGG TATCAGCTTT TGGCTGTACG ATCAATTGAC CGCCGGGGAT CGACGAGTTG CGCTCTCAGC GTACGCGGGA TCCATTTCGG GAGCTGTCAG TAAAATACTT GTGTACCCCA TGGACACGGT CAAACGACGG CTCCAAGCCC AGGCCTTTTA CGATAATTCG AGTGCGACTG AAAGCAGGAC AGGAGGGAGC GAGCGCCGGA GGTTGTACTC GGGTCTTCGC GATTGCTTTA CCCGAGTTAT CAAGGAAGAA GGCTGGGCTA GTCTGTATCG AGGGGTTGTG CCGTCTGTTC TCAAGACTAC CATTTCTACA GGACTATCGT TTGCGCTTTT CCGATCCACC AAAAATATCT TGGAAGGATT GCATGAGGAC TGCCCCTCAA TTCAAACTTC AACATGGCGG GAAACGCCAC CCGATGCCAA AAGTTTGGCA TCTACAGATG ATCGAATATC CGACGAGCGC AAGCGGTAG
|
Protein sequence | MEERTEESPR VHAPYPSWHN PVAGSVAGAG SRMATAPLDL IRIRRQLNVV SYPRESLWGS WKSIVKNEGG VSALFRGNVA AIFLWISYSA VQFSLYTQTR DWLIQHAPAA DPEDSDSEPA KYYRSGSAFV AGATAGVCAT IATYPFDVCR TTFAARGIQT TGSALTPSKP PPITPKATIH HMPFSSLVEP MIHDRGRFAS SPPKPISTPP LQPHPSFTVT PPTRLYDFVW YLYRQKGIAG FYAGAGPAVL QIIPYMGISF WLYDQLTAGD RRVALSAYAG SISGAVSKIL VYPMDTVKRR LQAQAFYDNS SATESRTGGS ERRRLYSGLR DCFTRVIKEE GWASLYRGVV PSVLKTTIST GLSFALFRST KNILEGLHED CPSIQTSTWR ETPPDAKSLA STDDRISDER KR
|
| |