Gene Information |
Locus tag | PHATR_33462 |
Symbol | |
ID | 7204037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 989114 |
End bp | 991168 |
Gene Length | 2055 bp |
Protein Length | 684 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186442 |
Protein GI | 219113717 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
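The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information record.
START_BP = 989114
END_BP = 991168
REPORTED_GENE_LENGTH_BP = 2055
REPORTED_PROTEIN_LENGTH_AA = 684


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


gene_length = span_length(START_BP, END_BP)
protein_length = protein_length_from_cds(gene_length)

print(gene_length)     # 2055
print(protein_length)  # 684
```

Under these assumptions, 685 codons minus one stop codon gives the 684 residues reported in the record.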
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGCTC GATATCGACA AGATGCTCGG TCCAACACAA TTCTCTTTGA ATCGACAAAA TGCACGGAGG TAAATTCCTC GCAGCTTCTG CCGCCCTCAA AGTTGACTTT AAATCAAGCT GTTGAAGAAG CCGAAGTTTT GCGCCAGCGT GCGAGAGGTC TGCTGGAGGA GGTCGAGACC ATGGAAGCCG CGTTGAAAGA GTCTCGATCG CGGAAAGCCA ATAAAAAAAT ATCAGAAAAG ATGCATATGG TTGATAAGCT CTTCTTGAAT CGGCCGCTGA CACCGGAAGC GGTGGCGCGC GTTATGCGAG AAGAACGCTG GTCAGCCGAT CAGGCCTGTC TCGTTTTGGA TGCCTTGTAC GATCGGTTGA CGCAAGGATC TAGCACTTTA TCACCTGCCG AACTAAAAAT CGGCGATCAT CAAACAAGGG AAGGAGCCAA CCACACCGAG GTGTTGATTT TGGAATCTTA CATTGATTGC TTGATTCGTG GCGCGGAGAA TTTAGATCAA GCATGGAGCA GTCCCTCCGA AAGCCGCCAT CCTCGATGGT CTGGTCGGGT AGCGTCCACG CTAAAATCAC AACTCAACGA ACGTTATCGC AGAGATGCAG AAAATTTTCA ATTGCGAGTA GCAGCTGGGG CTAACGACGC TGTTCTAAAC GCAAACGAAA CCGTACAGGA GTATATGCGG CGAACACTTG GCTTACCGTC TTTAACGGAG GAAATGTTGG GAAAGTCTAG AAACCTCACA CATGTCATGA ATCGTATTTC GCTGATACCA GTGTGGATAC CATCGTCTTT GCTACAATTC GTGACTCGTA CCAGAGCCAG AATCAGCCCC GCGGACGTTA AAGCTATCAA GAATGAAGTG TTAGTGGAGA GCCGCTTCTT TTGTACGTCG TTTGATTCCA TACCAAGCGC TGCGATCTTT CGAGGTAATT TAAGACCGCC TGTAGGGCAT ACTGAAACAC GGAATTTGCC TGCCGAGTGC TTCAGAGCGA TTCAGCAAAA AATGGATGAA CGAGGTCTGT CGGAAAGAGT TCAGCTCTTC CTCATGCCAG ATCCAGAGTG GCGACCAACC AGAGACGTGA GAGATTCGAA ACCTAAACCG GTAATTTTAG CTATCCCACA AGACATTGGC CCATCGAGAC CGGAAAGTGT GGACTGGAGA CGATTCGCAC TCAAATGCTT TGCTGTTGGA CTGAGCGCAT TAACGACTTA TACATACTCT GTCAGCTGTT TTGCACTGAA TCCATTTTTC TTCGAGTCAA TTGTCAGTCG AAATGACGTG TCAGTGCTAA GAGTCTGCCT CCCAGTGTTT GTCGGGGTTG TGGCTGTACA ACTTGTTCAC GAATTAGCGC ATTACTTTGT TGCGAAACAG CGTGATATCA AGATAGGATT ACCCACTACC GTTCCTTCGA CGCAACTGGG TACGTTCGGC TGCGTAACCC CTTTAAAATC CTTTCCTACA ACCCGCGAAG CCCTTCTCGA TTTTTCCCTG AGCGGACCTG TCGCTGCAAT TTTGATGTCA ATCATCATGA TGAGCCTTGG TATCTCCGCA ACCCTCAATG CGTCCGCAGC CACTATCTCA ACCTTTCCAA CGGTTCCTCT CACTATGTTA AAATCCAGTC TTCTAACCGG AATACTATTG AGCGTGCTAG CACCAAAAGT TATGATGATG CCCCTGCCTC AACCTATTCC GCTACATCCC ATTTTCTTCG CTGGGTTTGT CGGACTCATT TCTTCTGCCC TGAACTTGTT ACCAATAGTT CGCATTGATG GCGGACGCGC CTGTACAGCG 
GCACTGGGGG GCCGTGTGGG CGCCTTCGCC TCCATTGGAA CCGCCATGTT TTTGTTGTCG TTCCTGGCTT CTGGAAGTTC TGGTTTGGGC CTAGCATTCG GATTGTTTGT GGGAATCTTT CAGCGCCGAC CTGAAGTCCC CGTGCGAGAT GAAGTAACCG AAGTCGGTAG GTTCCGACTC GGGGCATGGG TAGTTTCCGT GGGAATTGCC GCTTTTTCCT TAATGCCATT TCCAGGATGC TCCGGAATTC TTTAG
|
Protein sequence | MHARYRQDAR SNTILFESTK CTEVNSSQLL PPSKLTLNQA VEEAEVLRQR ARGLLEEVET MEAALKESRS RKANKKISEK MHMVDKLFLN RPLTPEAVAR VMREERWSAD QACLVLDALY DRLTQGSSTL SPAELKIGDH QTREGANHTE VLILESYIDC LIRGAENLDQ AWSSPSESRH PRWSGRVAST LKSQLNERYR RDAENFQLRV AAGANDAVLN ANETVQEYMR RTLGLPSLTE EMLGKSRNLT HVMNRISLIP VWIPSSLLQF VTRTRARISP ADVKAIKNEV LVESRFFCTS FDSIPSAAIF RGNLRPPVGH TETRNLPAEC FRAIQQKMDE RGLSERVQLF LMPDPEWRPT RDVRDSKPKP VILAIPQDIG PSRPESVDWR RFALKCFAVG LSALTTYTYS VSCFALNPFF FESIVSRNDV SVLRVCLPVF VGVVAVQLVH ELAHYFVAKQ RDIKIGLPTT VPSTQLGTFG CVTPLKSFPT TREALLDFSL SGPVAAILMS IIMMSLGISA TLNASAATIS TFPTVPLTML KSSLLTGILL SVLAPKVMMM PLPQPIPLHP IFFAGFVGLI SSALNLLPIV RIDGGRACTA ALGGRVGAFA SIGTAMFLLS FLASGSSGLG LAFGLFVGIF QRRPEVPVRD EVTEVGRFRL GAWVVSVGIA AFSLMPFPGC SGIL
|
| |
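The gene sequence above is listed in coding orientation (it begins with ATG and ends with TAG, although the gene lies on the minus strand), so it can be translated directly and compared with the protein sequence. The sketch below translates only the first 30 and the last 15 nucleotides copied from the record. Because the Translation table field is empty, the standard genetic code (NCBI translation table 1) is assumed here, and the `translate` helper is a hypothetical name used for illustration.

```python
# Standard genetic code (assumed; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-nt blocks and last 15 nt of the gene sequence in the record.
first_30_nt = "ATGCATGCTC GATATCGACA AGATGCTCGG"
last_15_nt = "TCCGGAATTC TTTAG"

print(translate(first_30_nt))  # MHARYRQDAR
print(translate(last_15_nt))   # SGIL*
```

The two outputs match the start (MHARYRQDAR) and the end (SGIL, followed by the stop codon) of the protein sequence in the record.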