Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43576 |
Symbol | |
ID | 7197455 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 854421 |
End bp | 856188 |
Gene Length | 1768 bp |
Protein Length | 483 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178021 |
Protein GI | 219112539 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.389982 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGTGATTTAC GACTAGTGTG GCACTTGCGT TGCATTGAAA GTCAGCATAA ACCTTGAACG ACAAGTTTAG TAAAGAATCG GACTTTTCGC CTACACTAGA GCTTTAATGG AAGGAACAAG TACACTAGCA GACTCGCTGT TGGACGATCT CGACGACCTT TCCGACGTCG AAGACGTCGC CGACGATGCG AAAAACATAT TGCAAGGCGA AACAGAAACG TCGCAAACCC ACAAGGAATT TACTGGACCA AGAAGAAAGC GATTTCTGGA CGATCCGAAT TTGCTGAAAC ATGTCAATGA GGTGAGATAC AATGAAAAGA ACAATATGGA GAAGCCACAG AGCAAAGACC AGGATGCAGA AGAGTACCGA CTCATCGTGC AGTCCAACAA GCAACTCGCG AACTTAGCAA ACGAGCTATC TCTGACCCAC ATCGACCTCT GCACCGCCTA TCATCCGAAA TTTCCGGAAT TGGAAGAGCT GTTGCCGGGT ACTGGTCAGT ACAAGGATGC TGTTCGTGTG ATTCGTAACG AAATGGACAT GACAAAAGTC AACGAAGGTC TGAATGCCTT TCTCAACAGT AATCAGATCA TTACGATTAG TGTATCTGGG AGTACAACAT CAGGTCGTAT GCTTTTGGAA GAAGAACTAG TTGTCGTCGA CAAACTGATC GAGTACATGG ACCAAATTTG TGAGCTGCAA TCTGAATTGA ATCTGTTTGT AGAATCCCGC ATGGAAGGCC TCGCACCAAA TGTGTGCGCA TTGATCGGCC CTACAACGGC CGCCAAACTT TTAGCACTGG CAGGAGGTCT AGCCGAGCTG TCAGGAATCC CAGCCTGCAA TCTTCAGGTC CTAGGACAGG TCAAACAGAA CGCGGCCAGT CGTGCCGGAT TTTCGTCAGC CACCACAAGG CCCCACGAAG GAACCTTGGC CGAATGCGAC CTAGTCCGTA GATGCCCTCG GCACCTTCAA AAAAGGGCTC TCAAGATGGT TGCGGCCAAG CTCGCGCTTG CAGCGCGTTG CGATTACGTC AACGTTAATT CTGGACGGGA GAGAACCCCG ACATCGGGAC GACAGTTTCG CTCCGATATT GAATCCAAGA TTTCTCAGTG GCACGAGCCA GACAGGGCTC AAGTTCTAAA AGCATTACCA AAGTAAGTGT CCCAAACCGC CCCACAAGTG TTCTTTTTGA AACGCATTTC TCACGTCTCT GTCTCCCGCC TTCCTCGGGC TGTTAACTAC TTGTAGTTGC TTCGAGGAAT CCTGGTCGAC AGAGACGAGG ATGCTCCGTT TGCATATGCA TACGCGTATC TGAGCTGGTT TCTCTCCTGT TTTAGACCCG ATCTTACGAT AAAGAAGCGT CGTGGCGGCA AACGAATGCG GCGCCTTAAG GAACGTTACG AAGAAACAGC GATGATGAAA CAGGCAAATA CCCGGGCGTT TTCAGCGAAA GCCGGAGAGT ACGGCGATGA CGCGATGGGG TTGAGTTTGG GGCTTTTGGA TAAATCTGAC GTCACAGCTA GTGGGAGCTT GCGCAAAAAG ACAGAAAAAC GCAAATTGAG AGTAGCGAAT ACCAAGGCAT CGCGAAAGCG CGCCGAGCAG ATGAAGGCAA CAACGAATAC AAACGGATTG GCTAGTAGCA TTGCCTTCAC GCCGGTTCAA GGGATGGAAT TGGTGAATCC TGATGCTAAT CGAGAACGAT TGCGAGAGGC AAATAACAAA TGGTTCAGCA ATAACGCAGG GTTTCAATCG GCGCTGCCGA AAAAGTAG
|
Protein sequence | MEGTSTLADS LLDDLDDLSD VEDVADDAKN ILQGETETSQ THKEFTGPRR KRFLDDPNLL KHVNEVRYNE KNNMEKPQSK DQDAEEYRLI VQSNKQLANL ANELSLTHID LCTAYHPKFP ELEELLPGTG QYKDAVRVIR NEMDMTKVNE GLNAFLNSNQ IITISVSGST TSGRMLLEEE LVVVDKLIEY MDQICELQSE LNLFVESRME GLAPNVCALI GPTTAAKLLA LAGGLAELSG IPACNLQVLG QVKQNAASRA GFSSATTRPH EGTLAECDLV RRCPRHLQKR ALKMVAAKLA LAARCDYVNV NSGRERTPTS GRQFRSDIES KISQWHEPDR AQVLKALPKP DLTIKKRRGG KRMRRLKERY EETAMMKQAN TRAFSAKAGE YGDDAMGLSL GLLDKSDVTA SGSLRKKTEK RKLRVANTKA SRKRAEQMKA TTNTNGLARM ELVNPDANRE RLREANNKWF SNNAGFQSAL PKK
|
| |