Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47001 |
Symbol | |
ID | 7202235 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 144562 |
End bp | 146662 |
Gene Length | 2101 bp |
Protein Length | 534 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181146 |
Protein GI | 219121589 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTACGTACA AGCTAGTCGA GCACCCCATC TCCTTACACA CCCCCATTCT CTGAGTCGTG CACCCGCGTC TACAAACAAT TCGTTTGTAT CGAGTACCAG TGTTCTAAGC ACCCGTACAT AGTTTCGAGA CTACAAGGAG ATTGTAGCAG GTTGTTCGAA AGCAATAAAG CAAGGACATC CTTTTTCTTC AATCCAGGCG CCCTCTGAGT TGGTGACTAC GAGCGATCAG TCCGGGCGCT TTCAGGAGTA GGACAATGAG CCATTCGGTG GCTGGCTCCT CCGTGGATGT TTCGTCCTCG AATTCGGTGG CTTCTTCAGG TGACGCTTCT CCGACGTCCG GCTCTCGTAT TTTGAATCAA AGCACTCCTT CCAGCGATGC TTCTACGGTG GCTTCAGTTC CCTCTGTAGA ACCTCAAAAT CCCTTTGAAA TGTCGCTAAA CGACACGGCA GTGTCTGCTT CCGGAGCCGC CGAAAAGACT AATCAAAATA CGTCTTCTCC GGCTCGAAAC GGTTCAAATC CTCCTTCTCC GATGCATGCT CATGCCTCTC ACCAGACCAC TCGCTCGCAT CCTTCTCCGG TTGGAACTCC GGTTTCGACG ACCCCGCTAC GCGAACCACA TTCGCCGGGC TATCATCAAG CGCAATCGTC TCCCCCTCAC GCCATCACAG GAGTCGGCAG TTTGACGGTA CCCACGTTGT CGCAATCGGA TCAAGCGCGG TCTCGTATTC AACAGCAGTT GCCGGGTCAT TTGCAGACTT TTCCTAGCCC TGCTCAGCAC GGCGTGCGCG AGCCCGTATT TGATGATGAT GAGAATACAG AACCGTCGTC CACCAACAAT ACTGCTGATC AGCAGCATCA GGAGGTTGGG CACGGCTCTT CTTTCATTCG CTTTCTGGAA AATACTCGTA AGCGCTTGAG TGTTGCCAAC AAGGATGAAG ATGCAGAAGA TCCTTTGGAA GACGAGGAAG GCTTGGACGG TGCTTTGATC TATGGTTATT TGCAAAAAAT GGGACGCAAC GGCAAGTGGC AAACACGCTG GTTTGAATCG GATGGCGAGT GTTTGTCGTA CTTCAAAAGC AAGAAACGTA CAAAATTACT GGCCACGTTG GATCTAGAAA AGGTAAGTTT GTGTGCTCAT TTATGTTAAC GGTGAGTGAT TATGGAACTC CGGACCCTCA CAAACCATGG CATGACATTG ATTTATGTAG GTTGGATCGA TTTGTATTGA TCCACAAGAT CCACAAGGTT GCTCGTTTAC CATTCAAGTG TTGGGTCGAA TGTATCATTT GCGTGCGAAC AGCAAAGCCG CGACGAAGGA CTGGGTAATT ACACTGAACA GGATCAAAGA AGCCAAGATG CAGCAAGGGC ATATTCATCT TGTCAACCCT TACGAACAAC AGCCGCAGGA TCTCTTGGAT AACCACGAAG AGATAGTCGC GCCTCGGGTT GTCGTGGTGG CCAATCGGGA ACGGACACGC GCCGTTGCCG AGACCATCGA TTTTGACCAG CTTATCCGTG TTGACCAGAA TGGTGAGAAT CGTGAGTTGA CCTATGACAA TTCTAAACGG CGTTCGACCA TTGGAACTGT GGTTTTGGGG CGCTGGACAA AGCGTCGTTC GTCGCTTTCT CGCCTCAGTG CCAAGTTCTC CAAGTGGGCC CGTAGTCTGA AGAAGTACAG CTGTACCGAA TCAGGTACAG AAAATGTGCA GCTCGATCGC TACGTTCATC CTCCTGGTCA TGATGACATA CCGAAGCGTC GACAGCCAGA TTCTGGTCCA AAGCTCGCTG CGGACGCAGA GTCGAACCCT GTAAGCGTTT CAGGGTGGAT TGGCAAGGAG ACGTCCCGGT CAGGACAGGC TGGAAGCGGA TCGGCAGATG TACCCCAACC AACACGTGCC GTCCGTAGCA TGAGCCAAGC ATCCGACGAT GTTCGCATGC TATCGTAGAA GGCCGATGCG TCCATGATAG AGAGGATTGC AGTGCTGAAG CAGGTCGCAC ATTCTGCGAA AATGCTCTTA TATGTTTTTT ATATTTCGTG CGAGAAGAAA ATTGATAGTG GTAGGGTTGA ACGTATTCAG TTTCTTGGAA TGGCATATTT T
|
Protein sequence | MSHSVAGSSV DVSSSNSVAS SGDASPTSGS RILNQSTPSS DASTVASVPS VEPQNPFEMS LNDTAVSASG AAEKTNQNTS SPARNGSNPP SPMHAHASHQ TTRSHPSPVG TPVSTTPLRE PHSPGYHQAQ SSPPHAITGV GSLTVPTLSQ SDQARSRIQQ QLPGHLQTFP SPAQHGVREP VFDDDENTEP SSTNNTADQQ HQEVGHGSSF IRFLENTRKR LSVANKDEDA EDPLEDEEGL DGALIYGYLQ KMGRNGKWQT RWFESDGECL SYFKSKKRTK LLATLDLEKV GSICIDPQDP QGCSFTIQVL GRMYHLRANS KAATKDWVIT LNRIKEAKMQ QGHIHLVNPY EQQPQDLLDN HEEIVAPRVV VVANRERTRA VAETIDFDQL IRVDQNGENR ELTYDNSKRR STIGTVVLGR WTKRRSSLSR LSAKFSKWAR SLKKYSCTES GTENVQLDRY VHPPGHDDIP KRRQPDSGPK LAADAESNPV SVSGWIGKET SRSGQAGSGS ADVPQPTRAV RSMSQASDDV RMLS
|
| |