Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33046 |
Symbol | |
ID | 7197274 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1397495 |
End bp | 1399589 |
Gene Length | 2095 bp |
Protein Length | 670 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177821 |
Protein GI | 219112139 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.531262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCGA GCAACTTGTC GCAGCGCAGT CTCGACTGCC CTTGTCGGAA TTTCTCGAGC TTGGAACGTA TTTCTCAAGT GCTCGCATCG AAAAGCCAAA CACAAACTCT GATCGATTGG TTGGCAGGGA AAAGCGACGA ATCTTCTTTT GAAGAACGAC GACCTTTATC TTGCAGCGAA AAATATCGCT GCAAGTTTCG CGCAAATACT TCGCTATTCA TATTGAACGG AGTCCGCGAT GCTTGTCGGC CGTTTTTGGA GTCCGTTTTG GTCTCAAACG AAGGTCCTGC GAACCCGCGA CCCTCTCCAA TCGTGCCGTA CGAAGATGCC TTTCCAACTT TACTTTCTAC CCCTCCTCCA AATCTGCCTT TTTCTTCCAA AAGCCTTCCA ACACACTCTT TCCAATATGC TCGTTCAGCC ACTGACAAAG CTGAGAAATC AAAACCAAAG CGGCGTGTTC GACCTTTGAC AATTTCAACG TCCGGACCAT CGGCTTGGGG CGAAGGAAAT TTGGTGACTG CGTCACTGCC AGTCAGCACG ACTTTGCAAC AGTGTGATGT ATGCCATCCA AAACTACCGT TAGAAATGCC AATCCCTCAA GGTAAATCAA TACCGAAGTC AGCGCGGTCT GGCGCCACGC GGAAAACACC GACAAAGCAA GAGAAAAGCG CCCACCAACG GATGGAACAA ACCAGCATGA CTGAGGTGCA AAGGCTAATA GAAGTATACT GTGAGCTGGT GAAGAGTGCA CTCGTCCCCT CGACGGCGTT GGAGTTGCAT TTGCTGTGTC GTATGTTGTC TGTTCCGATC AGCGATCTAT CGAATCCTCA GAGCTTGCAG GATTCCAGTT TTTCCTGTGT TTTCTCCTCA GCGACACAAT GTACATATTT TGCGAGGGAA TGTTTGCTTC GCTTGTCTGG TATCCTCCGT GGCTACGGTA AATCCTTTCT GATAGAACTT GTTCGATGCG CTCCATTCCG TGTTCACCTT CCCGATATGG TTTCGGACTT TGAAGCACTC ATACGGCTCG ACGCGACGCG TGTAGATGCG TCTCTCGACT TTTCACCAAA CAGCCAAATC CCTTTGTTGA CCCTTCCTTT CAATGAGGAA AGAGACTCGC GACACAACTA CAAGACTCGT GAGGAACAAG CCCTTTACAA GAACCGAGAG GAAAGCCGCG ACGCATTTCT CTACCAGTTT AGATCCTTTT TAAATGTGAG AGGGAAATTA GTTGATACAG GAGCCGCGGA AAAAGAAATT AAGAAAATCG AACGGTCTTC TCGAACTGTC GTAAACGGAG TGATGGGCGA CAACGTTCCT TGGTTTGCGG ATTTTTTTTG CGATTTATTG TTACAGACTG GTCTAGTACC TCTGCAAGAG ACAGATACAG ACCTTCTTCG TATCGCCGGT AAGGAAAAGC TCCAAAAACT GCACAAGCGC TTTGGATCCA TGGTTCCTCT TACTGAAAAA AGTACCAAGA AACTCGTTGC CGAGCGGCAC TTTAGAGAAT CTATCCCGGC GGTGGCAGCA CAGCAGTTCT TTCCTGGCCA TCAGGAGTTT TTCTTCCTTT TTCTTATGTC CGCCGACTCC TTCATCTTTG GTTTGCATTT ACGCCGAGTA TTGGCCCAGA ATATTAAGAA ACTCGCTGCC GCTACAACGG TAAGGGATTT TGAAAGGCAG ATATTGAAAA TGCAATTGCT AGGTCGTTTT GTTGGTGTAC TCTTCTTCGC TCCGAATTGG GTTTCATCGA CCACAAAGCA ACAATCTCCG TCTGTACTTT TATCAACTGC TCTGTGCGAA ATCTCAATTG CTGGTTTACC AATGCTAGAA ATGCTCGGTG AAGCGTGCCA AAATGGTAAC CTCGTGAGTT TCGTTCCATG GGTGGTTGAA GTTCTTAAGA TGTCCATCTG GGACAGAGGG GCGCGCAACA GTTTTGAGGT GCGTCAGCTT CTAGCGTATT TGCGGCAGAT ACAATTATTG TATTGCAGAA GGGATGAGAG AACTGAAACA ACCAATTCAG TACGCGAGCT CATTTTTGGT AGCATCGAGG CTATGCTTGA TGAAGTTCTT GGTCTTGGAC GGACCACGAG TCTAG
|
Protein sequence | MKSSNLSQRS LDCPCRNFSS LERISQVLAS KSQTQTLIDW LAGKSDESSF EERRPLSCSE KYRCKFRANT SLFILNGVRD ACRPFLESVL VSNEGPANPR PSPIVPYEDA FPTLLSTPPP NLPFSSKSLP THSFQYARSA TDKAEKSKPK RRVRPLTIST SGPSAWGEGN LVTASLPVST TLQQCDVCHP KLPLEMPIPQ GKSIPKSARS GATRKTPTKQ EKSAHQRMEQ TSMTEVQRLI EVYCELVKSA LVPSTALELH LLCRMLSVPI SDLSNPQSLQ DSSFSCVFSS ATQCTYFARE CLLRLSGILR GYGKSFLIEL VRCAPFRVHL PDMVSDFEAL IRLDATRVDA SLDFSPNSQI PLLTLPFNEE RDSRHNYKTR EEQALYKNRE ESRDAFLYQF RSFLNVRGKL VDTGAAEKEI KKIERSSRTV VNGVMGDNVP WFADFFCDLL LQTGLVPLQE TDTDLLRIAG KEKLQKLHKR FGSMVPLTEK STKKLVAERH FRESIPAVAA QQFFPGHQEF FFLFLMSADS FIFGLHLRRV LAQNIKKLAA ATTVRDFERQ ILKMQLLGRF VGVLFFAPNW VSSTTKQQSP SVLLSTALCE ISIAGLPMLE MLGEACQNGN LVSFVPWVVE VLKMSIWDRG ARNSFEYASS FLVASRLCLM KFLVLDGPRV
|
| |