Gene Information |
Locus tag | PHATRDRAFT_48297 |
Symbol | |
ID | 7203727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 50189 |
End bp | 52153 |
Gene Length | 1965 bp |
Protein Length | 586 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182887 |
Protein GI | 219125227 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.469884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATGAATTCG ATTCAACCAT CACAATTCTT GTACATCGAG TAGTCGATGT CCGTTATATC CTATCTCTCA CCATCCCTGT AGCTTTATTG ATATTGGGAA ACGAAAGTGC TGCGACTCGA GGCTCTCGTC CGGGCTTCGT TTTGAAAAGT TGCAATCGTA TTTTCATCTG GACTGATCAC GTTCTTTTTG ACCCGTGCGT AGTAATGAAT ATTACAAGGT TCATGACCGT GCATGTTCCT AACTTACAAG TCTTCCAACA AGATTCAATC ATGACAACTA TTATTCTCGC TTTTGTTTTG CTTTTCCTTG CGTCGTCCGT CTCTAGTTTA TACCGAAACG CTGAAGCATT CAGAGCATAC ATATCCTCAC GAGCACCACC TGCTTCTACG GTATCAACAA ACGCTAAACC CAAGCCTTCT TCTTTAGACA ACTCTCTACG TGGATCTATC CAGTGGTGGG AAGATTTAGC TCCACCATTT CACGATGCAT CCGAAGACTA CGTTGATCTC GACGCTGAAG TTACACATTT CCTCTTTTTG CTTCACGGAC ACAGGGGTTT TTCGAAGGAT CTGTCTTACT TGCAGTCCGT CATGCAGCAA GTGGCTACTA TAGAAACTCG GGAAAGCATG CGTGGAGCAA ACTGCATGAT GGAGTCGGAT GAGGAGACCA AGCCGATCAA TGAGAGCAGT AACCCTAGAT CAACGACCGA CAAGGTTCGC CGGTCGAATG GTCGTCAAGA AATGGTTGTC CACTCGGCCA CATGCAACGA GCGTAAAACT ACTGATGGAG TCGAGAAGGG AGGGGAGCGC TTGGTAGAAG AAATGCTGAC AACTATTCGT GAGCAGATGA AACTACGACA AGATGACAGA CCGATCAAGG ATATCACCAT TTCCGTGTTG GGAAATAGTT TAGGGGGGAT CTATGGACGC TATGCTATTG CGAAACTGAC TCGACATTGT GATGAAAAAG TAGATGGATC TTGGCTCCTA GACAACCACT ATCGGATCTA CTTTAACATC TTTTGTACAA CCGCTACACC GCACTTGGGG ATTGCTGGTC ACACTTTTTT ACCGATTCCC CGCACAGCCG AGATCGGAGT TGCGCACGCC ATGGGAGATA CAGGTAGGGA TTTGTTCCGG TTAAATGATC TTATGAAGAA AATGGCGACA GATCCGTCGT TCCTAGGGCC ACTGAAACGT TTTCGCAAGC GCATCGCTTA CGCCAACGCA TATGGAACAG ACTTTCCCGT CCCAGCACAA ACTGCCGCCT TTTTATCGGA TACGAGCTCA TACCCCCACC ATTTTGCTGA AGCTACACGT GACGACGATC CAATCGTTGT TGATGACAAC GGGCTGGTTG TTGCCACATT GCATACGCCT CCTCGACAAC TTCGAGGTGA CCTTGCAGAG ATCAAGATGC TCGACATGGA TCAAAATGAT GCGGACGATT TGGCACGTAT GTCAATGTCA TTGGACGCCT TGGGTTGGAA AAAAGTATTT GTTGATGTCA GGAAAGAAAT TCCCAACATT TCCGTACCCA AAGTCTCACT ACCAACATGG AGAAACAATT CAGCTGCCGC AGGCAGCAAT GAGCCCAACG GCCGGAGTTC TGAAGAGAGC TGTACGTCCG ATGACGCTCC TATCAATGAA GATACAGAAG AAGAAATGCG GGTAGAGCAG GCTCTGCAGC GACTTAAGCA AAGAGGTGTT GTATCGTCCC GAGACGTTGC GGCCGCAGTG ACTGCCCCCT TGTTTGATGA AAAAGTTTAC TGGCCATCAG GTCACAATAT GATTGTTGCT 
TTTTCGCGTT CTCGGCTGAG CACTTACATG AACAAAGCAG GACGACCAGT TGTTGATTCG CTTGCCAAGG AGCTTGTCGA AGATATCTTT TCCTGGAACG CTTTATCGAC GGAGGCAAGC CCCTCTATTC AATCTTTCGA GAGTACTCCT GAGCCGAAGA CTTAG
|
Protein sequence | MNITRFMTVH VPNLQVFQQD SIMTTIILAF VLLFLASSVS SLYRNAEAFR AYISSRAPPA STVSTNAKPK PSSLDNSLRG SIQWWEDLAP PFHDASEDYV DLDAEVTHFL FLLHGHRGFS KDLSYLQSVM QQVATIETRE SMRGANCMME SDEETKPINE SSNPRSTTDK VRRSNGRQEM VVHSATCNER KTTDGVEKGG ERLVEEMLTT IREQMKLRQD DRPIKDITIS VLGNSLGGIY GRYAIAKLTR HCDEKVDGSW LLDNHYRIYF NIFCTTATPH LGIAGHTFLP IPRTAEIGVA HAMGDTGRDL FRLNDLMKKM ATDPSFLGPL KRFRKRIAYA NAYGTDFPVP AQTAAFLSDT SSYPHHFAEA TRDDDPIVVD DNGLVVATLH TPPRQLRGDL AEIKMLDMDQ NDADDLARMS MSLDALGWKK VFVDVRKEIP NISVPKVSLP TWRNNSAAAG SNEPNGRSSE ESCTSDDAPI NEDTEEEMRV EQALQRLKQR GVVSSRDVAA AVTAPLFDEK VYWPSGHNMI VAFSRSRLST YMNKAGRPVV DSLAKELVED IFSWNALSTE ASPSIQSFES TPEPKT
|
| |
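The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions, and that the 586 aa protein is encoded as one contiguous reading frame ending in a stop codon (the record lists the gene as not spliced). The function names `feature_length` and `coding_length` are hypothetical helpers written for this illustration, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if start_bp < 1 or end_bp < start_bp:
        raise ValueError("expected 1 <= start_bp <= end_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
gene_bp = feature_length(50189, 52153)
cds_bp = coding_length(586)

# Gene length, coding length, and the remainder not accounted for by the protein.
print(gene_bp, cds_bp, gene_bp - cds_bp)
```

Under these assumptions the coordinates reproduce the listed gene length of 1965 bp, while the protein accounts for 1761 bp, so about 204 bp of the listed gene sequence would lie outside the coding frame.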