Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47188 |
Symbol | |
ID | 7201965 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 752093 |
End bp | 755112 |
Gene Length | 3020 bp |
Protein Length | 916 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181438 |
Protein GI | 219122198 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCAGGTATC CAGCGGGATG AACAGAAGTA CGAGCGACTG CGACCGTACC TCGACCGAGC CAAGACGGCA AACCTTCACT GAGAATGATT GTGGCCAACA ACAACAGCGG CAGCAGCAGC AAGGGCAAAA ACTCCCAAAG AACAAATCCA AAGAGGCCGA ACCGAATGTA GATGCCCCCC TACGAATCAC GGATGGCACC GACAGTAGCT GCGCCCATTC TTCAATGTCC TCTTCGGAGT GTAGCTCCAA TGTAGGTCTC ACAATCAGCG CGATTGAGAA AACGACTCGG CGAATACGCT CATCACAACA TTTTCCGCTG GCGGAAGTTT CGGTACCGAG CTCGATAGCT TCGCCGCCGT CGGATGCTTC TGTAGCGAAC ACTTGTACTG TGACGGAGAT TTCTGAACCG GGAACTGCAT TGAAGCGAGC AAGCGTTTTT TCGGAGGAAA GCTACCTTGC GCAATCTCTT CCGATGGCAG ACAGTACTCC CAGCAAGCCG CAACACGAGA AAAATGCTGA GGCAGAAATC GACGATGAAT TTGAATCAGC TCTTTTATCA CAGCGGAAGA GCAGGGTCTT GTTCTCTCGT TTGCGAGAAG CCATACCCCA AGGTCGTCTA TCGATAGTAC GTGCTTTTCA GTAATGTCTG TGTATTTCAA AAACTGTTTT TACTCACTAG CTCTCTTGTC GTACAGATCG ACCTGTCTCG AAGGGGTCTC GATGTCTCTC ACGCGTTCCT CTTAAAGGAG GCAATCACTC ACAGTCCACA GCTATCTGTT TTGAAGCTTG CTTACAATGA ATTTTGCGAC GAAGGGACTA CTATTCTCGC AATGGCATTC TGTCAAAACG GGGTACATCA CAAGCATTTG TCGGTAGTAG ATCTGGCTTT CAACGAGATA GGCGATGTGG GGTGCGAGGC GCTGGCAGTA CACGCTATGG CCGGGAACTA CGTATTGCGT GCAATAGACC TTAGTGGAAA TCAGATTGGA GAGCGGGGGG CGCTTTCTAT TGCCGGTGCA ATTTTACATG GCACTGGCTT GTCGCGATTG CACATGTCGG CGAACCGAAT TGGATCTATG GGTGTGCAGG CCGTTGCTGG TGCAATCGCC AATCGAGATT CACGAATAGC TGAGACGGAG GCTGCGTTGA CGGGGTCAAC TGAAATTCAC AGCATTGTTG ATTTGCAGTT AGGAACAGTT CTGATAGCAT CTGGAGGATT CGCTGCGATA CCGGGAATGC TTGTGACAAA TACTGCCCTG CGCTCGCTTT GCGTATCAAA TAACAATCTT GACGATCAAG ATATTCTATT GATGTCGCAA GCTCTGACAC AAAACAAAAG GCTACCCTTG GAAGAGCTAG TACTTTCTTT CAATCAAATC ACGTGTCAAG GTGTTGAGAA TTTAATGAAC TCTATATGGG GATCGACTAC GTTGAAGAAA ATAAAGCTGG ACAATAACCG ACTACGAGAT CGGGGCGCAC AGCTTTGTGC GGTTGTTCTG ACATCAATTC CACTGCAATC ACTTGATATT GGGTGTAACG CCGTGACGAG CGCCGGGATC AAAGCTTTGA TGAAGAATGT ATCCGAGAAC AGTTCGCTAA TTTCACTCGG GCTTGCTGGA ATACACATCG ATCAAAATTC TTCCAAGGCT GTATCGTATG CGTTGGCGTA CAACACTTCG TTGCAAGCGG TTTACTTCGA CAATTCTCAC GCGGGGTATT CTGCTCAGCG ACACATCGTT GCCGGAATTG TTTCCAACCA AAGTGCACCT TTACGATTGC TGACAGGCTT CCCTCTTGCG CGTACGTTTT TGAAATCTTT TTTACTCACG AGCCGTTTTG AGTTCATCTT ACTTATGTTT TGCTTCTGCT ACAGCCATTG CCGTCACCCT GGGAGTGCCA CGGTTGCCGG AGGTTTGGTC GAACGACCAG GTGTTGGGTT TTTTCCGGCT TATGTGGCGC CAGTGGGCAA TTAAAGCCGG ACGCGGAAAC GTTGGAAAAG GCGATATTCC CCGTGGACCA GCGCCTCCAG CGGCGGTTGC GGCAGCTGCC AAAGTCGCTT TTGCTTCGCT GGGAACTGCG CTTCAAACTC TATTTCAGAC GGAAATGTAC GAGAAACCAA TTTCGGAACG CCCCTCGGTC GATCCTTCGG ATACTGCTTT GTTGGAACGA AGTCTATCAG GAACCCTTGA GATCCCAAAA TGTTCCTTCG TGAACGAGGA CGAATTGAGC GAATGGGAAG GAGGAAAGTC GAAGATGGAC GGCACCGATA CACTGCCTTC TTCGGCGCAT ACACTATCCG TTCAGGAAAC TTATGAGAAT TCCGAGCGAC GTAGTCGCAA TTTACGCTGG CTAAGGTTGC ACTTTCGTTC ACTATCAGAG GTTGGGCGAA TTCCTTTCAA CAACGCCGAA CTTTGGCATC TACATCAGTA CTATTTCTCG CCACCGAATG TCGTGCTTCA TGACTGCGAT GGTCTGCATC ACGAGGTAAC GTCGACGCCT GCCCGCGGTA TGGCAGCACC TTCAACACCA AATCAGCAGC CCAGCGCACC TGGAATGGGT CGCGCGATTT CCTTTCAATC GTTGGGAAAT GCATTCTCTG TCTCCCGCTC CCTGTCACAC GCTGGAGGGC ACAAACGGCG ATCTGAAAAG CAATGCCAAT CAGAGGAACA ACCCGCACTG AAACGCCCGA AAAATTCAAA GCCTAGGATC GCCTACTATC CAAGAATCAT GGTGAGTTGC AATATTCGAG TAACTGGGTC TCTTGTGAAA GTTTTTTTGC TCACAAGCCG TTTCAGACCA AGCTACAGGC TCTTGGAAGT AGTCAAGCAG ATCAAATACT AGCGTTGCTT CGACAGTTAA AATTCGCAGA AAGCTTGTTG TTTGCTGGAA AGAGTCTCTA CTGTGATGAA GCTTCGATTG CTGATAACGA AGCTCATTAC AGTGACGTCG AAATGATCCT GTTAGATCTT CTGTAGCGAG TTACTGCGAT CTGTCAATAA TTCTATAACT TCACATGATA
|
Protein sequence | MNRSTSDCDR TSTEPRRQTF TENDCGQQQQ RQQQQGQKLP KNKSKEAEPN VDAPLRITDG TDSSCAHSSM SSSECSSNVG LTISAIEKTT RRIRSSQHFP LAEVSVPSSI ASPPSDASVA NTCTVTEISE PGTALKRASV FSEESYLAQS LPMADSTPSK PQHEKNAEAE IDDEFESALL SQRKSRVLFS RLREAIPQGR LSIIDLSRRG LDVSHAFLLK EAITHSPQLS VLKLAYNEFC DEGTTILAMA FCQNGVHHKH LSVVDLAFNE IGDVGCEALA VHAMAGNYVL RAIDLSGNQI GERGALSIAG AILHGTGLSR LHMSANRIGS MGVQAVAGAI ANRDSRIAET EAALTGSTEI HSIVDLQLGT VLIASGGFAA IPGMLVTNTA LRSLCVSNNN LDDQDILLMS QALTQNKRLP LEELVLSFNQ ITCQGVENLM NSIWGSTTLK KIKLDNNRLR DRGAQLCAVV LTSIPLQSLD IGCNAVTSAG IKALMKNVSE NSSLISLGLA GIHIDQNSSK AVSYALAYNT SLQAVYFDNS HAGYSAQRHI VAGIVSNQSA PLRLLTGFPL APIAVTLGVP RLPEVWSNDQ VLGFFRLMWR QWAIKAGRGN VGKGDIPRGP APPAAVAAAA KVAFASLGTA LQTLFQTEMY EKPISERPSV DPSDTALLER SLSGTLEIPK CSFVNEDELS EWEGGKSKMD GTDTLPSSAH TLSVQETYEN SERRSRNLRW LRLHFRSLSE VGRIPFNNAE LWHLHQYYFS PPNVVLHDCD GLHHEVTSTP ARGMAAPSTP NQQPSAPGMG RAISFQSLGN AFSVSRSLSH AGGHKRRSEK QCQSEEQPAL KRPKNSKPRI AYYPRIMTKL QALGSSQADQ ILALLRQLKF AESLLFAGKS LYCDEASIAD NEAHYSDVEM ILLDLL
|
| |