Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49890 |
Symbol | |
ID | 7198600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 207061 |
End bp | 209908 |
Gene Length | 2848 bp |
Protein Length | 811 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184670 |
Protein GI | 219128964 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.146051 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTGC CATTGGGTTT TCGCCGAACG CGCTTATCGC TTCTTTTTCT ATGTTGCCTA GCTTTATACT TTCAGAGTAA GGCGTTGGTT AAAGTTTGGA ACAACGATGT GCCGAACTTG GACACCATAC TTGGATCAAA TCCTTGGCTA TTCAACGACC AAAGTCAAAA GTCCACTTTA TCCCTCCACG AAGCTGGAGT CAGCCGAGGG AATACAAGGA ACGCTAGACA CAAGACTACC AACGTAATAA ATTCACCATT CGTCGCAGGC ACGACCACAT CTAGTCAAGC GGCGAGCGTA ATCGAATCAC TGTTGCTCAC AGATCAAGGC TTTGGCGCGT GTTTGATAAT GAAAGACGAC AATCATTGGC TGATCGAATG GATTGCCTAC CACTACCACG TCTTGCCTTT AAAGCATTTA ATTGTGTTGC GCGATCCGCG AAGTCTCACA CCGTCCCAGC ACATCTTTGA CCGTTGGAAT CAAACCTATT TGGAAATAGA GGAATGGCAG GACTCTCGCG TCACTCCGAT TTCTGTGTAC GAGCGAGGTA CCAAGAAGAA AAAAACGGAT CTTCAGTACC ATTTGACGCG ACAGCGCTTC TTTTACTCCA AATGTTTGAA ACATCTGAAG CAGGAGCAAA CGGTGCAATG GGTATTGCTG ACAGATACAG ACGAGTTTGT GCGTGTGAAT TCTTTGTTGT ATCCACTGTC GGCTTCCGTG TTGCAACAAC GGGGCCACGT GAGTGCATTT TTGGCCCAGC GAAAAGTGCA GGATCCGTGT TTACAGATGC CCCGCATCCA AGTGTCTTCG ATACGAGCTA GTGAGAGTAG TATCAATGAC CAAGGAAAAA TCGATGCGTG GAAACGGGCT GAATCTATTG AGGGGCTGAA TGCGTCTGAT TTCCTGACGT ATCAATGGCT CGTTCACAAC AACCAGGCAA TGTCAGTGGG GAAGAGCACA ATGTACTTGC CTACGCTGAA AACGAATGAC ATTCCGTCTT TGGCAGATTC AGTGCATCGT ATTGTGTCGA GTGTGTGTCC GGAGCACTCT GAGTCGTTGC TGAACAGCAC GAGATCGTAT CTTTCCGTGA TGCACTATCT CGGAACTTAC GAACAGTTTA GCTACCGGAC TGATCCTCGA GACTATGACG CAAAAGATGC CGAACGACCT CAAAGAGAGT TTCAGCATTT GAAAAGCGCC TCAGTTAAGA GTCGTGTTGA TACCTGGTAC CATCGAGGTC GGAAGCCTGC AGCGTCCACA TTGGATGTGG GCATGACGCA CTGGCTGCCT GGGCTGGTGG AGACAGTGGG CTTGTACGAG GTAAAGAAAC TACTGGAAAA TGTGGGCGTT TTGGAGTCGA TTCCAGTAGA AACATCACAA CACAACAAAT CGAAGCGGCA ACCAGAAATT GATGTGGAGG AAGAAGTATT TTCTGCGTGT CTGCTGACAA AGGACGACAA TCACTGGCTG ATTGAGTGGC TAGCGTATCA CTACTACGTT TTCAAACTAC GAAGGCTCAC TGTGGTGCGT GATCCGACTA GTCGAACTTC GATTGAGGCT ATTTTGGATC GCTGGAAGGA TCGCATGTCT ATCTCTGTGT GGAACGATGA GCAATTCGTT CCTAGTTGGA TCCTACAAAA GCACAAAACG AGAGGAATCA GTGATACGTT GCTCCACCGT TACCGGCAGC AGTTCTTTTA CAGTTCATGC ATGAAAGACT TTAAGCAACG GGATCGGAGC TGGCTAGCTC TGGTAGATAC TGACGAGATC CTGCGGCCAA ATCCGTACGT CCTGACAGCC CCGATAGATT TGAAAGTGGA GGGTATTGGC TTGACACTGC TTGAAGAGCA ACAAAAGAAG CGAGCTATGA GGGCTGAAGA CGGACCTCTG AAGTGCATAC ATGTGCCACG ACTGCAGATA GTGTCGACGG AAGCGAATGC TTCCACTGTC AGCTCGGATA TTCCACTGGG ATTGAACGGG TCGGACTTTC TCACCACGCG GTGGTTGTAC CACAACAACC GCGAAATATC GATGGGCAAT AACTTGGACG GCAAGAGCGT TTTCAATCTT CAGTGGTTGG ATGAATCTGC AATTCCCAAA AGAGCGGCGA ATGCTCATTA TATCATTCCG GGTGTGTGTC CGGAGACGTC AGGTGACCGG CTCGACCACC CTGATAGTTG GCTGTTGATT CACCACTACT TGGGTTCTTT AGAGCAGTTT GTATCTCGAG ACGACCCCCG GAACAGCATA GAGGGACGTC CGAAGCGAGA TGCAAGTCTG TGGCGGCAAA CGGGTCAGAG TCCTGCAGCA GACACGTTTG ACGATTCGTT ACGGTCCTGG CTGACTGGGT TTGTAGAATC GGTAGGCTAC GTCGATGCTA AGCGGCTGTT GGCGGGAGTG GGACAAGTGG AGGAGTTGAG TCCAATGACA GCGTAGCATT CCGGACAGTG CATGGCATAC TCCAGGTCTT TTTGACAGCT TTGAGAACTG TCTTGCTGAT TCGCCTGAAC CTTGGTTTCA TCATCATTTT GGTGTCGATA GTCGTAGGTG GCAAAGTTTG TTATCAGGAG ATTTTTTCGC AGGAGTCACA TACTACTGGC CTCACTCTCA AGCGGTATGT GTCGTAGATA CTGAGAGTAT TTCTCCGCCA GTCAAGGCTT TTGTCGCCAA TAGGTCCTGT GTTGATGGCA GAAAACTTTC ATGGCGGCTC GACCGTGGTC ACTCCGTGTC TTTGTGATAG GCGTTTCGGC TGCATGATAT GGTACAAAAA GGTCTTCCTC ACATCACTGT CATTGGCAAT CACTGTCATC TTTCATTCAA AGTAGATCGA ACAGTATATT TCCATTAC
|
Protein sequence | MAVPLGFRRT RLSLLFLCCL ALYFQSKALV KVWNNDVPNL DTILGSNPWL FNDQSQKSTL SLHEAGVSRG NTRNARHKTT NVINSPFVAG TTTSSQAASV IESLLLTDQG FGACLIMKDD NHWLIEWIAY HYHVLPLKHL IVLRDPRSLT PSQHIFDRWN QTYLEIEEWQ DSRVTPISVY ERGTKKKKTD LQYHLTRQRF FYSKCLKHLK QEQTVQWVLL TDTDEFVRVN SLLYPLSASV LQQRGHVSAF LAQRKVQDPC LQMPRIQVSS IRASESSIND QGKIDAWKRA ESIEGLNASD FLTYQWLVHN NQAMSVGKST MYLPTLKTND IPSLADSVHR IVSSVCPEHS ESLLNSTRSY LSVMHYLGTY EQFSYRTDPR DYDAKDAERP QREFQHLKSA SVKSRVDTWY HRGRKPAAST LDVGMTHWLP GLVETVGLYE VKKLLENVGV LESIPVETSQ HNKSKRQPEI DVEEEVFSAC LLTKDDNHWL IEWLAYHYYV FKLRRLTVVR DPTSRTSIEA ILDRWKDRMS ISVWNDEQFV PSWILQKHKT RGISDTLLHR YRQQFFYSSC MKDFKQRDRS WLALVDTDEI LRPNPYVLTA PIDLKVEGIG LTLLEEQQKK RAMRAEDGPL KCIHVPRLQI VSTEANASTV SSDIPLGLNG SDFLTTRWLY HNNREISMGN NLDGKSVFNL QWLDESAIPK RAANAHYIIP GVCPETSGDR LDHPDSWLLI HHYLGSLEQF VSRDDPRNSI EGRPKRDASL WRQTGQSPAA DTFDDSLRSW LTGFVESVGY VDAKRLLAGV GQVEELSPMT A
|
| |