Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42590 |
Symbol | |
ID | 7195963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 517393 |
End bp | 519486 |
Gene Length | 2094 bp |
Protein Length | 634 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176598 |
Protein GI | 219109688 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGTTATCGA TCTTTTACAT TGGCAAGTTT TCTTGGTTCC GCCATTGCAA CGTTCTAAAG ATGCGCTTTC CTATTCTGAC CACAATAGTC ATTGCCTCGT TGGGTTGGAA TCACCAAGCT TGCACCAAAG CGGAAGAGAC CAACGCGAGC GAGGCGGAGA CCTGTGGCTT GTATCTGGCA GTGTCGTCCA CTTCTACCGC GGAAGAAACC AACTGGGGAT TGTATTCGGG ACGCGATCTC GCTCCCCGTG TATCCGTTGG TTTCCCGGAT GTGGGGATCA ATATGATCAA TCTGCGAGCT CACGGCGTGC CGACGGATGC AGAGAACGAC CGCAATACAC TTCTATCCCG GACCGTCGAC TTCTTGGAGA GCTTTGTGTG GGTTCCGGAT CCGGCGGGTG CCAAGTTTGA AGTCCAGAAC GGCAAGATCA TTACAGCTAT TGCTGGAGCC GGAGTTCTCG GTGGATTTAA CGTCAAACTC ACCAACGCCA ACTGGAAACA TTCCGCAGCC TATCGCCGTC CCCGTTTGGA GCACGAAAAT GATGTGGAAA AGAATCATCC TGGTCGGGGT GCCAGTACCC CCTTTTACAA CGTCATGCTC GAGTCTTCCA CGGAAATTTC GCAAGGATCA GAAATTTTTG TGGATTATGG AGATAATTGG GAAGACGAAG ACAAGGAGCA CGAACTCACC AAGGACGAAT ACAAGAAACT CGACGAAACG GTCGTCAAAA TGGTTAACTT TTTCGAAAAA CACAAGGAAG AGTTGGACGC CGACTCGAAA CAAGAAATAT TCCAATTTTT GATGCAGGAT GTCTTATCGG CAGCCATTGG ACCCGACAAG GCGCATAAAG TAGGCACTTT GTTGCCTACC GTCCCTGACG AACTCCCCAG AGTAGTCGAG GCCGGTGGGT CGTTAGCCTA TAGCGACCCC ACCATTTACC GTAAACTGGA ATGGCTCGAT GAACACGGAC GTTGCATGGA TAATATCAAG GCGGGGGCAT CCAACATCTC CTACGCGGGC CGTGGAGCTA TGGCGACACG GGCCATAAAA CAAGGCTCTC TCGTGGCACC CGTACCCTTA ATTCAAGTTC CCGACCGAGC AGTCTTCAAC ATGTACAACC TGCAACTCTC TGAAGACGGC GAAACGTACA TACGGACATC GGACGATATC GTTGGAGAAC AAATGATTAT CAATTACAGT TTCGGACACA AGGATTCAAG TTTGGTGTTT GTACCGGCTG GTGCCATTGT TAATCTTATC AACCATGGCG ATACTCCCAA TGCTAAAATG GTTTGGTCCA CGCATCCCAG CCACCGAAAG ATGTGGCTCA ACTTCAAGCC GGAGACTCTG CTGGATGATG AACAAATGTA TACTGGCTTA TTGATGGAAA TTGTTGCGAC TCGCGACATT GAGCCGGGCG AAGAAATTTT GCTGGACTAC GGACCAGAGT GGAAAGCGGC GTGGGACGCG CACGTGGAAA GTTGGAAAGG ACGATTGGCC AAGGGTGAAA TTTCCGAGAC TCGTGCGCCG ACGGCAGTCG ATCTAAATAC CAAGTACAGT GAGCAGCCCT ATCCCAGTGA ATCCGAGTTC GCCGCTCCCG AGAATGTTTG TCTCAAAGCG TCATTGTCAG TGGAAGAGTC CGATGCCACC GGAACATTGG AGAATCCCAA AACTTGGGCA ACCCCAGACG AATTTGCAAA TTTAAACCCA GACACATTGG TCAACATGCA TGTGGTGGAA AGTAGAAAAG TGGAGGATGA AGAAGATGGA GACGAGCCTT TTCGATACGT TGTCAAGTGG GCAAACAATA ATGGCGAACT TACGTTCGTA AAGGAGGTTC CACACAGCGC CATTGCCTTT GTGGACATGC CCGGAATGAG TGACGCCTTT ACTGAGGGCG CCTTCCGGCA CGTAATCGGT ATTCCGGACG ATATCTTCCC CAAAGCTTGG CGCAACCGCA AATAGGACTA CCAAGTCTTG ATCGCGATAG AAAACGTGTG CCCTTTCATT GCTTTTTTCA GTTAAGTAGT TTTATGATAG TACAAACGCA GCACCTACTG CACTACCGTA TAGCATCGAT ACGCCAAATT ATCA
|
Protein sequence | MRFPILTTIV IASLGWNHQA CTKAEETNAS EAETCGLYLA VSSTSTAEET NWGLYSGRDL APRVSVGFPD VGINMINLRA HGVPTDAEND RNTLLSRTVD FLESFVWVPD PAGAKFEVQN GKIITAIAGA GVLGGFNVKL TNANWKHSAA YRRPRLEHEN DVEKNHPGRG ASTPFYNVML ESSTEISQGS EIFVDYGDNW EDEDKEHELT KDEYKKLDET VVKMVNFFEK HKEELDADSK QEIFQFLMQD VLSAAIGPDK AHKVGTLLPT VPDELPRVVE AGGSLAYSDP TIYRKLEWLD EHGRCMDNIK AGASNISYAG RGAMATRAIK QGSLVAPVPL IQVPDRAVFN MYNLQLSEDG ETYIRTSDDI VGEQMIINYS FGHKDSSLVF VPAGAIVNLI NHGDTPNAKM VWSTHPSHRK MWLNFKPETL LDDEQMYTGL LMEIVATRDI EPGEEILLDY GPEWKAAWDA HVESWKGRLA KGEISETRAP TAVDLNTKYS EQPYPSESEF AAPENVCLKA SLSVEESDAT GTLENPKTWA TPDEFANLNP DTLVNMHVVE SRKVEDEEDG DEPFRYVVKW ANNNGELTFV KEVPHSAIAF VDMPGMSDAF TEGAFRHVIG IPDDIFPKAW RNRK
|
| |