Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49634 |
Symbol | |
ID | 7198290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 266003 |
End bp | 267944 |
Gene Length | 1942 bp |
Protein Length | 621 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184432 |
Protein GI | 219128463 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.387641 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCGAGCCGG TAAGCAGCAA ATGGACCCCA TGAACATATC TACAAGCCTT CTCCTGCGTC ATTCACTCGT TAGAACATGG CGAAGCGAAA GATTGGCAGC GGTCGGGCAT GTCAAGCATT AGCGGCTGCC TTTCTGGTGA GCACTCCCGC GGCTGCGGCT TTTGCTACGG CGTCTCTCTG CATTACGGCA TCACCGTGTA GGTTTACTTC TACACGAAGT CTCTCCAGCT ATCCTACAGA TGAGGGTGCT GCGGAATCTC AGGCGAATGA CAAACTCTCC AGACTCAAAG ACATGCTGCA GCAGACATCG TCGGAGGATT ACCATGGAGT AAGGGAAAAT TCCCCGATTG CGAATGGGGA CTTAAAAGTA TCGTCAGCCA TGGGCGAGGA TCTAGTCGAC AGCATGCCAG AGCTCTCGTC GCAGGGCCTC TACCAAATCT TTAGCCAAAA GCAGCATAAC GCTTTGCTGG AAGCAAATCC AGACAAGCTT ATTGTCGTCA AATTTAAAGC TTCCTACTGC CGGGCTTGTG CAGCGTTGGA TCCGAAATTT CTTATGGTCC GGAACGATGA GAAGTTGGCT CATTTGCCTA TCGTTTGGGC CGAGTTCACC GCGACTTTGG ACACCAAAGG CTTTTTTCGC CGACTAGGTG TCCTTTCATT ACCAACTGTA CAATTCTACG ACGGTGATGC TGGCCTCGTT GAGAATTTTC CATGCGGTCC GGCGAACTTT CCCAAACTTC AACAAAAACT AGCTCAATTC CTCAATCGCC GAGTGGACCC AGACTCGTTT CAGCTGAAAC CGCGAAATCC AGAAGATGGA ACGCCCACTA TTCCGCGCAG AAGTCGTGAT ATCCGAATAG GAGATGAACT TATAATGCAG GAGCACGTTG ACTTTCTACG CAATGATCTT TCCTTTTTTC AAGCCCTCAC GGACGATGAG TTCGTGACAA TGATTTCTAA AGCGCGCCTG GAAACTTTCT TACCCGGCGA TGTTATAATT CGCCAAGGAT TACCCGGCAA AACCTTTTAT GTACTCAAAT CTGGGGCAGT AGAGATGTAC ATCCGCAGCA AGTTTGATGA TCCCATCTCA ACTCCGGCTT GGTATTTGGG TGCTGTGGTC AACCAACTGG GCAAGTTTGA CTACTTTGGG GAGCGCGCGC TTTCTACTGG AGAGCCCTAC CGTGCTAGTG TCCGTGTATT GGAAAAATGT CGTTGTTTTG CGTTCAGCGT AGAAGACATT CCTGACAGCT CCATTCTGAG TCTGAAACGT CGGGCTACTC GATCCATGAT TGCTAAACTA ACGGAACGCT ACGAACTCCC GGAAGACTAT TATAAGCCAA CATATGTTTC GCAGGAAAAG CAAAAGGACG AGAACATTTT GGAACTTCTC GTGCGCTTTA AACAGATTCG CCAGGCGGCC AAGTGTTTAG AATACGTCTT GAAAGCCGAG CCACTTTTCG GAGACGAAGG CGAGATTGTC CGTCGCAGTC TTCTTGCTTC CAAGTTGACC CCCTCACAAC GACAGGATTT TATCGATGTG TTTAACATTG CAGATAACAA AGGGTCTGGT AAAATTAGTA TTCTCGAACT CCGACGATTC ATGCAAGGGG CCCGTAAGAA GTCGACAAAC GATGAACTCC TCGAAATGAT TCACAAGGCG AACCCTAGTA TCACCGATAA AACTCTTGAA CGTGGCATTT CTTTGGACGA GTTTCTTGGA GTCATGGCGG AAGCCGAATT CTACTATCTC TTCACAGACA TTTTTCAAGA TTTGGATCCC ACGGGTACCG GCTACGTCCG TGCGGGTGAT TTGGATGAAG TCTTAGATGG TGTTCGAGAT TTGATCAGCA ACGATCGAAA GAGTTTGATC GACGTCGAAG ATCAAGAAAT GCAGGTTGAC TACGAACAAT TTGCTAAGAT GCTACTAGGG GCCGCCTTGT AA
|
Protein sequence | MAKRKIGSGR ACQALAAAFL VSTPAAAAFA TASLCITASP CRFTSTRSLS SYPTDEGAAE SQANDKLSRL KDMLQQTSSE DYHGVRENSP IANGDLKVSS AMGEDLVDSM PELSSQGLYQ IFSQKQHNAL LEANPDKLIV VKFKASYCRA CAALDPKFLM VRNDEKLAHL PIVWAEFTAT LDTKGFFRRL GVLSLPTVQF YDGDAGLVEN FPCGPANFPK LQQKLAQFLN RRVDPDSFQL KPRNPEDGTP TIPRRSRDIR IGDELIMQEH VDFLRNDLSF FQALTDDEFV TMISKARLET FLPGDVIIRQ GLPGKTFYVL KSGAVEMYIR SKFDDPISTP AWYLGAVVNQ LGKFDYFGER ALSTGEPYRA SVRVLEKCRC FAFSVEDIPD SSILSLKRRA TRSMIAKLTE RYELPEDYYK PTYVSQEKQK DENILELLVR FKQIRQAAKC LEYVLKAEPL FGDEGEIVRR SLLASKLTPS QRQDFIDVFN IADNKGSGKI SILELRRFMQ GARKKSTNDE LLEMIHKANP SITDKTLERG ISLDEFLGVM AEAEFYYLFT DIFQDLDPTG TGYVRAGDLD EVLDGVRDLI SNDRKSLIDV EDQEMQVDYE QFAKMLLGAA L
|
| |