Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49367 |
Symbol | |
ID | 7195881 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 51121 |
End bp | 54046 |
Gene Length | 2926 bp |
Protein Length | 896 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184175 |
Protein GI | 219127923 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.40946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACATAGACT CGTCATTGAC TCCTCGTGGG TAGCAAGGAT CGCCAGCACT AAGACGGATC CGCTTTACAA GCAACCGTCT AATTCATGCA GACTGCAATG CCCCGTACTC GAGCTTGGGG AAACCTAGTG GCGAGTCGCA GCGGCCAACG ATGCCCGGCT TCCCATCACC CAATTCGTCG ATGGTTGACA AGAAATAAAC CAAATCGAAG TCCATTCGTA ACCACAGCTT CTCTCGTGCC CACAAATTGT ATGCATGTTC ATACGCGAAA CGTTCAATAC TTTTCTACTG GGAAGTCAAG TGATCCCACA AAAAACTGGC GCTTTGATCA GAGAAACATG AGAAATCGTC GCGTGTCCAA GAAGGATAGT GAACAATTTG CCAAGGCCAA TCAAAAAGCC CACCAAAAGA GCAACAAACC TCCCAACGTC TTTATTAGGG AGCTAATGAG AGACATCAAC AATCTCCAAA AAAGACTGGA CAATTTAGAT TCTCTTCCGG CAATCTGGGA CAGGTCGACG CGCTCAAAGC TTTACGATAC AAATTCTCAA GAACAGCGCC TTGCGGTGTT GAACGAAGGA CGTCAGCTAT TGGAAACGAT TCGTCTTGCG GTTGAAAAAG GCACGTTAAA GCCTTTGGAA GACCACGGTC ACGAGACGTC TGTATTGATG GAAAGAATTC TCAAGCTGTA CAGTGAAACC CCGTCCACAA CCGATACGCA TTCATCGTTT GACGAATGCC AAACCGTGCT TTCTGTAATG GATGCTTGGA AGATTGATCG GCAACATCTG CACGTTGTGT ATTCTATTAC GGCTGCTACA CGGGAAGGGC GGTGGGAGGA CGCTTCCAAC TTATTCTGGA GTCACATAGA CCCAGAGGAC TCGGGCTACC GACCATATGA TGAAGACGTT GCCAATCCCG TTGGTCTTTA CGCAGTTGCA CGGTTTGCTC AAGAACGCAA CATGCCCGTT ATAGAACCTG TCATGGATAG TGTGTTGCGT ATGACTATGG TATCGCCGTT TGATCAAGAC AAATGTATGT TGTAGAATGA GCCTTTGCTT GAGCGACAGT TTGATGATTC ATCTTAACGC ATGTTTCTTT CTCGTAGATT TGCTTGCCGC GGGAACTGCA ATTGGCCATG TTGGTGAATG GGAGGCATTT ATCAAATACT TAAAAAATAG CTTCGATGCA AGTCGGCTTG GACAGGTATG TACGGTTGCC CCTGCCTGTG TTGACCTAGA AATAAAGCAG ACTCTAACTG AACAAGGTGT TTTGAAATGT CAGCCCCTCG TGGCTGCTGT CATGAAGGCT TGCATTGCAA ACGACTCCTC ACAGCAGGCA ATGGAAATAT TCGATGAGTT CGTGGTCTCT AAACTGAGCA TCGCTGGAGA GTGGCAGTGG GCTGGAGGCG AGAATCCCCT TCATCCATTG TGTCGAGATC TTGCCCTTCA AGCCTTAGGC CAATCTTTTG AAGAAGGATC GAGCAGGCGC GCATTGGATT ACTACAACCA AATAGCTCAG GAAAAGTACA CAATCAGTAT TGATGCACTT GTTGGAGTTT TTGAGGCATG TGTCAATGAT GGACAGTGGC CTGACGCTGT GGACACACTC CTCTCAATGA TTGCAAAATC ACCATCGGAT CGATGGGTTG TCACGACGGA ATCGCTTGCT ATTGAGGTAA TGGAGGATTC TGCAGCGAAG TCGATGGACC TCCTTACTCG TCTTGGTTCT TGCGTGGAAG TCACAATGCG TGGCTGTAAC TCAGACGGTC AGTTTGGGGT TGCCATGGTG TGCTTGCGAT TGTTCCAAAT ATGGATGCCT GCGGATTATT TACTGCCGTA TCAGGAAGCA ATGTCATCTA AAGGACATGA GCATTCCGAG AACTCGCTTA TTGAGGCGAT AGCTCCTATA ATCTGTGCCT TAAATGATGC TGATGGGGTT CTCTCTGCAT CCATGGTCGC CCTTTGCGGT CTTGGTTTGA ACCAAGAGGC ATCAAATTTA TTTAAGTTGG TAGAGGGCAT CTTGGTGTAT GTTGATGGAA AGATAGAAGT AGATCGACTG GAGCAGGCTC GAGACCTTGT ATCTTATGCA AACGAAGGTG CAGAAAGATT CAAACCGACA AAGCTGCACG GGGGTTGGGA GGTGTCGTAC AGGCACATTC ATCGCCTGAC ATCTGCATGT TCAATATTAG TCCACAGTAA ATCCTGTCCC TCACAAGAAG AAATGCGACT GCTTTCTACC GCTGCAGCTG TCGCTGTTCG ATCATGTACC GTTGAGAAGC AAGCCGATGC CGGTCTTGTT CTTCTAAAAT GGATTGAAGA AACCTTAGCT TCACATGCGG CGAGTCTTCG TTCACATGAG TCTGCCCTCG CTTTGCCTCC AACTGACGCT CTACTATCGG CGCGAATGGG AGCGTACTTA TCAATAGAAG ATGCCGACGC GGCTTTGGAA TTGTTTCAAA ATGAAATCAA AAACAGCCAG GGGATGACAA AGTGGAGACT TAGCACCTGC GAAGCTATCA CAGCATTTTT TCGGGTAGGG CATGCTTCTG ACGCGATGGA CTTTTTTGGG AAGGTGGTCG CCGAAAATAG AAGTCCTGAT ATCTTTTGTA GGGTTGCCGA AGGACTTGTG GACGAGAAGG ACTGGGATGG TGTTGCCAAG GTTTATAAAT CGGCCCTGTC ATGTGGATGC CTTTCAGAGG AACTTTCGAT TCTTGCAATG AAAGCAGCCG CTGCTGGACG GATAGACGGT AGAGTCCGTG TTTTCCGCAA CATTTTAGGC GAAACTGCTA AGTTTGTGGG GACACAACCT TTGGTGTGGG CAGTAGCAAA GTACTGGATA CTCAAGCGGG CCATAGGGTT TCCAAATATT TGCATTGTAA TGTGGTGGAA CACTAGCAAT CCACATCTCA ACGAGGTAGA GCTTGC
|
Protein sequence | MQTAMPRTRA WGNLVASRSG QRCPASHHPI RRWLTRNKPN RSPFVTTASL VPTNCMHVHT RNVQYFSTGK SSDPTKNWRF DQRNMRNRRV SKKDSEQFAK ANQKAHQKSN KPPNVFIREL MRDINNLQKR LDNLDSLPAI WDRSTRSKLY DTNSQEQRLA VLNEGRQLLE TIRLAVEKGT LKPLEDHGHE TSVLMERILK LYSETPSTTD THSSFDECQT VLSVMDAWKI DRQHLHVVYS ITAATREGRW EDASNLFWSH IDPEDSGYRP YDEDVANPVG LYAVARFAQE RNMPVIEPVM DSVLRMTMVS PFDQDKYLLA AGTAIGHVGE WEAFIKYLKN SFDASRLGQP LVAAVMKACI ANDSSQQAME IFDEFVVSKL SIAGEWQWAG GENPLHPLCR DLALQALGQS FEEGSSRRAL DYYNQIAQEK YTISIDALVG VFEACVNDGQ WPDAVDTLLS MIAKSPSDRW VVTTESLAIE VMEDSAAKSM DLLTRLGSCV EVTMRGCNSD GQFGVAMVCL RLFQIWMPAD YLLPYQEAMS SKGHEHSENS LIEAIAPIIC ALNDADGVLS ASMVALCGLG LNQEASNLFK LVEGILVYVD GKIEVDRLEQ ARDLVSYANE GAERFKPTKL HGGWEVSYRH IHRLTSACSI LVHSKSCPSQ EEMRLLSTAA AVAVRSCTVE KQADAGLVLL KWIEETLASH AASLRSHESA LALPPTDALL SARMGAYLSI EDADAALELF QNEIKNSQGM TKWRLSTCEA ITAFFRVGHA SDAMDFFGKV VAENRSPDIF CRVAEGLVDE KDWDGVAKVY KSALSCGCLS EELSILAMKA AAAGRIDGRV RVFRNILGET AKFVGTQPLV WAVAKYWILK RAIGFPNICI VMWWNTSNPH LNEVEL
|
| |