Gene Information |
Locus tag | PHATRDRAFT_47966 |
Symbol | |
ID | 7203147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 569495 |
End bp | 571509 |
Gene Length | 2015 bp |
Protein Length | 391 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182423 |
Protein GI | 219124254 |
COG category | |
COG ID | |
TIGRFAM ID | |
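The gene length reported above can be reproduced from the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption that agrees with the 2015 bp in the record); the function name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above (Start bp and End bp).
gene_length = feature_length(569495, 571509)
print(gene_length)  # 2015
```

The strand (here "-") does not change the length; it only determines which end of the interval is the 5' end of the gene.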
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.667552 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACCAGCACT CTCGAAAGTG GTGACTAGTG CGACTGGTGC TTAAATTAGT TCTGAGAGGA TCTCTGCAAC TGCCTATTTC TATATTGTCG AGGGACTCTG CCAAGGAGAA ACCCAGAAAA ACGATACCAC CAGTTCCTCC TCTGTTGCTC GTACAGATTT TCCTCTGGCG AGCCGTAACG TGTTACCTGT CGCATAGGGT TGGATTTAAT CACGACGTAT CTCTTTTTTT GGAATCAATG GATTATGCGC GAGCCAAGCC GAACCGTGAG GAAGCTGTAG CTGATACTGG CGAGAAGAAG GCGAGCGCAA AGAAGGTGAA GGTGGGAGTC TACTACTACC CGTGGTACAG CGGGAATTTT CACGGTGGCA AGTACTTGCG TCAAAAACTG GATCCGGTCC AACAACCTGC GCTGGGCGAG TACGATGATC GTGATCCGGA CGTGATCGCG CAGCACGTGG CTTGGAGTCA ACAAGCCAAC ATTGACGTAT GGATAACCAG TTGGTGGGGC CCGGACTCGG ACTCGAATCG CACAACCAAA GATGTCATTC TGGCGTCACC CGTCTTCCAG AAAAGTGGTT TGCAGCTGGC GCTCTTTTAC GAATCAACCT CACGTCTGGG CAAAGCATTC GACGATATTT CCAATATTGC CAGCGACATT ACTTACATGG CCAAGACGTA CTTCAAAAAT ACCAACTACT TGCGCATCGA CGGCAAACCA GTCCTTGCGG TATATTTGAC GCGATCGATT GAAGCACGGA GCGATATCAA CCGATTCACC AGTATTGTCC GACAAGCTGC ACTTCACGCC GGCGTCGGAG AAATTTATTT GTTGGGGGAT CACGCGTTCG GAAAGCCCCC AGCCAAGAGT TCTCCTTCGT ACAGCAAAAA GATCAATAAT CTTCGGCGAC TGGACGCGAT CACCAACTAC GATGTCTACG GTAGTATGCA CGCCAAAGGA AAGCACGCGA CGCAAGCCGA AGTCGCTGCC TACACGCAGG CCCAAGAAAA CTGGCGCACC ATGGCGCAGG ATGCCAAGGT CGCCTTTGTT CCGTGCGTAT CACCCGGCTT TAACGATCGT GGGGTGCGTT TGCGGGCCAA CCATTCAGCG TTGTCGCGAA AGCTTGATTC CGACGAGAGT GAACCGGGAT CGCTCTTTCA GGCACAGTTA CGGCAATCGG TGGCTCTCGT AGATGATGCG GCCGACCGAC TGTTGATGGT GACGTCATTC AACGAGTGGC ACGAGGACAC GCAAATTGAG CCAGTTGCGG AGCAAGCAAT GACCGTGCGT GATGGTAGTG ATGGGAGTCA AGACTACACG CAGGGGGTCG GCTACGAAGG ATACGAAACG CTATATTTGG ATCTTTTGCG GGACGAAACG GTAGAAAAGC AGAAGTGCAA TGCAGAAAAA TAGAGACGGC TGGGATTGAT ATTTACAAAT ATGATTCTAA CATAGTTTAT GTACGTGTTT TCTAAAGGTG AATAAGAAGG GCGTCGTGGA GCGGATACTC GACCAAGTAG GACGCTTCAA AAAGACGCGA AAAGTACCAA TCTTTGCAAA GGAGGAATCT TATTGAGCAT CACTGCTCTT GGAAAATATA AGATCAGATA AGGAAAAGGA AGCGTGCACT TTGCAGGCGT CCTACCACTT CAGCGAATCA TGCCACGAGG GCATCACAAG CAATGAGCTT GGAAATTTTA AACAATCGAA GTGAGGATTC AGTATCGTCC AACCTATTCT TCGCTGATAG CAGATAGTCC CATTTGGTTC GAGGCGATGC GGTTGCGAGA AGCGAGCGAC 
TCGGAAGTGT CGGTCGAGGA ATCACGAGTT AAAGAAGCTT GTTCCACATA CTCCAAAAAA AGTTTGCGTT TCATTCTACG CAACAACAAT GCTGGACTTT CGGCCATTTC GCTGTGTGAT ACTGCTTCGT TTGGGGACGT GGAACGGTTG CGTGATAGTC GGCTCATGGT TAACTATCTA TTCGAGGTTT TTCTATACGG AATGGTGTCA GGAAC
|
Protein sequence | MDYARAKPNR EEAVADTGEK KASAKKVKVG VYYYPWYSGN FHGGKYLRQK LDPVQQPALG EYDDRDPDVI AQHVAWSQQA NIDVWITSWW GPDSDSNRTT KDVILASPVF QKSGLQLALF YESTSRLGKA FDDISNIASD ITYMAKTYFK NTNYLRIDGK PVLAVYLTRS IEARSDINRF TSIVRQAALH AGVGEIYLLG DHAFGKPPAK SSPSYSKKIN NLRRLDAITN YDVYGSMHAK GKHATQAEVA AYTQAQENWR TMAQDAKVAF VPCVSPGFND RGVRLRANHS ALSRKLDSDE SEPGSLFQAQ LRQSVALVDD AADRLLMVTS FNEWHEDTQI EPVAEQAMTV RDGSDGSQDY TQGVGYEGYE TLYLDLLRDE TVEKQKCNAE K
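The sequences above are displayed in space-separated blocks of ten residues. The following is a minimal sketch of how such a field could be cleaned and its GC fraction computed; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first three blocks of the gene sequence are used as sample input, so the result is not the GC content of the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Strip the spaces and line breaks used to group residues into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence from the record (sample input only).
prefix = clean_sequence("AACCAGCACT CTCGAAAGTG GTGACTAGTG")
print(len(prefix), gc_fraction(prefix))  # 30 0.5
```

Applying the same two helpers to the complete gene sequence field would give a value that can be compared with the GC content listed in the record.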