Gene Information |
Locus tag | PHATRDRAFT_47420 |
Symbol | |
ID | 7202554 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 584337 |
End bp | 586705 |
Gene Length | 2369 bp |
Protein Length | 514 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181588 |
Protein GI | 219122513 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
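The Gene Length reported above agrees with the Start bp and End bp values if both are read as 1-based, inclusive coordinates. That convention is an assumption here, since the record does not state it. A minimal Python sketch of the check, using a hypothetical helper named `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the length of a feature from its start and end coordinates.

    Assumes 1-based, inclusive coordinates (not stated in the record itself).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information table for PHATRDRAFT_47420.
length = gene_length(584337, 586705)
print(length)  # 2369, matching the reported Gene Length
```

The 514 aa protein accounts for fewer bases than the full 2369 bp span, which fits the record's note that the gene is spliced.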
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGTGGCATCA GTGACCAAGG CCTGTAATTG GAACGCGCCT ATGTCTCTCA CAAGCACAGC GGAGCAACTT TCTGATGAAG AATTACGAAA GTGGTTGAAA CCGAACGTAA CGGAAGAAGA TGCCTTGAAA GTGCTGGACG CCTCTTACGT TCCTTCCTCC GGCAGCCAGC ACCGGCGGAT ACTGAAACAG CTCGATAGTT ACGACGACGC CAATTTCTGG GTCGAAATAA ACAACGTACA GTATCTTCTC AAGTTTCACA ATGGGGTGGA ATCCCAAGAC TATTTGCGAT CCCTGGAGGG GGCTGCTGGG GATTACCACA GGCATGGTCA CCAATCTTCT GTCATTCACT GGCAAACCAC TTTGATGGAA ATTTTGAATC AAGAAGGAAT ACGCGCTTCG ATACCCAGAC CCCCACTATC GGAATCGTAC ATCGACACTG ATGACCATGA AGACGAATCA AGTCTCCAAA CCGGTGTTGG AAAATGGCCG GACAAGAATC TTGTCCAGGT GTGTGTGCAT GAACTACCGG TAGTGTCTAG TGAACGTTCG CCGTGCAATC TTGTTGTGCG ACTGTTGACG TGGACTTCGG GTAGGCCGCT GAGCGCACTG CGAGCTTTTC CCGTGGAGAC CCTTGCCGAA GCTGGTCGAC TTTTGGGACG TATCAATCGC GCCTTTGACA AACTGACACT AGTGGATTCG GCTACGAAGA CCGTCTCGGA TGCTTTCCAT CGGTACGATA CTTCCGTGTT GATCCCCGGG AAAAGGTATC ATCAATGGGA TGGCAAACAC ACGGCCGACT TGGAATCCTT TGTGCAGCAT ATTCCGGATC AGAAACGTAG AAGCCTCGTA GAGTCAGTTC TGGATGCCTT CCAGCGAGAT ATCCTGGATA CTGGCATCGA CAAGAAGTTT CGGACGGGGG CCTTACATGG GGATTTTAAT GACGCCAACA TTTTGGTAGA CGAAGATCTA AATGCTACGG GTGTAATTGA TTTTGGAGAT TCCATAATAA GGTAAGTTTC GCAGCACGAC TCGTCGCTGT TGCTTGCGGC TCATCAAACT AGAGTGCCCG GATCCATTTG TGCGCAAGAT CGACCGATTG ACAAAATATG TCGTCGGAGG TTAAGTTCAT GATCGCCGTG CTGGATACTT CATTTCGACG AGCAAGATTG AATGAGGGAA TTAAAGCACC CGAATGTCAA CGCTATCACA GCTTCTCAGT TGGTGAGAAT GATTGTGACG ACGGCATTCA AACACCATAT CTGATACTGT CCTTTCTTTG TATGATTCGT CAAAGCTGGC GAGTATTGGA TATTTCCGTA GCTATGGCGT ATGCGACATT GTCTGTGTAC GGTAAATCTA AGCGTATGAT TTCTGCCGCG GCTGCTATTC TTCGCGGTTA CAATGCAGTT TATCCTTTGA CCGAATTGGA ACGCAAGCAT TTGGTGCTAC TAATGTGCTG CCGGCTCGCT TGCTCCGTAA CACTCGGGGC GTTCTCGTTT CAACAGAATC CTGAGAACAA GTACTTGCTG CTTCATTCCG AGCCAGCATG GGAAGCTCTC GAATTGCTTT GGTGCTCCGA TCCGGGCCGG CGGTTTGAAA TATCCAATGC GGCAAACAGT CTTTTTTCTC AAGCTTGTCT TTACAAGGAT TCTCTACAGG GCGTGGTGGA TTGCTCCGAT ATTGATCTTC CTGATCCAAC GGTCCCTGAT TATTTCTCGT CAGTTCGTGC TCTCCCGATG CCAACAGGGC ATCGACAAGC CAGCAAGGAT GACGTAAACG ACTGAGATTG ACATCATAGC 
CTTTGTGACA GAAATCGTAA CGAAACTCGA CGCGAGAAAA AAATACTGAC AGTGAAGACA TACTGTTTCG CAGTAAAATA GACCCTGTCT CAAGTTAAAG GAGGTCATCT ATCAATTACA AGAGAAAGGC CTGCTGTTCG TGAAACAAAT CCGAGCAGCT GTCACAATCG GAGATAGCGC TTTTCCTTCA ACGCATTTGA CGGTTTACCA TGACCATACA TAAAGGGGTT TCTAGAGCCA TATGGGCATG GCGGTCTGAA CCGAACGATG AAATCTTTTT GTGACAGGAG TGCGAATGCT CAGGTCGTTG TTGTGAACCT GTCACCTTTG AAGGGCAAAA GGTAGGTGAT ATTGCTGAAC GAGGACATCT AGATTTTTAC TGGGATTCAA TCTTTTTAGG GTTATTTCTA TGGTGAAATG GAGAAGAAAA CCGTGGTTTA CATCGGAGCC GAGTATTAGG CAGTTAAGGG CTTTCAAGTT TTTTAACTGT AAGATTTGCT GGAGCTTTGG CAGACGTGAA TACCGTGTGG TAGGTTGTAA CTTCTAGTAA CGTTTACATT CGTCTACGA
|
Protein sequence | MSLTSTAEQL SDEELRKWLK PNVTEEDALK VLDASYVPSS GSQHRRILKQ LDSYDDANFW VEINNVQYLL KFHNGVESQD YLRSLEGAAG DYHRHGHQSS VIHWQTTLME ILNQEGIRAS IPRPPLSESY IDTDDHEDES SLQTGVGKWP DKNLVQVCVH ELPVVSSERS PCNLVVRLLT WTSGRPLSAL RAFPVETLAE AGRLLGRINR AFDKLTLVDS ATKTVSDAFH RYDTSVLIPG KRYHQWDGKH TADLESFVQH IPDQKRRSLV ESVLDAFQRD ILDTGIDKKF RTGALHGDFN DANILVDEDL NATGVIDFGD SIISFSVGEN DCDDGIQTPY LILSFLCMIR QSWRVLDISV AMAYATLSVY GKSKRMISAA AAILRGYNAV YPLTELERKH LVLLMCCRLA CSVTLGAFSF QQNPENKYLL LHSEPAWEAL ELLWCSDPGR RFEISNAANS LFSQACLYKD SLQGVVDCSD IDLPDPTVPD YFSSVRALPM PTGHRQASKD DVND