Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44493 |
Symbol | |
ID | 7197718 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 726186 |
End bp | 730242 |
Gene Length | 4057 bp |
Protein Length | 1247 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178290 |
Protein GI | 219114989 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAGCCTTAT TTCATCTTGC CATCTTTATT GGTCTTTGCA GCTGATCGTA TTCACAGTAA ACCTCTTAGC ATTCCTTGCT GACAACATTT TCTTTCAGCC TGGACGAATT CAAATCTGCT GTGTAGACTT TCTACGTAAC TGTACGGGGA ACGTCTTGTC ACTGGTCTTC TTTGCATTTC CTATTTTGCA CCTATCATTG ACTTGCTCTC TAAGGTCGGA TCCAGCCAAC CATTGCAACG ATGTCGGTTA ACGTGAAGAC TGTGTCCCAA GCCGAACCGA CGACTTCTCC AAAGTCAGAA CCGCCCATCA CGTCATTTAC ACTAAAGGTT TCGGCCAAGG ATGCAGCGCG CTTTTTCGCC ATGTCCAAAG CTATCAATGA GCGACTACCA GAAGTATCAC CAGATACAGG AGATACTTGT GAATTCAGTC TTCACGCACA ACTCTACGAA GCCACGCAAA AGTGCCCTGA CGAGCTGGTG AAAGGATTTT TGGATCGAAT GATTGCTTTC CTCATCCCTG TCAAGGAAAT AAAGGCTGGC TTGGTGCTTT TGCATGATGC TAACATGAAC GATAAGCTTC TACATGTTGC GGAAGCGCTG AGCCCAAGTG CTACCACTAG TGGTTTGGAT AAGGGACAGC TGACTTCATT GTTTCGGTCG CTGTTGACGG CGATCTCATG CTGTATTGAC CAATCGACCG AGGCAGTCGA CTTGGAAGAT ACGCTAATGG AAGATAAACC CCAAGGACTA GAACCGCCGC CGAAAAAGAC CAAATTAGAC ATAAAAATTG TCGAAGAGAG CGATCAAGAG TTCCGACCAT GTCAGTCCCC TTCTTTTGAC TGTTCGTTGG CAACGCTGCG AGACGAGGAC GACACAGCTA CCAATACTGT TCGTCGAGAG ATTGACGAAA TATCTAGTTT CGCTGCAGGA GAAGTTCTCT CAGACGGAGT GGAACGAGCG ACTTTTGATC TGTTGCGATC GTGGTATGAT GAAAAAGGGA AGTCGGTTGT TCCTTGGCTA GAACTATTGG ATGCAACTAA GTGGATGTCT ACCGAACCTC CAAGCCCACC CTGCAGAAAG ACGGCCGAAT GCGACCCCTC CATGGAAACG CACCAAGCTC TAAGAGACGA TATACTGCAA GTCGACGAGA ATGAAAGAAG TGTCTCTTCT GGCTCTAGAG ATATCCGCTG CTCTGTTGAA GAAGAACTGC CTGTCCCACC TACAGCCGCG CATTCTCCGA TTGATTCACT GAGCGATGGT GACGGCAGTC GAATTCTCGT TTCGTTTGAT TTTAGCGGCA CTGGCCATAC AACCCCACTT TGCATCAATG TGTCGGAAAA CAATATCGTT GCTCTTCGTC AGTTGGTCCA TCGCACTGGT ATTGTGCACT GTCAAGCCGT GGAAATGTGT CGTCGTTTGC TTCATATGGC TTCCCAGCGC CAAGAAGGTA ATGAAACTAT CCTGGCGCTT CACTGTCATG ATTTTTCTCG ATCGATTGAT CAGTTGTGGT CTCCAGAGGT GTTGAAGAAT ATTTCAAAAG AAGAGAGGGA ATCTTTTACC TTGTCGCTCA CTTCAATTTT GTCTTGCTAC CAAGAGACAA AGCCATCTCT TTCGTCTGAA GAAGTCGACC TTCAGGAATT TGCCGTGGGC TTCTGCTTCT TTTGCGCGGG CAACAAGAGT GCAAAGCTTG CAACAGGCTT TGAGATGTTG GATGATACAA GACATGGATA TTTGACCGAA CAGCAGTTAC TACGATACTT GAGCTCCTAC CTCATGATGC TTTCAGCGAT ATCGCTCTTA CATCCGCTTT CCAAAGGGCA TCATTCAAAC AAGCTGACCC CTCAGCGTCG AAAAGCGATG CGCACTGCAG TTGACAACGG TGCAAAATGG ACAATCGGTC ACTTCTTGAA GCACATGCAT GATCGTGAAG GTGGAGAACA CCCAAATGCA TATTCCTTCG AGTCATTTGC ACTGTGGTAC AGTGTCGGAG GATACAATGT TGCACCCTGG CTAGAACTTC TTGACCTCAA CAAACTTTTC GTATTGATCG CTCCGGACGT GGAAAGCTCC TTGCATACCA AATCTTCTGC ATCAATGGCA CACACATCAG GGGGACGGCG TCCAGCTGGT CAGCGCGATC GAATGTCTAC TCTACGCAGA CACCACTCCC GACGGAATCC AGGAACTCAC CCGGAAGTGT TGTTCACCTT CCCTCTAGCC CGCAGCCGCT CGTTAGTCGT TTTGAAGGAA GATGCGAGAT ACGTTCGTGA TGTCGTCCAA GAACTCGGCC TCCTGTCTTT CAGCCCCGAT TTCGTGTGGT CCAGTTTGAG CAAAATTGTG ACACGACCTA GCACTCCCAC AGAAGGCTAT GGCGTCGACA TGCAGACCTT TGTACAATGC ATGATCGATG TTTGTAATAA ATCAAGTCGC AAACGTTCTG CGTCAGGAGC TGAGTCTACT ATGGAAGAGC TTCTGTGTAA TTTTTATCAA TGTTTCAATT TGGATCAAAA AAAGCTGGTC GCCGTAGACG AGCTTATGGG TGGCCTTACT CTGCTTTGCG GGGGAAAGAA AAGCGTCAAA CTTGCCTTCG CTTTTGGAAT TTTCGATACA CGCCCGGGAG TTCACGGCAA GAGTGCGGAA TCGGTTGTAC ACTCTTTGGA TGGTCACGAT CTCTTCGTGT TTCTACGCTC AATCCTGATA GTTGCATTCT CATGCTGTCG CCAAAGCTTA GATATGGATG ATTCTGTTGT AGGGCAATGT ATCTCTGATA CTGCCAACAT GCTTTGCAAC GACGTCATGA CTCACCAGGG GAAGCAGCGC TTCTGTGACC GCCTCAACTT TGATGAATTT GGGCTATGGT ATAACGAAGG AGGATTTGAG CGAGCTCCAT GGTTGGAGCT ATTAGATCTT AAGAAGTGGG TGCTTGCCGA CAATTTCGAC GCCACCCTTG AGAAGCGTGT TGTCGAGTCA CAATTACAAG TGATCCCTGT AAGCATTGCG ACAGATTCTT CAATTCCACC GCCTCCTCCC GAGGATGCAT TAGACGGTAG CTTTTTCGAA GAGAACGGAA TTATGGCAAT GGACAGTGTA TGTATCCGTT TTAAAACTGA TTCAATTCTT GCATATCTTT TTAATCCGTG TTCTCTTTCT CAAATTTCAG ATGGATGAGA TGGATATGAT TTTGATGCAG TCGTCTACAG ATCGCGAAAG TGACCAGCGT TCGCCGGCAT ACGGACCTCT CCCTGAGTCT TCGTCTCACT CGCCAGGATC AAAACTGTGC TCTCCCGATC CAAGAGGGAA TCCACTCAAG TTTCATCTTT TGACAAATGA AGAGCAAGGA GGGTATAATG TTTCTCTCAG TCACATCCGC ATCACACATC TGAAAAGCGT GCTGGAGGAC AACTGTTTGC ACGGACTGGA CTGTGAAAAT GTATGCAACG CCATTCTGAA TAAAGCGACG AAGAAGAATA AAGCGATATC AAAAAAGGGT TTCGATGCAG CAGTTGCAAG TGTCATGGGA AGTCAGAGAG GAAGGCCTGA GACGCAACAA GTGTTGTCTA ACCTTCTTTC GGGAATTTTC GATGCGTTTG ATCGATTAGG ATCAGGCACT CCGAGCGCAG TAGAAATTGC GTGTGGCTTT ACTGTCCTTT GCCACGGCAA GAAAAGCGAC AAACTCGAGT TTGCCTTTGA AGTCCTGGAC ACGAAGAAGA AGGGCAAACT GAGCCGCTCG GATATTTTGA CTTACCTGCG CTCTTTCCTG ACTGTTCTCA TGAGTATTGC ATTTTCGCCA GCCCTCAAGA AGGATATTCG AGACGACAAG ATATCCACCA TGAAAGGATT TGGCTGTAAT CAAACAACCG CTGCCGTCAA ACACGCCGTG AACGCTGGCG CTGAGTGGGC CGTAACTGCA GCATTCGATG GAAAAAGAGA AGGCGACATG TCCGTTATGA GCTTCAATGA ATTCGCGGAC TGGTACACTA CCGTCGGCTA CAGTAGTATT CCATGGCTTG AGCTGTTGGA TCTACAGAAG TGGGTTTTCA CCAATGATGC TACCTAA
|
Protein sequence | MSVNVKTVSQ AEPTTSPKSE PPITSFTLKV SAKDAARFFA MSKAINERLP EVSPDTGDTC EFSLHAQLYE ATQKCPDELV KGFLDRMIAF LIPVKEIKAG LVLLHDANMN DKLLHVAEAL SPSATTSGLD KGQLTSLFRS LLTAISCCID QSTEAVDLED TLMEDKPQGL EPPPKKTKLD IKIVEESDQE FRPCQSPSFD CSLATLRDED DTATNTVRRE IDEISSFAAG EVLSDGVERA TFDLLRSWYD EKGKSVVPWL ELLDATKWMS TEPPSPPCRK TAECDPSMET HQALRDDILQ VDENERSVSS GSRDIRCSVE EELPVPPTAA HSPIDSLSDG DGSRILVSFD FSGTGHTTPL CINVSENNIV ALRQLVHRTG IVHCQAVEMC RRLLHMASQR QEGNETILAL HCHDFSRSID QLWSPEVLKN ISKEERESFT LSLTSILSCY QETKPSLSSE EVDLQEFAVG FCFFCAGNKS AKLATGFEML DDTRHGYLTE QQLLRYLSSY LMMLSAISLL HPLSKGHHSN KLTPQRRKAM RTAVDNGAKW TIGHFLKHMH DREGGEHPNA YSFESFALWY SVGGYNVAPW LELLDLNKLF VLIAPDVESS LHTKSSASMA HTSGGRRPAG QRDRMSTLRR HHSRRNPGTH PEVLFTFPLA RSRSLVVLKE DARYVRDVVQ ELGLLSFSPD FVWSSLSKIV TRPSTPTEGY GVDMQTFVQC MIDVCNKSSR KRSASGAEST MEELLCNFYQ CFNLDQKKLV AVDELMGGLT LLCGGKKSVK LAFAFGIFDT RPGVHGKSAE SVVHSLDGHD LFVFLRSILI VAFSCCRQSL DMDDSVVGQC ISDTANMLCN DVMTHQGKQR FCDRLNFDEF GLWYNEGGFE RAPWLELLDL KKWVLADNFD ATLEKRVVES QLQVIPVSIA TDSSIPPPPP EDALDGSFFE ENGIMAMDSM DEMDMILMQS STDRESDQRS PAYGPLPESS SHSPGSKLCS PDPRGNPLKF HLLTNEEQGG YNVSLSHIRI THLKSVLEDN CLHGLDCENV CNAILNKATK KNKAISKKGF DAAVASVMGS QRGRPETQQV LSNLLSGIFD AFDRLGSGTP SAVEIACGFT VLCHGKKSDK LEFAFEVLDT KKKGKLSRSD ILTYLRSFLT VLMSIAFSPA LKKDIRDDKI STMKGFGCNQ TTAAVKHAVN AGAEWAVTAA FDGKREGDMS VMSFNEFADW YTTVGYSSIP WLELLDLQKW VFTNDAT
|
| |