Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50113 |
Symbol | hPng1 |
ID | 7198915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 78720 |
End bp | 81019 |
Gene Length | 2300 bp |
Protein Length | 681 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184963 |
Protein GI | 219129581 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.424415 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAGCTTTAC AGTTGCTTGA TCGAAGACCA TGGGTTTGTC CGCACTCCAA TAGAGAGAGA CACACGCACT CACTGTCATT CACGCACACA CTCTTTGAAC TACGGGTATA TCCTTACATA CTCGCCGTCG TTATTGCCGT TGCTGTCGCT ATCGCTGCTG CTGTATTCGT TTGTTGCTCT CGCTGTGACC GTCGCGTTGT TTCTATTATT GTATTGTGTA ACATGCCGGC CTTACGCACA CGACCACTCC GTCGCAATGT TGTGGTGTGG GTGGCATTCG GCTACTGGAT CGGCAGCTAC GGCAGTGGGT GTCACGGCAT GAGTTTCCAA CATCCGGTAC CGCACGCATC GAACGATCCC TCGCCGCAGG CTTCCGGGAC GACGACCGAG GCGCACACGA GTTCCGGGTG GGGACGAAAT ATCGAAGCAC GTTCCGCACG GGGTTCTCCG GCACTCACCT TACGCAGTAC GGCTCTTTTA GTGAATCGCA GTTCCCCGAC GAACGCGGAC TCCGTCTCGT TGTCACCAGC AGCAACAACA TCAAGACAAC CCTTTTGGAA ACGTTGGAGA CTCCCCTTGG TCCAAACGCG TCCACCCGTT AGCTGTGCTG CCGCAGCAAT GTCGACAGAA ACTCCGTGGG AAGACGGTGG GGATCGAGAC CGACGACGAC ACCCAACCGA CGCTTCGTCC ACCGCGTGTG CCGAACAACC GACCCACGGC GTCGCCACCA CCCCGGGTTC TCCTTTGGTG CATTGGTCCA CGCGCCGTGA CCCCACCCGA CCTTCTCCGA CTCATGCCAT CAAGGATCCG GTTCCCACCA ACAACAATGA CACAACCAAC AACGACATCA AAAACTCGAC TATTACAACT ACTACCGCCG CCGCCGTACT CGTCGAAACA CTCCCCTCTA CCGTCGACGA CACGCAGGAC CAAAACATCA CAAAGCCCAC CAAAATACAA ATTCACACCT ACCTGGACCC CCGCTCACAA CACGTTTGGA AGGCTCTCAA CACGGCCCCC GAAAAGTTTT TCGCCTTTTT GAAATACTCG GGTCTGAGCC TGAAGCGTAA TGAGATCCGA ACGGAACAGC TGGCCGTAGA CGGACCGGCG GTCAACGCGA CCGCGCAAAC CACAATCCGC GACGAATGGC GTCAGCTTTG GAAGGAGGGT CGGCTGTTGA CGGACCGTAC CGAACTTCTC GCCGTATACC CCAGCGATCA GGACGCGGCC ACCGCCACTC CTACATCCGT AGCCCCTAAT CGCGGCGGCT TTACGGATTT GCTCTCACTC TATACCGAGC GAGTCGCGGC TATTCTAACA GACGAGAGGA CAGACGCAGC GCGCGACGGC GGCTTTCTGG TCGCTTGGCT GGAACAACAT TACGGCGTAG ACCGCATTGC TCAACTCCAG GCAGGTGCTT TACAAACGCG GTCGCACGAG CAGCAGTTGA AACTTTGGAA GGCATTTTTG GAATGGTTCC GCTCCTATTT TCCCTATTTT TATGATCGAT GCGAGTCTTG TCAGGCATCC ATGAAGGAAG ACACTGATCG GAACGCCAGT TCAGAACCGG ATAGCAATGC GGATTCTGAC GAGCCAGATC ACCAGACGTT TGTAGGGTAC GTCTATCCAC GAATCGACGA ATTAGTCGGC AAAGCCTCAC GTACAGAATT GTACCAGTGT CACAAATGCG GACATTTTAC ACGCTTTCCG CGTTTCAACG CAGCCTCGCA CGTAATGGAT CATCGTCGCG GTCGCTGTGG AGAGTATAGT ATGTTGCTGT TTCGTATCTT GCGAGCTCTG GGACACGACG CACGGTGGGT CATTGACTGG GCGGATCATG TGTGGGCCGA AGTTCTGCTG CCACCCGAAA CGACCACGCA CCACGAGGCG GAGCAGACAC CGGGTACCCC ACGATGGGTC CACATGGATC CGTGTGAAGC CGCGGTGGAT CATAATTTGC TCTACCAGGA GTGGGGAAAG AAGCAAACAT ACATTTTGGG CTTTTATGCT CCTCGCGATG GGACCCCGTC GACTCCACTG ACGCAAGAAG CGAAGCCCGT TGCGGATGAT TCTACTACAC GTCTCTTGCC AATGATCGAA GACCTCACGC ACACGTATAC ATCGGATTCG TGGATCGACA TTTGTCAAAG GCGCGACGAG TCGGAGGAGC AAGTCAAGAC TTCAATTAGC ATCGCTGTGA TGGATTTGCA CGACCGACTG ACTCGATCTG AAGAGAGCGA CCCACATTTA ACCACGCAAC AGAGATAGTC CAATCCGCAC CCCATAGAGA TGAATGAACA
|
Protein sequence | MPALRTRPLR RNVVVWVAFG YWIGSYGSGC HGMSFQHPVP HASNDPSPQA SGTTTEAHTS SGWGRNIEAR SARGSPALTL RSTALLVNRS SPTNADSVSL SPAATTSRQP FWKRWRLPLV QTRPPVSCAA AAMSTETPWE DGGDRDRRRH PTDASSTACA EQPTHGVATT PGSPLVHWST RRDPTRPSPT HAIKDPVPTN NNDTTNNDIK NSTITTTTAA AVLVETLPST VDDTQDQNIT KPTKIQIHTY LDPRSQHVWK ALNTAPEKFF AFLKYSGLSL KRNEIRTEQL AVDGPAVNAT AQTTIRDEWR QLWKEGRLLT DRTELLAVYP SDQDAATATP TSVAPNRGGF TDLLSLYTER VAAILTDERT DAARDGGFLV AWLEQHYGVD RIAQLQAGAL QTRSHEQQLK LWKAFLEWFR SYFPYFYDRC ESCQASMKED TDRNASSEPD SNADSDEPDH QTFVGYVYPR IDELVGKASR TELYQCHKCG HFTRFPRFNA ASHVMDHRRG RCGEYSMLLF RILRALGHDA RWVIDWADHV WAEVLLPPET TTHHEAEQTP GTPRWVHMDP CEAAVDHNLL YQEWGKKQTY ILGFYAPRDG TPSTPLTQEA KPVADDSTTR LLPMIEDLTH TYTSDSWIDI CQRRDESEEQ VKTSISIAVM DLHDRLTRSE ESDPHLTTQQ R
|
| |