Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46971 |
Symbol | |
ID | 7202079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 44406 |
End bp | 46496 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181293 |
Protein GI | 219121896 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTCT CTTCTCTCTG GAAAGTCTTG GACAAGGCCG GTTGCGCCAC CCCAGTGGGT ATAGCAGAGT TTACTGTTTA TGACCCACAG CACTACGCTC GTAGTCAAGA AGCACCCAAT GAGGCGAGTC CGTGGAATCA CAACCTGGGC GTTCGTCCAT CTGCAATACG AGAGCAACAC AAGGCAACCA CATTGGCGGT GGATTTGTCA ATTTGGATTT GTGAGTCCCT TACGTCCCGT GCGATGACGG AAAATCACGC CAATCCGGCT CTTCATCTGG TCTTTTCACG AACAATGAAA TTGCTTTCCC TGGGAATAAA GCTTATTTTC GTGCTTGAAG GCAAGCGACG CGTGCAAACG GCGGGAAAGC GAGACAATTT TCGTAATCGG CGCAGTGGCA CTACCTTTTG GAAAGCAGGT GAGCAATGTC ATGACTTGTT GACACGGCTT GGCATTCCAG TTTTCCGGGC CAAAGCCGAA GGTGAAGCTC TTTGTGCTTT ACTTTCACAG CGCAATATTG TTGATGGTGT AATTTCCAAT GATGGGGATT GTCTACTATT CGGAGCCCGA GTTGTATACA CGAAATTCTC GGTCGAAAAC CTTGTTGAAG GTAGTGTTAT GCGGTATGAC CTCGGCAATC TTCGAGCCCT GATTGACCAC GCCGGCGACA AAGAAGCCTC AGACCAGCTT ACTGGATCGC TTTCTCTCAG TCGTTTTGAC CTCCTCTCTT TTGCATTGCT CACCGGCAGC GACTTAGCAG GGAACGGACT ACCGAAGGTT GGGCACAAAA AGGCCATTCG TTTCATTCGA AAGTGCCAAA TCGACAACCC CCTTACGACT GAGATGGCCT CCATTGATGA GGTGAAATCA TGGGCAGTTG CCGCTCATGT TCGACCAACC AACCTGCCAC ATCAAACGAA AGCGAACGAA AAATGCTGTA GTCGCTGCTG TCATATCGGA ACCAAGCACA GCCACGAGAA ACTAGGGTGT GAAGCTTGTG GTACTGCTCC TGGAGAACCA TGCTATGCAT TCTCTACGGA AGATCGCTTC CGAAAGTCTC TCCGCGAAAA AGCTCTAAAA GTCATTCCGA TTTTTGAGCC TTCCCAGGTT GTTGAGGCCT ATCTGCGACC TAACGACAAT CAGCTCCCCG CTCAGCTGGC TGGCAAGACC TCAAATCAAA TAAAGATGGA CCCACCGGAC CTCAACGCTC TCTTGCAACT CCCTCTAATA CTCAAGGGTC GTAGTCAGGA AGAGAGTCGC GAGTACTTGG TTCGTTCAGT TGGCCGCCTC TTATCTCGTG CTGAACTATT TCGAAAGCAC GAGGAAAGCG ATGAAAAAGA ACTGATGACT TCTTATCGCC GAGCGCGAGA AAGACCCATT CCCCACAAAA TTACGAACTC AATGACGCAG AACGGAGTGC CTTCTTACGA GGTAAAATGG TTGGTTAACG CTACCGTGAC TGATAATTTC GGTGAAGGAA TCGATGGCTA CGAATATTCC ACGGTTGAAC CTTGTGACCT GATAGCGAAC CGTTACCCCG TCCTAATAAA GGAGTTTCAG CAAGCGGAGA AAGAGCGTAT GAAGCAAGGA GACGGAGAAA AGCTTCGTCG TTTAGACTTT CTACAATCGC ACCTGTTCCT TCACAGAGAC CCGCAACGTC CTTCAGAAAC CGGCGACGCA AAGAAGGTGA AACGATCGAA GCACCGAGCA GGATTCTTTG AGACCAAAGA AGCAAGCCTT CCAAAACTCT ACGCATCAAG CAGACGAAAT AATCGATCCA AGCGCAAGAC GGGAGATGAC GTTGGTCATT TGTTACGTTA TATTGTCCGT TCTAGCGACA TCACACAAGA TCCATTGGGA TTCAGTCCGA TTGACAACTC CAAATATGGA TTACTTCCTG GAACGAGCTT GCGCGGTCAC TGTACTCCAT CGCCGCGAAA GAACGTATAC AAATCAAAAG CTTCATTGCA CTCACCGGAA GGAAGGGTGT TTTGCAATAT GGGTGGTGTA TTCATTGAGA TTACGCCGAT TATATCCAAC CGTCGTACCT TCCCACCCCG TCACATATTC ATTCGCCGGA GTTCAGCCTA G
|
Protein sequence | MTVSSLWKVL DKAGCATPVG IAEFTVYDPQ HYARSQEAPN EASPWNHNLG VRPSAIREQH KATTLAVDLS IWICESLTSR AMTENHANPA LHLVFSRTMK LLSLGIKLIF VLEGKRRVQT AGKRDNFRNR RSGTTFWKAG EQCHDLLTRL GIPVFRAKAE GEALCALLSQ RNIVDGVISN DGDCLLFGAR VVYTKFSVEN LVEGSVMRYD LGNLRALIDH AGDKEASDQL TGSLSLSRFD LLSFALLTGS DLAGNGLPKV GHKKAIRFIR KCQIDNPLTT EMASIDEVKS WAVAAHVRPT NLPHQTKANE KCCSRCCHIG TKHSHEKLGC EACGTAPGEP CYAFSTEDRF RKSLREKALK VIPIFEPSQV VEAYLRPNDN QLPAQLAGKT SNQIKMDPPD LNALLQLPLI LKGRSQEESR EYLVRSVGRL LSRAELFRKH EESDEKELMT SYRRARERPI PHKITNSMTQ NGVPSYEVKW LVNATVTDNF GEGIDGYEYS TVEPCDLIAN RYPVLIKEFQ QAEKERMKQG DGEKLRRLDF LQSHLFLHRD PQRPSETGDA KKVKRSKHRA GFFETKEASL PKLYASSRRN NRSKRKTGDD VGHLLRYIVR SSDITQDPLG FSPIDNSKYG LLPGTSLRGH CTPSPRKNVY KSKASLHSPE GRVFCNMGGV FIEITPIISN RRTFPPRHIF IRRSSA
|
| |