Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49168 |
Symbol | |
ID | 7195623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 116811 |
End bp | 118604 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183815 |
Protein GI | 219127173 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGACA AAAACTGCCA GGCCTTAACC TTGGACTTTA TTGCCGTAGA GGACGACGCG GAAATGATCG CCGTGGAGGA TGAGATTCGC CGGGACTTGA AAGAAATTGG CGTTACAGTC AATACCCGTT TTCTTAGCCG AGAGGATTAC ATTGAAGCTG AGTTGAATGG TGACTATAAC ATGCTCTTCA CTCGTACCTG GGGTGCCCCG TACGATCCTC ACAGCTATTT CAATTCGTGG GCCGTTCCAA GCCACGTCGA GTACACGGCC ATTGACACGT TAGAAGCACC TCTTAGTCGC GAGCTCCTTT TGAAAAAGAT TGAAAATGTG CAGAAGGAAC TAGATGAGAT GCAGATTCAG GCACAGTGGC GAGAGATCTT GAACGATGTC CATCAGCAGG CCATCTTTTT GCCGCTCTGG GGCACCCGAA TTCCATACGT GATCAACCGT CGTCTTTCAG GGTTTACGCC AAGTGATCAG GCCTTTACGT ATCCATTAAG TAGCATTCGT ATCTCAAGCG GATCTGCCAA CATCACCATC GCGCCGGGTT CCGGCGGCTC GCTCTTCACG TCGGTCGGAC CCTTGAATCC TCACCAGTAC TTTCCCAATC AGATTTTCGC CAGCGATTGG ATTTACGAAG GCCTCGTGAA TTACGGACAA GATGGTGAGA TTGTTCCATC GCTAGCATCG GAGTGGACTA CGGAACGCAC TGCCGAAGGA CAGCGCGTTA TCTTCCAGCT TCGTGAAGGC GTCAAATTTC ACGATGGCAG TGATTGGAAC TGCACTGTCG CCAAGCTCAA CTTTGACCAC ATTTTTTCCG ACACGGTCCG CGAACGTCAT TCCTCATTTG GAGCTACAGC GAATCTCAAG AGCTGGACGT GCAATCAGAA TGGGGAGTTT GTTTTGGAAA CGTCCGCACC GTTTTACCCT CTGCTCCAAG AGCTTACGTA TAGTCGCCCG TTTGTTTTTG CGTCTGCTAG TTCCTTTGCT GCAGGCATTG ACTCTGATCC AGAGACTCAA AACTCATGTG AATCCGGAGA TTTTGGGTCC AAATGGGACT ATCTTGAGGA GTTTGTTACC TGCCTCGGTC TCTCGGCTCC CATTGGTACG GGACCGTTCA AATTTGCGGA TCGTGAATAC CTCCCGGGAA CGAACGAGAC AATGGATGCC AAGGTTACGT TTGCGCGCCA CGAAGACTAT TGGGGTGGCT TGCCCGCAAT CGAATTCCTT GAAATAATCC ACTTTGAGGA TACGGATGCG GTCGAAGCCG CGTTATTTGA CGGTCAGCTG GATATGGTTT TGGGCTCCGG TCCCCTTTCT GCCAAACAAG TTCAGAATAT CAAGTTTGTA CATAGCGACA AGTTTGATGT CCGCCACAGT GCAGTTTTAC AGAATGCACT GGTTGTCTTA AACTCTGGTA AGGCACCAAC GGATGACATC CAAACACGCC AAGCCATTAT TCACGCCGTC AACAAAGCAA TCTTTATTGA AGATGAGTTT GCGGGCTTGG AACAAGCCGT TTCGCAGCTT TTGCCGCTCA CCGCACCGTA CAGTAACGTT GATCTCAATC CAAAGTGGAA TTACGATTTG GAAAAAGCCA GATTTCTCAA CTGCCCTGCA GATATGAATG GCAGCTCGGA GGACAGCTTG TCGGGTGGTG CAATTGGGGG TATTGTTGCG GCAATTTTGG TGGTACTGGC AATGGCTGTC TTTTTGGGAC GTTTGATTCT ACGCGAAAAA CAGGGGAAGC CAATGTTTGC CCCAGAAAAG ATACGCAAGG GCGAACAAGC TTGA
|
Protein sequence | MADKNCQALT LDFIAVEDDA EMIAVEDEIR RDLKEIGVTV NTRFLSREDY IEAELNGDYN MLFTRTWGAP YDPHSYFNSW AVPSHVEYTA IDTLEAPLSR ELLLKKIENV QKELDEMQIQ AQWREILNDV HQQAIFLPLW GTRIPYVINR RLSGFTPSDQ AFTYPLSSIR ISSGSANITI APGSGGSLFT SVGPLNPHQY FPNQIFASDW IYEGLVNYGQ DGEIVPSLAS EWTTERTAEG QRVIFQLREG VKFHDGSDWN CTVAKLNFDH IFSDTVRERH SSFGATANLK SWTCNQNGEF VLETSAPFYP LLQELTYSRP FVFASASSFA AGIDSDPETQ NSCESGDFGS KWDYLEEFVT CLGLSAPIGT GPFKFADREY LPGTNETMDA KVTFARHEDY WGGLPAIEFL EIIHFEDTDA VEAALFDGQL DMVLGSGPLS AKQVQNIKFV HSDKFDVRHS AVLQNALVVL NSGKAPTDDI QTRQAIIHAV NKAIFIEDEF AGLEQAVSQL LPLTAPYSNV DLNPKWNYDL EKARFLNCPA DMNGSSEDSL SGGAIGGIVA AILVVLAMAV FLGRLILREK QGKPMFAPEK IRKGEQA
|
| |