Gene Information |
Locus tag | PHATRDRAFT_48175 |
Symbol | |
ID | 7203312 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 416176 |
End bp | 418244 |
Gene Length | 2069 bp |
Protein Length | 604 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182531 |
Protein GI | 219124481 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
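The length fields above can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported Gene Length of 2069 bp. It also estimates the coding length implied by the 604 aa protein, assuming one stop codon is counted. The helper names `span_length` and `coding_length` are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding bases implied by a protein length, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(416176, 418244)   # matches the reported Gene Length
cds_len = coding_length(604)             # coding bases for 604 aa plus a stop codon
non_coding = gene_len - cds_len          # bases in the span not accounted for by codons

print(gene_len, cds_len, non_coding)
```

Because the record marks the gene as spliced, the difference between the two lengths is plausibly intron sequence, but that interpretation is an inference from the fields above and is not stated by the record itself.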
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.696617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGAGA CGGAGAGAGA TGCACGAGTC AGTCGCGTTA TCGAGAGGGT ACGGCTACGA GAGGAAGTCT GTGCGAGACC AGTAGCGTAC GGTACGGTAC GGTACCGGAT CGATCGATCA CTCCACACCT GGGTTTGCGA TCGGTACCAA GCGACTGTAC TACCTATTCG GTCTCCCCGG CACCGTACGG ACCGCGACGG CGTCGGCACA TACAGACACG TAGACACTTT GACGTACACA GGTACAGACA TAGCCAGATA AATACAGACA CCCGTACACA TATACACACA TATACACACA GACAAACTCG GAGGGGAGGG GGGCCGGTTC CGCCACGCAG TGCACACGTG CACAACGCGC GTTGGGGTTT CTCCCGAGGC GTGCCGTTGC CCTTGCCCTC TCTGGGCGGG AAAAAGTCAC GGTGTCATCC TCACGGGTAC ACGACCTATG AATGTCAGAA CACCACCTTG TTGAGACATT ACCTCTCTCG TTGCCCTTTC TTGTTTCTCA TGGTGGTGTC GACCTCCTTG CTCCTCCGTT ACAATTTAAA CGCCCCAACG ACAACGACAA AGAGTCTTCT ATAGTAGTCT TTCTCAAGGT CCACCAACAC CAAGCACATT TTCAACACTG TATCGTTCGG TTCCTTCACT CGTCTTTCTG ATTTCCATCA TCCCCTCGGA TCCTACGGGG CCGTGCGGCT CTACGGGTTG ACGTCCATTC CGGCCAATCC AATCCAAACT GTTACACGCA CACTTTCTAT CCCCCCAGGC TTCCCTGATC ATGAGATCGT CCCAGCCCCT TTCACCCACC GGCTCTGGTT TCGCCTTGTA TCGCTCGCGC ACACGTCGCC ATCGTCTACC CGGACCGTAC CCTTGGGTGT CGAGAGAAAC TCGTCCAGGC GACCGCGACG GGGGAGCGTG GCACTCCGGC GACCAAGGTC GCTATCCTTT ACCACCATGT CGGCCTACAC GCGCACCGAT ACCCACGCAT CCGCCGGTTC CGTCGGCCCC TCCTCTCCAC ACTGTCAAAC CTGCACTTGT CACTTTCACG AACTCGACGA GGCGCCACAC CTTGTACCAT CACAACAATT ACCACAATCC GTAGTACCAT TGCCGCCTCC TTTACCGGAA CCAACCTATT CCGTACGTCG CCGTGTTCTA CCCGCTTCCC TCGTGCCGCT CAACAGCCGA CAAGGCACGC TCTGGCTCAC GGACTGTCTT ACACAAAACA CGGCCGCCTT GTATATTCCG CTTTCGGAGC ATTTTTGCAA TCAGTCGGAT CCCGCCTATT GCGGCGTCAC CACACTCCTC ATGGTCCTCA ACGTCTTTGC CATGGATCCA CACGTACGCT GGAAACACGG TTGGCGATAC TTTGGGAACG AAGACGTCCT CTTGTCCCAG TGTTGTTTGA CTCCGGAACG GATTCGACGG GCCGGTATTT CCATGCACGA ATTCGCGCAA CTCGCCTCCT GCCAAGGCGT GCGGGTCCGT ATGCAACGAC CGCAACCCCC ATCCGCGATA GAGACCGACT CGGGGTCCGT CCACACCAAC CCATCATTCG ACTTGGATGA CCTGCAACGC TTCCGATTGG ATATTCAACG GGTCTTGTCA GGCACCGCCG AGACCTCCGG CAATCCGGCC GCCGCGATGG AAACGAATGG CGTTATCGTA GTCAGTTTCA GCCGCGCCGT CCTGGGACAA ACGGGGGACG GCCATTTTTC TCCACTGGCT GCCTACCACG CCGCCACGGA CCAGGTTCTC GTCCTCGACG TGGCGCGCTT CAAGTACCCG CCGTACTGGG 
TGACCGTGAC GGACCTCTAC CGGTCCCTCT TGCCGCACGA TCCCGCCACG TCCCAATCGC GGGGATGGTA CGTTTTGCAA CCCCCGCTCC GGTCCGCCTC GTACCGCGGA AGCGCCGGTG GAGAAGACCG ACGTCCGGCC AAACTCGTAC CGCTCGTGGG TGAACCGCAC GGTGCTCGCA CACACGCCTG TCCCGTGCAG GTCATTAAAA CGGCGTATTG CCAAGTGGCG GAACACGAGC CGCGGAACAA TGATGCGAAA GGCCCGTAA
|
Protein sequence | MHETERDARV SRVIERVRLR EEVCARPVAY GTVRYRIDRS LHTWVCDRYQ ATVLPIRSPR HRTDRDGVGT YRHTNSEGRG AGSATQCTRA QRALGFLPRR AVALALSGRE KSFSRSTNTK HIFNTVSFGS FTRLSDFHHP LGSYGAVRLY GLTSIPANPI QTVTRTLSIP PGFPDHEIVP APFTHRLWFR LVSLAHTSPS STRTVPLGVE RNSSRRPRRG SVALRRPRSL SFTTMSAYTR TDTHASAGSV GPSSPHCQTC TCHFHELDEA PHLVPSQQLP QSVVPLPPPL PEPTYSVRRR VLPASLVPLN SRQGTLWLTD CLTQNTAALY IPLSEHFCNQ SDPAYCGVTT LLMVLNVFAM DPHVRWKHGW RYFGNEDVLL SQCCLTPERI RRAGISMHEF AQLASCQGVR VRMQRPQPPS AIETDSGSVH TNPSFDLDDL QRFRLDIQRV LSGTAETSGN PAAAMETNGV IVVSFSRAVL GQTGDGHFSP LAAYHAATDQ VLVLDVARFK YPPYWVTVTD LYRSLLPHDP ATSQSRGWYV LQPPLRSASY RGSAGGEDRR PAKLVPLVGE PHGARTHACP VQVIKTAYCQ VAEHEPRNND AKGP
|
| |
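The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any computation. The following is a minimal sketch, assuming plain A/C/G/T input, that strips the whitespace and computes a GC fraction. The function names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not expected to equal the GC content reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the Gene sequence field, used for illustration only.
example = "ATGCACGAGA CGGAGAGAGA"
cleaned = clean_sequence(example)

print(len(cleaned), round(gc_fraction(cleaned), 2))
```

Applying the same two functions to the full Gene sequence field would give a value to compare against the reported GC content.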