Gene Information |
Locus tag | PHATRDRAFT_31816 |
Symbol | |
ID | 7196133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 998301 |
End bp | 999965 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176704 |
Protein GI | 219109902 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0623295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATTTG CTCAAGAAGC CGGTGACTCG ACAGAAATTG TCCGCGCCAT ATACGAAGCG TTGTCCACGT TGCCAGACGT GTCTCTGCTT TCCCAACCTG AAACAGACAA AGAGGAGGCC TCGCCACCAA GCATCACTTT TCAACCAGAG ATATGCAGAT CAACGTCGTC AAACCTGTCT ATCGGCAAAC GATACAATGC ACCTTGTGAG CAGCCGAGCC CTTTCGAAAT TTCTTTGTCT ACTTTCCCTG CTTTCCTTTC TTTTGTGTTG GACTTTCTTG GAGATCCGGT AGCGGTTTGT CGGTTCAAGA TGGTAAACCG TTTGTGTTTG AAATACGTGG ATGAGCATGA GCATGTCCTT ATGAGAGACG CCGTCCGTCT TGGAGGCCTG CAGATTAATG TACGTCCGAG CTTCTGGTTA TGGGTTACGC TTGAAAAAGA CAGCAGGGAA TCTTGCCATA GGAAAGAGGA CCCCGACGAC AATGTCATAA TGGGAGCGCG AAACGAATTG ACGCTCCTAG AGCGGAAAGG CAGGGAAGGC AAGTGGAACA ACGTGATCCA TAGAGATGTG TTACGGTCGT TTGGCAACTT ACCTCCGCAC AAATCTGGGG CCCGCCTACG TACGGACTCG ATTGTCCGTG CGCTTGCAAC TTGGGGCAGG AGTAGAATAC TGAAGAGTGG TATTCGGGGA TCCGGCGACC CGCCGCCGCC TTCGCATTCC TTTTATGAAG AAGATGATGA CGATGTCAGT TTGGCTCCGA CGGACACCGT TAGCGACTGG GGTGCCGTTT CCCCTGTCGG AAGCGTTACA GGCTCATTCT GCAGCACACG ACTAGACGGT CGTCATACCA AAAAGTTCGA GAGACAAGCT GAAGCCCAGG AGCTCGCGCT TGGTGGAAGC GCACTGTCAG ATATCGCAAA AGCTCGATTA CAGGAAAAGT TGAGCTTCAT TCTCCATGTA CTTGCAGCTA CTTATAGCAA TGTAGGATAT TGCCAGGGAA TGGATTATGT TGCAGCTCAT CTTCTTCGAA TCCTGGAAGA CACTATTCGT TGGAAGGCTG TCACAGGAAA CCTTCCTTCT GTAATTCAAT TCGAGCCGTC AAGCATGATT GGTCACGACA ATCCCGATCA ATCGCTATCG GATATGTATG CGGAAGTCGA TAAAAGTTCG GTGGTCGAAG AAACATGTTT TCGAGTCATG GATTCGTTCT TTACTACGTA CGGTCTTCGA CATTTCTACT GGCCAGAGTT GCGGTGTCTG AAGACGTGCT GCTTGGTCTT CGAGAAACTT GTCCAAATCA AACTCCCAGT TCTTGCTGAC CACTTCGAGC ATCACGAGCT GAACATCGGA CTGTTTGCAT TGGGATGGTT TCAGACACTA TTTTTGTACT TGCCATCTAT GCCTGCAGCA ACTGTTTGTC ACATGTGGGA CATTTGGTTG GTTGAGAGGA ACTTCAAAAT CTTCTTTAGA GTCGGCACTG CTATTTTGTT TCTTTGCCAG CCTGTTCTTT TGAACAATGA GCTCGAAGGT ATGATGGGAT ATCTCAATAC TTTCCCTGAT GCTACACTGT TAAGCCCAGA TATTCTCATT GCATGCGCAT TGCAAATCAA AGTTACAAAT CGAATGCTCA CACAGATTGA ATGCGACTTA TATCAAAAGT TGTAA
|
Protein sequence | MIFAQEAGDS TEIVRAIYEA LSTLPDVSLL SQPETDKEEA SPPSITFQPE ICRSTSSNLS IGKRYNAPCE QPSPFEISLS TFPAFLSFVL DFLGDPVAVC RFKMVNRLCL KYVDEHEHVL MRDAVRLGGL QINVRPSFWL WVTLEKDSRE SCHRKEDPDD NVIMGARNEL TLLERKGREG KWNNVIHRDV LRSFGNLPPH KSGARLRTDS IVRALATWGR SRILKSGIRG SGDPPPPSHS FYEEDDDDVS LAPTDTVSDW GAVSPVGSVT GSFCSTRLDG RHTKKFERQA EAQELALGGS ALSDIAKARL QEKLSFILHV LAATYSNVGY CQGMDYVAAH LLRILEDTIR WKAVTGNLPS VIQFEPSSMI GHDNPDQSLS DMYAEVDKSS VVEETCFRVM DSFFTTYGLR HFYWPELRCL KTCCLVFEKL VQIKLPVLAD HFEHHELNIG LFALGWFQTL FLYLPSMPAA TVCHMWDIWL VERNFKIFFR VGTAILFLCQ PVLLNNELEG MMGYLNTFPD ATLLSPDILI ACALQIKVTN RMLTQIECDL YQKL
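The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against each other with simple arithmetic. The sketch below is not part of the record. It assumes that coordinates are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the final codon is a stop codon (the gene sequence above ends in TAA). The function names `gene_length`, `protein_length`, and `strip_blocks` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split a displayed sequence into 10-letter blocks."""
    return "".join(seq.split())


if __name__ == "__main__":
    length = gene_length(998301, 999965)
    print("gene length (bp):", length)
    print("protein length (aa):", protein_length(length))
    print("first 20 bases:", strip_blocks("ATGATATTTG CTCAAGAAGC"))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1665 bp) and Protein Length (554 aa). To check the full sequences, pass the Gene sequence or Protein sequence field through `strip_blocks` and take `len()` of the result.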