Gene Information |
Locus tag | PHATRDRAFT_44302 |
Symbol | |
ID | 7197965 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 166686 |
End bp | 168777 |
Gene Length | 2092 bp |
Protein Length | 538 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178177 |
Protein GI | 219114763 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAACCCAAA AAGATTTCTC CTGCTCGAGT CATCATCGCG TGTGTTCTCT CACGATCGCC TCTTGGTGTG CGTTTGCGTT CGTGTCTTTT TCGTGTGTGT TCCCCACTGG AACTACGTAC AGTAACGCCT CGTCGTTTGC TATTATTCAC AGTCCGTTTG GTGTTTTGTT CATTTTCATT ACAGTAGTGG TACCTACTTT ACCTAGTGTG AGAGTGGAGG ATGATGATGT CCAAGCCGGA ACCGCAAGCC GCGGCAAAAA AGAAATCGGC AGGCGATGCC GTTGCGGATT TGGAACGCCG CCTCGCCCAA TTGGACGTTA CGGAAGCTCC ACCCGTTAGT ACCAGTACTG CTAGTGTCAC GAGTACGTTG GATGATCCTC CGGCCTTTGC GGCACCGCCG TCGGTGGACG AAGCCGTTCC CGTCAAGGGG GGCAAGAATG CTCTTTTGGT ACGTACATCC AACAACGATA GTTCGTATCC ACGAAACAGC CTCTGATGGG AAGCGACCGG ACTTGTTGGA CGCTGCGTCC AACAGTGATC GGTTTCACTG TCACTATCTG TTGGCATTCG TATGCTTTCC ATGCTTTTAC TTACTCGTTC GTTCACACGC TCGCTCTCTT TGTACTCAAC AACAATACTT CTACTACTGC TGCTCCTGTT ATTGTTACTA CGACAACAGG CTCGCATCAT GGCGGCACAA GAACGCGCAA AACAGGCTCA ATCCACGGTG AAAGCGCCGC CGCCTCCTCC CCCAGAGGAT TTGCTCGACC TCTCGGACGA CGTTCCGGCT CCCCCACCGT CCTTTCAATC GTACGAACAA ACCGCCGTTG CCGCACCGAC GCGGTCACCG CCAGCCCCGT CTTACAATTG GGATCCCCAC GCACCAACCC CCACCACCAC TAGTACGACT TCCGCTCCAC CCACGTGGAT TCCCGACACC TCCGCGCCTT CGGCTCCAGC CTTGGAAGAT CTGTTGGCCT TGGAACCCGC TACGCATTCG ACAAACGTGG GGACGGACGA TCACAATGCC TACTTTCTCG ATATGCAACC CGTCGCCCCA CCGTCCCAAG GTCACGCCGC GGAACCTTCG GTGGAAGAAG TCTTGGCGGC GCTCGAAGGT CTCACGGAAG AAGAAAAACA AGCCCTCCTG GCCGAACAGG CCAAGATCAT GGCGTCGATT GAGCATTCCC AAACATCCGC GGCGGCTGCC AAGGCTGACG CCTTTGAGAG TCGCAGTTTT TCCACGGCCG TTCAGAGCGT ACACCGTCGC CCGCAACAAC AACAACAACA ACAACAGCCA ACAAACGCCG CTCACCGCTC GGTTACCATT GACGGGCAGA GCGTTGCACT GCACGGACAG GAACAGACCC GGGCCGCTAT TCAGGACGGC ACCGCCGTAC TCGTACAGTG TCTTTCTTGC CAAAACATGA TGCAAGTGAC CAAGGCTGCT GCACTCATGT TCTGTCCCGT CTGTCAGGTA GTCTCGCCCG TAGAGCACCT GGATGGTATG GACGCCGCCC AAGCCGCACA GCTGCAGGCC GACATGGAAT TGGCCGAGCA ATTGCAAAAG GAGGAATACA AGGAAGCCGC GGCGGATCGA CAAGAGCGGA CTTCGCGACC AACGTCCACG CCGGACAAGA AAAAGGATCA ATCCTGGGGT GAGTGGTTGG GTTTGAGTGC CACGGCCGGT ACCACCAGCT CCACGTCTTC ACCGGAACGT CCCATGTCGT TCGCGCAAAA ACCAACCGAA CGGGGTGCCA TTGGAGTCGC GCGTCCCCCC GGTGCCGTCG CCGAAACCGG 
GACGGCCCAG TACGGATCCC GCTCGTACGA TGATGACGTT TGGGCGTCGC CGGGTGGCGG CGGTGCCCGG GTAGCGGAAA CGAAACCCTT GTTCAACTGC GTCGCCGATT CGGTCTACTC GGCTGCGTCG ACCTTCACCA CGGCCATGCA CGCCACCACG CTGTCCGAAG ACGACGAAGG CAACGTCCAC GGCGTTGACT CGTCGTCGCT CTTGGCCATG CCCGGTGTTT CGCGGGAATC GAATTACAAA CAGATGGATG GAAACTAACT GAAAAACGAA ATATAACTCT TT
|
Protein sequence | MMMSKPEPQA AAKKKSAGDA VADLERRLAQ LDVTEAPPVS TSTASVTSTL DDPPAFAAPP SVDEAVPVKG GKNALLARIM AAQERAKQAQ STVKAPPPPP PEDLLDLSDD VPAPPPSFQS YEQTAVAAPT RSPPAPSYNW DPHAPTPTTT STTSAPPTWI PDTSAPSAPA LEDLLALEPA THSTNVGTDD HNAYFLDMQP VAPPSQGHAA EPSVEEVLAA LEGLTEEEKQ ALLAEQAKIM ASIEHSQTSA AAAKADAFES RSFSTAVQSV HRRPQQQQQQ QQPTNAAHRS VTIDGQSVAL HGQEQTRAAI QDGTAVLVQC LSCQNMMQVT KAAALMFCPV CQVVSPVEHL DGMDAAQAAQ LQADMELAEQ LQKEEYKEAA ADRQERTSRP TSTPDKKKDQ SWGEWLGLSA TAGTTSSTSS PERPMSFAQK PTERGAIGVA RPPGAVAETG TAQYGSRSYD DDVWASPGGG GARVAETKPL FNCVADSVYS AASTFTTAMH ATTLSEDDEG NVHGVDSSSL LAMPGVSRES NYKQMDGN
|
| |
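The Start bp, End bp, Gene Length, and GC content fields above are related by simple arithmetic. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive (consistent with 168777 - 166686 + 1 = 2092) and that the sequence field is split into whitespace-separated blocks of 10 bases. The helper names `gene_length` and `gc_percent` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the record above.
    print(gene_length(166686, 168777))  # 2092, matching the Gene Length field
    # Paste the full Gene sequence field here to compare with the reported GC content.
    print(gc_percent("CGAACCCAAA AAGATTTCTC"))
```

Applied to the full Gene sequence field, `gc_percent` should give a value that can be compared with the reported 57%, though the record does not state exactly which region that figure was computed over.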