Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44577 |
Symbol | |
ID | 7197601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 953752 |
End bp | 955522 |
Gene Length | 1771 bp |
Protein Length | 431 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178332 |
Protein GI | 219115073 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000753938 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATGAGACTC ACTTTTGCGT TGGGAGTCGA TGCCCTCCGT TCGCGACAGT CGTTGACGCA TATATTCTTA TCGTCGCCAT TGAAGAGGTA CAATTGTATT TCTACGTATT CTTCAATTTC ATTTTGAGTT TCTTTACTTT CTCCGGGATA AACGCCAGAA ACGAAGATAT TGTTCAAGAC ATTTTCAGTA GTTTGAAAGA CAGATTCTGA ATTGATTGTC CTCTGCCTAA AGCAGGTGGG CAGTCATATC GATAAAACGG CTTCGCAACT AACTTTCTTT TTGCAGGCGG CGATGGGGCA ATGTTCCACG TTGCCTGCCG AGGGGAGAGG CTCTTCTTCA GTCGCTTCAA GATACGAAAA TATACATGAT CCTGAACAGC GGCATCGGCT CCGTTACGAT AGCCGGAAAG AAAGCATGGA TTTCGATCGA TTATCTTCGA CCGCGGAGTA TCCCCCACGT CGAAACCAGA ATAAGGGACA CAGTCAAACC CAAAAGAGCG AAACCTTTCG TCAAGACACA GAGCATCCTC AAGCGAGAGA TCCGGATCAC GAGCGCAACT TTCTCAAAAA AATCAATCCC ATGAATTGCG ACGCGCGGGA TGAGCCAGTA CCGATCCAAG CAACACCTCC CCCGCCGTCA AATGCCGTTC GGCAGCGATG TTACAAGTTA AACTTAGATT CCGAATTCAC AAATCTCTCA AGCTCACAGC GGCAGCAGCT ATGTTTGGGT CCATTTTCTG AGCCTCCTCC GATGCTCACG TACAGTTCGT CTGAAGATTC CAGCACCGGG GTCGACGATA CGATTGTGGC AATGAAGACA GCACAAATAT TCAGGGGTAT TACCATCGGC CGTGACGGAG CAATTTTATC TCAAAATGCG CGAGCAACAA GAAACAACAG AGGAAGCAAA CAAAAAAGGA GTGAGAAATC TCGTCAAGCA GCCAAGATTG ACAAAGCGAA AGATCTCGTG GAAGAAACGC TGGCTACGGG GAAAGCTCCC GACTCTGATG ATCCTGTCAA CATGGTCTCA CTGTTCATAG TTGGCCAATA CGACGATATG AAACATCTCG TCCGAGACGG CTCGAAAAAG CTTAGAGATG CCGATGGTCT CCCAGATGAA ACTCTGTATT CCACCAATCT TCATCGAAGT CCGTCAAAGG ATGCATTCTC CACAAGCAAG GAATCAGCCA TGATCAGTCC TCGAAAACGT GCGTCGCCCC ACTATATTTC TAGCCAACGA ACGGCTGCCA TGCACTGTTC ACATCAGATG CAGGTGACTG GGATCCCGTA TTCAGCGCCG CCAAAATTGA AATCACATCC ACGCGACACA CAGACTATGC GTCGGGAGGA ATACCCTGGA CCACGCATGC ACTCCTGCAA CAATTTTGAG GATTCCAGCC ACGTTCGTGG TACAAGCGAC GGAGCTGACT GGACTCATGC CTGGAACCTA TGGAACTGTG GGGTAGTGGG AGCGGCAAGT CCAGTGGACA CGCGTAGCCC AAGGGAACGC AAAACACCGA TTTTTGAAGG TCGTGACTCA CACTACAACA CTGTCCGTGA CAATGGCGGA ACACGTCGTG CGGGATAATT TTTCCTCACA GGCAAAGGCA AAATGGCTAC ACTATGAATT ACATCAACTC GGACAGACAG GCTGAGTACC ATAGATAATG CGGTATGCTG TTAAGCCGCG TGTCCTTTCG CTTATTATCA CACGCCTGTA TTTTATATCT CATGTTTCAT TAGTTACCGG TTAGAAAGGC ATTTTCCCGT C
|
Protein sequence | MGQCSTLPAE GRGSSSVASR YENIHDPEQR HRLRYDSRKE SMDFDRLSST AEYPPRRNQN KGHSQTQKSE TFRQDTEHPQ ARDPDHERNF LKKINPMNCD ARDEPVPIQA TPPPPSNAVR QRCYKLNLDS EFTNLSSSQR QQLCLGPFSE PPPMLTYSSS EDSSTGVDDT IVAMKTAQIF RGITIGRDGA ILSQNARATR NNRGSKQKRS EKSRQAAKID KAKDLVEETL ATGKAPDSDD PVNMVSLFIV GQYDDMKHLV RDGSKKLRDA DGLPDETLYS TNLHRSPSKD AFSTSKESAM ISPRKRASPH YISSQRTAAM HCSHQMQVTG IPYSAPPKLK SHPRDTQTMR REEYPGPRMH SCNNFEDSSH VRGTSDGADW THAWNLWNCG VVGAASPVDT RSPRERKTPI FEGRDSHYNT VRDNGGTRRA G
|
| |