Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37111 |
Symbol | |
ID | 7202247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 189043 |
End bp | 190344 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181322 |
Protein GI | 219121956 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0416192 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGTGT CCCGAATGAG GGCCTCTTTG TTGCTTCTTG CACTACTGAC ATCATGCGAC TATGCTGAAT CCTTTCGACA TGCAGCACAA AATAAGATGA GGTTCAAACT GCCTTCCATA AAGACGTATG AAAGAAACGG ATTATCGGCA AAATTTGTGG ATGGCACACA CAATTCCTCG ACTGAAAGAG AGCATGCGTG CAACAAGGCT CTCAAAGTAG CACTGCTACA AAACCTGTCC GTAGATCTTG CAAAACTATC GACGATTCGC CCTGTATCGC CAGCTGCTGA TTTCAGCGCA CCGGCAGCTA TCATTTCCGC TGGATCCAGC TACACTCGCA TTTGGACACA CAGTACGTGG GAAAGCCATT CTCGCCCTCC CCACGTGCGA TATACAAACC ATGTTATCCG ATGGGGAGCC AGCTCTACCG CGCGCAAAAT TCTCCCCACG GTTCTGCTCG CTGCAGCCTG GGCTGCTTTG GTCGCGAGGC TGGCGCGATC GAATTTTTGG GTCTTAAGGT TCTTGACGGC GACGGAACCG TCCAAGGCCT TCGGATTTCT AGCAGCTCCG CTCGCATTGT TACTAACGCT TCGTGCGAAC GCCAGCATGC AAAGACTTTT GGAAGCTAGA TTATTATGGG GTCGCTTAAT CCTCCACACT CGATCGTTGG CCAGTGTTAT CAGGGTTTAC CTTTACCCTG CTTGCCCACA AGCGTCGACA TTAGCCATTC GACATATAGC CATGATGGGA TGGATCCTGA AGGCTACATT GCGTGGAGAA AGTTCCGAGT CGCAACAGGC TGTGTTACGG GTCATGCTCC CTGACGAACG GGATTTCCAA TGGCTTGCTT CGCATCCCAA AACGAGCGTC GCGGTGACAT ACAGATTACG ACAAATCTGT TCGCACATGT TAGAATCTTT GATCGATCGA TCTTCCTCTT CGGCAATAAA GTTTGTGATT GAAGATAAAA TCGGATCGTT GGAGGAGGTC GTTGGCGGGT GCGAACGGCT ATTTGGGAGT CCGATTCCAC CAACCTACAG TCGACACTTG AGTCGCGTTG TAGTTATGTG GGTTTTGCTC CTACCGATGT CTTTGCTCTC ATCTCCGGGG CTTTCCACAC TCGGAATTTC CATAGCGACC GCCGTCGGAA CCTATGTTCT GGTAGGCATT GACGAAGTTG GCATGGAAAT CGAGAATGTC TTCCAGATGC TACCCCTACA GCAATTGGCG GGTGCGGTAC AAAACGATGT GCGCGACCAA TTTATTCCCA AGCAGGGGGA AATGCCAAGG GTTATTTTGT AG
|
Protein sequence | MPVSRMRASL LLLALLTSCD YAESFRHAAQ NKMRFKLPSI KTYERNGLSA KFVDGTHNSS TEREHACNKA LKVALLQNLS VDLAKLSTIR PVSPAADFSA PAAIISAGSS YTRIWTHSTW ESHSRPPHVR YTNHVIRWGA SSTARKILPT VLLAAAWAAL VARLARSNFW VLRFLTATEP SKAFGFLAAP LALLLTLRAN ASMQRLLEAR LLWGRLILHT RSLASVIRVY LYPACPQAST LAIRHIAMMG WILKATLRGE SSESQQAVLR VMLPDERDFQ WLASHPKTSV AVTYRLRQIC SHMLESLIDR SSSSAIKFVI EDKIGSLEEV VGGCERLFGS PIPPTYSRHL SRVVVMWVLL LPMSLLSSPG LSTLGISIAT AVGTYVLVGI DEVGMEIENV FQMLPLQQLA GAVQNDVRDQ FIPKQGEMPR VIL
|
| |