Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42574 |
Symbol | |
ID | 7195955 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 474293 |
End bp | 476744 |
Gene Length | 2452 bp |
Protein Length | 705 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176590 |
Protein GI | 219109672 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGGAAATCC CCAAGCATCA CATAATAAGC AAGTCTCCCA GCAAAGAACT GTTCGAGGCA TTATCAATCA ATATGAAACT TGTTGTCTCT GCCTTGTTCT TCTTGTCCCT GGCGGAAGCC GATTTGTTCA ACTATGGCAA CACGGACACC ACTGTCGATG GCGAGAAAAG TTACGGAATG CCGAATTGGA ATCGAGTTGA ATGCAGCAAC GAAGACACCT GTGTGAGTGC GCAATTTTCT TAAAACATTT AACATATGGG TCTGTTGCTC ATTACTTGCT TTTTTTTTTG CCTTTACAGC GAGGCTGGCC CGACAAGTTT CCCTTTTTGG TAGGCTGGGA CGCGGGCAGA AACACCTGTC AGTGGTGCCC TGAAGGTGGA CCAGAGAATT GTGGTTTTCA CAGACAATCA CCGATTGATT TGAAGCGCGA CCGGGGGGTG ATCGGCGGAA GCAACGAGAA ATCATGCCCA GATTGGCATT GGATGGCGTA CAAGGACGAT ACTTGTGCAT GGAAAGATAT GGTGGATGAA TTTTCGATTG AACGTCACGG TTTGCGTCTG TCAGTCCCCA TTGAATCAGA CGGTGAAATT TCTTGTGTTG AAAATCGAAA TGGACAAGAT GTGAGGAGGT TTCCTCGTCT AGATTACAGC AAAGGTTTCC CGGATTGGTG GTGGCTCCAA TCGACCGACA TATCTGTACC CAGCCATCAT ACACAAGAGG GAAAGAGATA CGACGCCGAA GTCACACTTA AGCATTTCTA CGAGATTGAA CATGACAAAA ATCAGGTATG AAGCGCATCC AATACTTCTG TACTTAGTAG TACCGAAACC TAACGTGAAT CTTTTCTGGT CTTATCAGCT CGGCTATGTA ACTCTCTTTA TGGAAGCCTA CGACGATGCA GAGTCATGGC CGTTTTTAGA CAAACTGATT TGTGCCTGGA GAGAAAAGGA GGAGAAGGTC CGGCGTGAAT GTGGACTTCC GCCTTCAGCT CCGTACGGTA GATGTCGAAT CTTTTCTGAA CGTGGTCAGC AGCCAACTGA TGCTTGGAGA TTCGAAGCTG GTGAGCTACA GTCTTTTGGA GAGGGTGAAC CCGCCCCGGT TCCATCCGTT GCCCCCACCA AATCACCAAC TGCCTCGATC ACTACCTTGT CTCCTGTAAT TCCGACTACG TCACGTCCAA CCGAGTGGGT CTTACCTCCT ATCTTTGCAC CCCCTTCCGT CACCGTCACC ACCGACACGC CAACCATCAC TGAGCAAATT GTTTCGGCAC CTCCGCCTAT TGACTGTGAT GCATTTGATA TGAACTATGA TAGGCTTTGC TACTCCAACG ATCCTTGTTG TGAAACCCAG AGGTCAACTT CCGAGTATTG CTGGGACGCT TATGAGAACA TTTTCCCAGG AAATGCCATC TACTCGGCTT GTCACCATTG TTGTGGAGGA GAACGAAAGA GCGTTGGACC GCCTAGTCCC ATCAACCCCA AAATTCCAAA AACGTTACAA TGTTCGTCGT TGTCGAACGA ACCAAACCGC ATGTGCAATC TTGAAAGCTG TTGCGATGGT TCCGACTCCA GCTACTGTCG GGATGTTCAG AAGCAGTTCG GAGACAAAAT GACTGAAATC TGCGTACGTA TTGATAACAG TTGATGGAGT TGATTCGAAG TATTCCGCTG ACCGCGTGTT TCCTTTTCTT GCAGTGGTAT TGCTGCTCAG AGCCGAAGGA GTATGATTCC AACAGACGAA CGCTTCGCGG AACAAGTTTT GGCATGGAGG TCGGCGAAGA CATCAAAGGT GTGCAGTCTT TCCCAAGTGG TACTAAGTTT ATGGAGGTGG ACGGCCGCCG GCTTGTTTTG CGGAAAGAAA ATTTTGAAAG GGATGAAGAG AGCGAGGAAG ATTATTTCAA TCGGATCTAT TCAAATTACA AGCACCGGTC CTTACAAGCC ACTGTCCATC AAGAGGACTA TGCTGACATC GAGTACTGGC CGTACGAATG GATGCTGAAG GTAAATACCG AGTATTACTT CAGATACGAA GGTACTCAGG TAGTTGCCCC GTGCGCAGAA ACTGTCCATT GGAGAGCTAT GAAAGATCCT ATTAAAATTC ATCCTCGCCA GCTTGCCGAG TTGACAAGGC TTTTGAAAGA AAGAATCGCC CCCACAGGAG ATCCCAATTC TTGTCAATCG GACACTGCAG GCGTTTCCGG AAGTGACGGT TCACTCAAAC TGAACAGAGA CCTTCAGTAC TACCACAATG TCCATCGCAA GGTATTTTGC GAGTGCAAAG ACTGGCCCTC AAAGTTCGAA AGTGACAAGC AGTGGTGCCG CAATTGGCAA GATGACACCA ATTACGAGCG GTTTTACCAG CGTCCTTATA GTTTCGATTC AAATGGAGAG TGGTAAAGCA GGTTGTTTGA TAAAATACTC TATATAACGT TAAATGGAAT GATGTAAGCT TT
|
Protein sequence | MKLVVSALFF LSLAEADLFN YGNTDTTVDG EKSYGMPNWN RVECSNEDTC RGWPDKFPFL VGWDAGRNTC QWCPEGGPEN CGFHRQSPID LKRDRGVIGG SNEKSCPDWH WMAYKDDTCA WKDMVDEFSI ERHGLRLSVP IESDGEISCV ENRNGQDVRR FPRLDYSKGF PDWWWLQSTD ISVPSHHTQE GKRYDAEVTL KHFYEIEHDK NQLGYVTLFM EAYDDAESWP FLDKLICAWR EKEEKVRREC GLPPSAPYGR CRIFSERGQQ PTDAWRFEAG ELQSFGEGEP APVPSVAPTK SPTASITTLS PVIPTTSRPT EWVLPPIFAP PSVTVTTDTP TITEQIVSAP PPIDCDAFDM NYDRLCYSND PCCETQRSTS EYCWDAYENI FPGNAIYSAC HHCCGGERKS VGPPSPINPK IPKTLQCSSL SNEPNRMCNL ESCCDGSDSS YCRDLMELIR SIPLTACFLF LQWYCCSEPK EYDSNRRTLR GTSFGMEVGE DIKGVQSFPS GTKFMEVDGR RLVLRKENFE RDEESEEDYF NRIYSNYKHR SLQATVHQED YADIEYWPYE WMLKVNTEYY FRYEGTQVVA PCAETVHWRA MKDPIKIHPR QLAELTRLLK ERIAPTGDPN SCQSDTAGVS GSDGSLKLNR DLQYYHNVHR KVFCECKDWP SKFESDKQWC RNWQDDTNYE RFYQRPYSFD SNGEW
|
| |