Gene Information |
Locus tag | PHATRDRAFT_42944 |
Symbol | |
ID | 7196191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1592525 |
End bp | 1597584 |
Gene Length | 5060 bp |
Protein Length | 1422 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176813 |
Protein GI | 219110123 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
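The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported 5060 bp) and that the coding region ends with a single stop codon. The function names `span_length` and `coding_length` are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed, 1-based interval [start_bp, end_bp] (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues at 3 bases per codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values taken from the record above.
gene_len = span_length(1592525, 1597584)
cds_len = coding_length(1422)

print(gene_len)  # 5060, matching the "Gene Length" field
print(cds_len)   # 4269 bases for 1422 codons plus one stop codon
```

Under these assumptions the listed gene span (5060 bp) is longer than the 4269 bases needed for a 1422-residue protein. The record itself does not state what the remaining bases represent.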
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.237933 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGCCC CGCCGCTTGA CTTTGAAGCC CTTCTTTCCA GAGAATACCG GCCGGCACGA ATCGCTCGCT TAATTCCGCG AGAGGCCTAC CATCCAGAAA ATAATCAGAA GAAAAAGTCT GTTTCGATCA AGCATTCTAC GGCTGCGCTC GAGGACGCAA CGCGACGACG AGCCGAGGAG GAAACCTCAC GATTGGTATT GAGCCTTGTT ACGCAGACTT CTGCTTTTAT TCAGCATCTT CCTGAATCAG AGCGACGGGA TGTGTCGAGT CTGCTGACGA AGAGGCCTCC GGTGGTCTCC TCTTGGCCGG CGTACGAGAC CATCCCAGCA CGATTCCACG AGATTGAACT CGGCGACTGG GAATCGAAGG TTCGTTGGAA AATTGAAGAA GAACCAGAGG GCGAGTCCAA ATATTCCAAT CGAGACCCTA CGGATTTACT GAGACGGCCC CGAAATCCCT ACTTGGATAA CCTTGTATTA GACGAATCCA CAATTTGTTG GGACGGATCA CTCGAGAAAC TGCAGGAAAA GGCTCGAAAT ACTCCTCTGA TTTTAGAACT TGGCGTAGCT GGACAATCTG TAGCCCGTCA TGTGTATCAA AATACGGTTC TTTCTGCACA ACGCCCTACA CCTGCACTTA AGTCGGATGC CTATCAAATG CGGCGAGAAC GGGAATGGGC CAATCCAATT ACTTCCACAG CAGAAGTCTC TAAAGCTGGT TCGCTGCACG CTGATAAGGA CAAAATGGCG GCGTTGATCG AGGCTCGTCA AAAGCAACGA GCACAAATGG CCGAAGACAA GACGAATCGC GTGACAGAAG CCATGGGAAC GTTGGCTTTG GGTGGTGGAA AAGGACGGAC TATCACTTCC TCGCTGATGG GTCCAGGAGG AACTGAACGC AGCGGGCGGC CGTCCCGAGA TGTGGGTTCA TCGGGATTGC ATGAGGCCGA ATATATTGAG CAGCTCGATA TGATTAATAG CCATAGTCTC GTGCGCGATC TCTCCAAGGT TCTGTTACGA CAATATCATA GACCCAAGTT GCCCCTCAGT GTCGTTCGTC AAGACCTGTC CTGGCAATTC CAGATCCGCT TTGCTCCCAC CAGTAAAAAG ACGGAAGTTA CCGGCGCTTC AGGATCATAC CAGGCAATTA TGACAGGAGC TCACGCAGGT GCGATATCAA AGGCCAAGCT GCGCAGCGAA GCGGATCTGA GTCCAACAGA AGGCAAGCTT ATTTTGCTTG AGTACTGCGA AGAGCGCCCT CCAATCCAAC TAACAAAAGG CATGGCGAGC AAGATTGTCA ATTATTACCG TGGAGACAAG GCACATTGCC CTGTTTCGGC TGGTGGGGGT GACCGGCCCG CACGCAGAAA AAGAGCCGAG CCCATTGCTG GAGAGGCAGA CGCTCGATCG AGCCGTGCAG AACGACTTCC TCGGTTGGAA GGGCCAAGTC GGGAAACGTC GGTTTTGGAA TGGGTTGGCA AAGTTCCTAA GAAATCACAG AAGGAGCGTG CGGAACAAGA TGCAATAGAT ATCCTTCCGG AGGGTGTAAC TGAAAACTTA CATCCCAAGG TCCACGGGCC ATTTCTTGGC GAAGTCGAAG ACAGCACAAC AGTGACCGGA ATTATTACAA ACTTGTTCGT CGCGCCGATG TTTCTTCACG AACCTGAAAC AACTGACTTT TTGATGGTCC TCACACCACC TAGTGGAGCG GCAAGGCCCG GCCAGCGTGA GTCAATGAGC GTGATCCTTC GAGATTTGCC TACGAGCACT TTCACTGTTG GTCAAACAGA GCCTCGTGTG 
CGGGTCTTCG CGCCAAACAC CCAGGGTGAA AAGAACTTTG TAGGGCCTTT TGTTTCATAT CAAATTGCTA GAGCTCTCGC TCGTTCTCAA GGTCGAGAAG GACACGGTTT ACGATTTGAC GAGATCCAAG ATCGCGTACT GCCTAACCTT GAGTTACCGT CGAATGCGTT ACGGCCCCGC CTCAAACAAG TCGCTCTGTA CGACAAGAAT ACTCAGATCT GGACGACGAA ACAAATAGGG TTTGAAGAGT ACCCCGGAGT TGACGCCCTC GGCAGGACTA TTGCACCCGA AGGTGTTGCA GCTTTTGAGA GTGCTTGCGC AGCCAGTCGC CGACTGTCAG ACCTTGGAAT CCACCAACTT CTAGCTGGCT CACATACTGT TCTAAGCGTG GGCGTCATTA TGGTCTATAT TTCCGGACAG CTGAATGCAG CCAAAGACTT GTCAAGAAAA ATGAAGAAGC TAGCGGAATT GCGTCGCTCG AACAAGAGCA TCTCGGCTGT CCAAGTCGCT TTTTATGAAC AGGCGGCCGC GATCATTGAA TCTCATTTTA AGATCTTGCG GCAAAAACAT GAAATTGCAC AGTTTATATA TGAGCAGCTT CAACTTGCTC CATGGCATTT GACCGGTGAG TTTATCGATG TTCATAAAAA AGGCGACGGG ACTGGCATGA TGAAGCTAAC TGGTCTCGGC GACCCAAGCG GTCAAGGAGA AGGTTTTAGC TTTATCCGTG AAGCGGACTC GAAACCAAGC AAGTCCGTCG GGAATGCAGC TCTGAGTGCA GAGGTCAAAA AGATTACTGG GACAGAAGAT GATCTTCGAA AACTTACGAT GAAGCAAATG GCGAGCTTGC TTCGCTCATA TGGTATGACA CAGGAAAAGA TTGATACACT AAAGAGGTGG GATCGAGTGC ACGTCATCCG GGATCTTTCC ACGAAAGCCG CGAGCGATGG AATAGGTGAC GGCCTTGAAC GCTTTGCCCG CGGTGAAAAA ATGAAACTTT CGGAGCAGAA GCAGATGTAT CGGGATCGTG TCCGAGTCAT ATGGAGGCGA CAGATTGCTG CATTATCGAT GGATGACAAG GTCGCTGGAA GCACAGAAGG AGCGGCTATC GCTGACGGCG AGAACGAAGT TTCTGGAATG GCACAGCAAT CACAATCCAA TAAGCCGGAC AGCACCAGTA AGCTAGGCTC CGATTCAGAT TCATCAGATG ACGATGACGA TCTTGCAGCG GCGTTGGAAG ACGAAATGCT GGATCGATCG GAAGCGAATC AGCTCGTTGC GGAGCATACT GGCGGAGGAG AAGCTGACGG CGGTCTGGGA CAGCTGCGGG CGGCTGCACA GGACCACGAG ATGAATAAGG ATGCCCGCGA GCTTGCAGCA TTAAAGCGGC AGCGTGAGGA GGAAAGGGCA GTCCGCGAAG GTTTGCAGTC GAACAAGCCG AAGGTAGAAT CCTTCGATAC ACAAATGCGT TCAAACAGAA AGGTCATAAG AAAGAAGGTA GTAAAAACAC ATCCCGACGG CCACCAGACA ACAACCTTCA AATTCGTCCT CAGACCCGAC GAAGTCGGAA AGATCATGGC CCGGCTTCAG CAAGACAACA GCGAAGATCA CCGTCGAAAA AAAGAGTTTC AGTACGAAGC AAACTCAGAC GAAAAGCCTC CAGGTCAAGC TTTGTTCGAA GACGAAGACG ATTTTGAGTA TTCTTCCCGC GGGCGCTTTG CCGACAAACG CGGGGGGAAT CGTAAGCGGC GAGCCGGAGG TCGAGCAACT CCCCGGGGTA CGCTCCAGTT TGGAAAACTG AAAAGCAAAA 
TATCCAAAGA GGAACGGATG CGGAAGAGAA AACGGGAAGA GGAAGAATTA GAAGTTTATA CTGCGTCAGC GAAGCACAAA GGAACTAACA ACCGCAAGGA GCGCGGTTCT ATTCGAAACC GCCGACCACA TGTTATTTTC TCCGAAAAGT TGGAAGCAAT TCGGTCAGCC GTAGAAGCCC GTCCAGGAGC TTTACCGTTT GTGAAACCTG TTAATCGACG TCTTCTACCC AAGTACTACG AAGTTATCAG CGATCCCATC GACCTGCAGA CAATTCGAGA TAAGATCAAG CGGTACGAAT ACAGATCTGC CGACAACCTT GTTCGCGATT TTGACCTCAT GAAAAGCAAC GCCGTCAAAT TCAATGGTCA AACCAGCCCC ATCGCTCAGG AAGCGATCGC AATTCACGAG TTTGTTTCGA ATCAAATTGA ATCACACCGG TCCGAACTTA GCGCTCTCGA GACAGCGGTA CAGGATCAGA TGAATGGAAA GCCGAAAAAG AAAGTCAAGA AAGGCCTAAT GAAATCTAGT GGATCTGGAA ACACTGCAAG AATAGGAGGC ATATCAGTCA ACCTTGGAGA TTTTCAGGGA ATGCAATTCG AAGGGAACGA CTCAGATAGT GGGGATGAAG TTTCGTTTAC AGGGCTTTTG GATTTTTAGA GGGAGATTAA TGATCTTCTT AACATACTTT TCTGATGTTA AGGGTTCTAT TGCCATGTAC CTCGATACAT GACGACTTCC TCTCCTTCTT CAATATCCTC CGTTGCCACC ATGAACACGT CGAAACTGAC CCGATTCCGT TCGTAGACAG GCGAATACTT AGGGCGAATT CCCGACGGAT GATTGGGCAC CCATTTGTCT ACGTTTGCCT CCTCTCTTCC GTCTACACGC CGAATCAAAA CGGAACCGCC CAATTCGACG TAGTGCTGGG CACTGCCATC GGCCGCGCTT TCGTGCGCGT AAGATTCAAA AAATCCGAGC AAGTCCTGAA TCACTGCCAC ACGACCCCCG CCCACTTCTG TATTCTTTTT CAATCCTTCC AAGTTGCGGG TTGTGACTTG CAGTGAGCTG GCGAGATGCG CTGGCATTAC AAATGATCCT CTCGGAATGC TCGTAGTTGC GTACACTTTC GTCGAAATTA TCTTGCCACT TTCGTTTTTG TGAGTTTCAA CACGGAACGA ACTGTTCTCT TCATTTTCCA AATCAAAATC GTGCAATTCT GCCTGCATAT TCATGGTACG GTAGGCACAC TCAAACGGCA TTGGTTCACG ACGACAGTAG ACGGTCTCCC ATCCCTTCTT GGGCCACTGA TACGAGCGCT GGGTGGTCCC ATCGAAATAG GTCAAGGCCC GAACTTTGTT ATGCGTTCGA ACGATCCGAT CATAAATTTC GTAGTCAACC TGATCGCTAC GAGCGTACCA GCGGCTACGG CAAGTGACGC TCTTACAGAC
|
Protein sequence | MAAPPLDFEA LLSREYRPAR IARLIPREAY HPENNQKKKS VSIKHSTAAL EDATRRRAEE ETSRLVLSLV TQTSAFIQHL PESERRDVSS LLTKRPPVVS SWPAYETIPA RFHEIELGDW ESKVRWKIEE EPEGESKYSN RDPTDLLRRP RNPYLDNLVL DESTICWDGS LEKLQEKARN TPLILELGVA GQSVARHVYQ NTVLSAQRPT PALKSDAYQM RREREWANPI TSTAEVSKAG SLHADKDKMA ALIEARQKQR AQMAEDKTNR VTEAMGTLAL GGGKGRTITS SLMGPGGTER SGRPSRDVGS SGLHEAEYIE QLDMINSHSL VRDLSKVLLR QYHRPKLPLS VVRQDLSWQF QIRFAPTSKK TEVTGASGSY QAIMTGAHAG AISKAKLRSE ADLSPTEGKL ILLEYCEERP PIQLTKGMAS KIVNYYRGDK AHCPVSAGGG DRPARRKRAE PIAGEADARS SRAERLPRLE GPSRETSVLE WVGKVPKKSQ KERAEQDAID ILPEGVTENL HPKVHGPFLG EVEDSTTVTG IITNLFVAPM FLHEPETTDF LMVLTPPSGA ARPGQRESMS VILRDLPTST FTVGQTEPRV RVFAPNTQGE KNFVGPFVSY QIARALARSQ GREGHGLRFD EIQDRVLPNL ELPSNALRPR LKQVALYDKN TQIWTTKQIG FEEYPGVDAL GRTIAPEGVA AFESACAASR RLSDLGIHQL LAGSHTVLSV GVIMVYISGQ LNAAKDLSRK MKKLAELRRS NKSISAVQVA FYEQAAAIIE SHFKILRQKH EIAQFIYEQL QLAPWHLTGE FIDVHKKGDG TGMMKLTGLG DPSGQGEGFS FIREADSKPS KSVGNAALSA EVKKITGTED DLRKLTMKQM ASLLRSYGMT QEKIDTLKRW DRVHVIRDLS TKAASDGIGD GLERFARGEK MKLSEQKQMY RDRVRVIWRR QIAALSMDDK VAGSTEGAAI ADGENEVSGM AQQSQSNKPD STSKLGSDSD SSDDDDDLAA ALEDEMLDRS EANQLVAEHT GGGEADGGLG QLRAAAQDHE MNKDARELAA LKRQREEERA VREGLQSNKP KVESFDTQMR SNRKVIRKKV VKTHPDGHQT TTFKFVLRPD EVGKIMARLQ QDNSEDHRRK KEFQYEANSD EKPPGQALFE DEDDFEYSSR GRFADKRGGN RKRRAGGRAT PRGTLQFGKL KSKISKEERM RKRKREEEEL EVYTASAKHK GTNNRKERGS IRNRRPHVIF SEKLEAIRSA VEARPGALPF VKPVNRRLLP KYYEVISDPI DLQTIRDKIK RYEYRSADNL VRDFDLMKSN AVKFNGQTSP IAQEAIAIHE FVSNQIESHR SELSALETAV QDQMNGKPKK KVKKGLMKSS GSGNTARIGG ISVNLGDFQG MQFEGNDSDS GDEVSFTGLL DF