Gene Information |
Locus tag | PHATRDRAFT_50493 |
Symbol | |
ID | 7199281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 204946 |
End bp | 209022 |
Gene Length | 4077 bp |
Protein Length | 1358 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185446 |
Protein GI | 219130592 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
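The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the unspliced CDS ends with a single stop codon (the listed gene sequence ends in TAG). The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete, unspliced CDS: codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(204946, 209022)
print(length, protein_length_aa(length))  # prints "4077 1358"
```

The result matches the Gene Length (4077 bp) and Protein Length (1358 aa) fields.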
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCGG GACACGCAAA GGCCCCTCGT GCCAATCCAG CCTGGCCCGC GGATTCCAAC GGCGCAACTG TCGTGAAGTC GACGACGACA ACAACGACCA ACACCCCTCC TCCATCGGTC ACCGAAGGAA CGCCCCGGAA GAAGGCAACG ACGAATAAGC TCCCCCGCCG GACCGACACG GCCGACGCCC CGACTCCGGC CTCGTCGTCC GCCACCCCAG CGACTCTGGA ACCCACCACC AACGGCCACA GCCACCACGG AACTGCGGCA ACCAACGCGA ACGGAAACAA CGCCACCAAA CCCAGCCGGA AGCGAAACTC GTCCCTCAGT GGTTCCAAAA CCTACCTACA GATCATTCAC GAAGCCATCC TCGCTCTCGG TGATCGCACC GGCTCCTCCA TTCCGGCCTT GACCAAATGG ATCCTTCAAG AATACCCGCA ACTCGACGGG CCACAGTTTA AAAATAGAAT CCTGCAAGCC GTCAAGTCGG GAACCAAAGC CGAGCGCTTC CAAAAGGTCA AATGCAGTTA CAAAATTGCC GCCGTCTTTA AAGAAAAGGA ACGCCAAGCC GCCCGCAAGA AAAAGAACGC CGTTGCCGCC GCGGCCGCTG CCGCCGCCAA GAAGAAAAAG AAAGCCGCCA CGCCCCCCAC CCTCGCACAG ACGCAGGCCG CTCACGAACA AAAACTGGCC CAGCTACAGA AATCCTTGTC CCCCGAAGCC CTCGCCCGCA AAAAGGTTGA TCTCGCCCGT CAGGAAGAAG CCACGCGACA ACGACTCCAA GCCGAAAAGG TTGCCAAAGA ACGCGCCGAT CGTCTCCGCC GACGGCGCTT TCCCATGGAG GATACCCGCT TGCATCAGGA AGACGCCGAA TTGCACGTCA AACCACCGGC GGATGTCTTG GCTCGACCGT ATTTGCCCTA CTTTTGGTAT ATGACGACTG ATCTACACGA TCCATCCCGC CACGGCAAAA CATCCTCGCA AATTTTACAA GCGTCCAAAG TAGACGGACT CGACACGGGG AATCACGGTT TGGTGCCGGA TTTGTTGGGC GTCTACCACT TTTTTCGAGG GGATGTACAA TTTAGGATTC CGGACGAACG GGAGCTCGTG CCCCCATTTT CGCTGGCGCA ACTCGTGTTT GCGACGGAAC AGATTCTGAA CGGTAACGCC AAGCGATCGC GCATCGTTCC ACCCTTACTG AGTTCCTTAT TTTGCACTTG CTTGCAAATT TTGTGTAGAG CTCCGGAAGA CACTGCCGAA AAAACGTCAC AAACGCAACT CCAAAAGGAC CTGCACCGGT ACCTGGCTCC CGCGCTGACT CCGGCGAGCT GGGCCGACGT ACTCTACCTG TACATGGACG CCATGGAAAG ATTTTACGCG ACGGATGCTT CACGAGACCC GAACGTTTTG CCACCGCTGC AGATTGACGT CGAATACTTG TTGGCCGTCC GCGACGACAT TGTCGTGCCC ATGACACCGG CGACGACCGT AAAGGGCAAA AGTAGTGAAA CGAATTTGAA TCCTTTGCCC GGCGGTTACT TTGCATATCT GGGGGACCCG CGCGGAATCT TGTTTCGAGC CTTTAGCAAA CTGGGTCGAC AGGATCCTTG GTTGTTGACG GCGGAAGAAT TGATAGTCCT GCTACGGGTT CTGACAGATG ATATCCTGGC GTCCCACCCT GCCATTAGCC AAGACGTGGC CGCTCGGGAG GAAGAAATGC TCGCGTTGTC CAAGGCGAAG CGTGCGGCGG ATACTAAACT TCGCAAAGTA CGGCTGGCCT TTGAAGGCCC CAAGAAACCG 
GCGACTGCTA CTGTGAAAAA GCCGGAAGAA AACAAGGAAG TGACCAAGGA CGAAAAGACG TCCGAGACGG AGAATCAGAA CGGCAACGAA GACGAGAAAG AAGAAATTCT GTTCAAGCCG ACGGCTACGC AGAAGCAATT GGAATCGGCT GTGAAAGCCC AGCAAAAAGC GAGCGAAGCT TACGAAAAGG GTATTCGCAA ACTGACGGCA CGAACGGAGC CCATAGGTTA TGACCGCAAT TTCAACGCTA TCTATTGCTT TCGACACGAT CCTGAGGTCT TGTACGTGGA AGATTTACGA CAGCCTTCAA CTGTAGCGAA CCATTTGCCA CTTGACATGC AATTTAAGCG TCGTTCATGG CACTTGATTG AGACTACAGC CTTGTTTGAC TCATTTACGG GGAGTCTAGA CATTCGAGGA AGAAGAGAGC ATGACTTGTA CGAAGAACTC GTTGGCCCAC AGGGAGCTCA GCAGTCGCTT CGTCGTTTCT TGCATGACAA TATGAAAGAG CAAAATGAAG CAGTCGCACG AGTGCGAGAA CTGGAATCGC TGAAAAAACG ACTGGAGGTG GCTAGAGTCA AGTGCGACGA AGAACAAGGG CGTAGAAGCG GGCGTTTGGC AGGACAGGCG GGAGAAGAGC TGAGCTCTCT CGAGTTCCAA ATTGAGATTC TCGAAAGCAG GATTAACGGC ACGTCAGCAC CAGAGCCGCG TGATTACGAG GAGTTGACAG GTTTGACTCT GCTTCGCAAA TTTGATTCAA ACAGTGGCAT GGGAGCTCGT CGAACCCGTG AGCAGAAACA GCAAACCAAG ACCCACAACC TGCCGATTTT ACCTTGCAGC AAGATGTGCG GCACCGGCAA TATTGACGGT ACAGGATTGG TTGGCCTTTT GGTATCGGGA CTACTAGAGG TCGAGGAAAT TTGCGAAACT CTTGCCCCAT GGGAGCGCAC AGACACAACG CGAGGCGAGT GGGTAGCTCG TTTGGAGAAT GCTGTTCATA CATGGAATGC TGTCAGTCCC ATGGTCCTTG GTTTAGCGGA TGCGCCGCCG ATGAAAGTTA TTGGCTCACC GGCTCAGTCC AGCCCTCCAA ACGGGTCTTC GGGTGTACAA CGTAGCGCCA GACGCACATC TCTCGATAGC TTGGAGTCGG CTTCGAAGAA ACGCAAGGTT GAGACTCCTC CTCCAACGTC AACGACTTTT CATTCTGCTG CCAATATAGT TTCGATGATT CGCCAACCTC TGATTGATCT TGAAGCCCGT GTAGCGGCCA TAACAAATCT AGACGTGGCG AGCAAAGATG CGGATATCGC CGACGATAAC TTGTCAACGG ATGGGTCAGA AGACGATCAA GCCATCAAGG AAAAACTAGA GCGCGCTTGG AAAAGGCAAA TACACCGTCT ACGCAATACA CTGACCCACC GTTACGGTCA AATTCGTGAA TTCTTAGTTG CTGCGATTGC GGCAGCTCGC AAGGCCCATG TGCCAGAAGT AGTAGCCGAG CTCCGGGCGG CTCTCTTACA ATATCATCCT GGAGCCGCTT CCGAGTGCAA GTATGCAGCA ATCAAGGTCT TGCTCGCACA CGGCGATTAC GAGCCCGACG AAGACGAGGA AGAAGAAGAG CAATATGACG TCAGGGAGGA AGGAGACGAA GTGGAGATTC CATCGGTGAT TTGCGCAGAA GCTGCGATGC TCGCTAGCAG CCTAGATGGG TCCGAGGACG CCACGCGAGC GGATTGGATA GATTCCGTCA AGGCGTGCAA GACAATCTCG CGGTTAGCTT CATTGGTCGG GGCCTTTGTC AAGAATGCGC 
AAGATAAGAT GGGAAAGCTA GAAGACGAAC GGGACGACTT GTTGGCGGCT ATTAAGACAT GGGAGAAGGA GGAAGAACGC CGAGTCAAAA ATCGCGGCGG CAAAAAGCCA GTCGGTCGAC CCACAAAAGA TAGTGTTGGA CCCTCGGAAG TGTGGGCCAA CGTGCGTTTC TCGGACGAGA TTTGTATGGC CAAGGCGGAA AACTGGCCTT GGTGGCCAGC TCGCAAGTGT ACGCCAAAGG ACGGTTCTTT GGCTCGATCT TTGGCCGGTC TTGATCGCTC TTTGGTTGCG CTTATCGGTG AAATGGGAGG ACTCCGGGTG GTTAAAACTG AGAGTATTAA GGCTTTCTCG GGAACGCTGG TGGAGGACGA GGACCTTGGG CAGTACAACA AGAGCGTTCG ATCGCAGTTG GATGATTGTA TGGCCATGGG TCGTCGTATT GCGCGCGGGA TGGAAAAGAA GCGATAG
|
Protein sequence | MKSGHAKAPR ANPAWPADSN GATVVKSTTT TTTNTPPPSV TEGTPRKKAT TNKLPRRTDT ADAPTPASSS ATPATLEPTT NGHSHHGTAA TNANGNNATK PSRKRNSSLS GSKTYLQIIH EAILALGDRT GSSIPALTKW ILQEYPQLDG PQFKNRILQA VKSGTKAERF QKVKCSYKIA AVFKEKERQA ARKKKNAVAA AAAAAAKKKK KAATPPTLAQ TQAAHEQKLA QLQKSLSPEA LARKKVDLAR QEEATRQRLQ AEKVAKERAD RLRRRRFPME DTRLHQEDAE LHVKPPADVL ARPYLPYFWY MTTDLHDPSR HGKTSSQILQ ASKVDGLDTG NHGLVPDLLG VYHFFRGDVQ FRIPDERELV PPFSLAQLVF ATEQILNGNA KRSRIVPPLL SSLFCTCLQI LCRAPEDTAE KTSQTQLQKD LHRYLAPALT PASWADVLYL YMDAMERFYA TDASRDPNVL PPLQIDVEYL LAVRDDIVVP MTPATTVKGK SSETNLNPLP GGYFAYLGDP RGILFRAFSK LGRQDPWLLT AEELIVLLRV LTDDILASHP AISQDVAARE EEMLALSKAK RAADTKLRKV RLAFEGPKKP ATATVKKPEE NKEVTKDEKT SETENQNGNE DEKEEILFKP TATQKQLESA VKAQQKASEA YEKGIRKLTA RTEPIGYDRN FNAIYCFRHD PEVLYVEDLR QPSTVANHLP LDMQFKRRSW HLIETTALFD SFTGSLDIRG RREHDLYEEL VGPQGAQQSL RRFLHDNMKE QNEAVARVRE LESLKKRLEV ARVKCDEEQG RRSGRLAGQA GEELSSLEFQ IEILESRING TSAPEPRDYE ELTGLTLLRK FDSNSGMGAR RTREQKQQTK THNLPILPCS KMCGTGNIDG TGLVGLLVSG LLEVEEICET LAPWERTDTT RGEWVARLEN AVHTWNAVSP MVLGLADAPP MKVIGSPAQS SPPNGSSGVQ RSARRTSLDS LESASKKRKV ETPPPTSTTF HSAANIVSMI RQPLIDLEAR VAAITNLDVA SKDADIADDN LSTDGSEDDQ AIKEKLERAW KRQIHRLRNT LTHRYGQIRE FLVAAIAAAR KAHVPEVVAE LRAALLQYHP GAASECKYAA IKVLLAHGDY EPDEDEEEEE QYDVREEGDE VEIPSVICAE AAMLASSLDG SEDATRADWI DSVKACKTIS RLASLVGAFV KNAQDKMGKL EDERDDLLAA IKTWEKEEER RVKNRGGKKP VGRPTKDSVG PSEVWANVRF SDEICMAKAE NWPWWPARKC TPKDGSLARS LAGLDRSLVA LIGEMGGLRV VKTESIKAFS GTLVEDEDLG QYNKSVRSQL DDCMAMGRRI ARGMEKKR
|
| |
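The gene sequence is listed 5' to 3' on the coding strand (it begins with ATG although the gene lies on the minus strand), so it can be translated directly and compared with the protein sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is empty. The helper names `clean`, `translate`, and `gc_percent` are illustrative. It also shows how a GC percentage like the one in the GC content field could be computed from the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1, assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAATCGG GACACGCAAA GGCCCCTCGT"))  # prints "MKSGHAKAPR"
```

Passing the complete Gene sequence field to `translate` should reproduce the Protein sequence field if the standard-code assumption holds.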