Gene Information |
Locus tag | PHATRDRAFT_50465 |
Symbol | |
ID | 7199315 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 140844 |
End bp | 145029 |
Gene Length | 4186 bp |
Protein Length | 989 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185386 |
Protein GI | 219130467 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
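The Gene Length above is consistent with the Start bp and End bp values if the coordinates are read as 1-based and inclusive. The record does not state this convention, so it is an assumption here. A minimal sketch of that arithmetic follows; the function name `gene_length` is hypothetical and is not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp taken from the record above.
print(gene_length(140844, 145029))  # 4186
```

Under the same assumption, a 989 aa protein needs at least 3 × (989 + 1) = 2970 bp of coding sequence including the stop codon. That is shorter than the 4186 bp gene, as expected for a spliced gene.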
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGTCTCGTG GTAAATCCGA CACTCGTGGA CCGTTGGTCT CCTTGACAGC CGTTCGTATT GTTTCGGGTC TTGTTAGAGT ACGAGTGGAG CATTTGATTG CCATTGTATT TGCTTGAGTC ATTGATTTGC CTTGGAAAGC AGCATCCCCA AATACACAAC GCATTGCATT GAATTGCATT GCATTCGTCG TTGTCGTCAT GAAGCGTGCG TTGGCGGAAC GTGCCAAACA ATCATTGCCG GCCGGAGACT CCTCGACTCG GTCGGGCAAC AGTCGGACTC CGTCACCAAC TCAACCTCCC GGCAAACTCG CCCGCTACGA AGGCAACCAT GCGCATCACC ACCACCGCAA CAACAACAAC AACAACAACA ACAGTAACCA CAACCTCCAC CACAGTAACC ATCACGACGT CCCACACGAA TCCGACCCCA CCACCATTGG TGTGCAGAAT TCCGCACTCG CTACACCCCA TCCGAACCCC GTCGTTCCGG AATTTTCCAG CCGTACACAG ACACCACCAT TTTCCACAAC CACAATAGCA CTCCCCACGC CACCGATGGA TTTATCGTCG GGTACGTTGC ACCGTACACG TTCCCCGTCC CCCCGTCCGT CCACTACGGG ATCGTCGTCC ACGCACGGGA TCGCAACCCT CAAACGTCAA CGACTACCGG CATCCCGTCA CGCCGACCAG GACGACGGTG ACGACGATCA CACCGGATCC GCCTTTTACT TGCGGCAACA AAATAAAGCC TTGGCAACGG AACTTCGCAC CTTGCAAGCG CTCACGCGCA CGTTGACCGT GGAACGCGAC CACCGACGCC GGATCTGTCA CCAAGCCGTA CAAGCACTCG ACAGTTTACA GGCAACCTGG ACGCAATTGG AAACATCGTT ACGGATCAAT GTCACAGGTA CAACCGTGTC TACACACCCC AACGGCTGGC AGACTGCGTT GTCCACGCCG TCGCTGACTG CGAGTGGTGA CGCTCCCCCC AGTACCTTGC CGTTGCCCCA GCATCCCGGA GCGGAAGAAG AACGCTCTCC CATTCTATCC GTGGAGTGGA CTAAGGCCTT GACACAAGCC TTGACTTCGC TCGGGACGAC GCCGTCCGCA GTCGACACTT TTCAAGCATC CGGGGACGAT TGGTCGACCG TGGCCACCAA CGTGGCCGCG CGGGCCGCTG TATTACAAGA AGCAATCCTC CAGAAACAAC AGTCACGGGA CGATTCCGGT ACGCCGATCT TGTCGCTCGC CACCGCCCCG CCCGAATTTG TTCACGAATT GGAACGCTAC CGCGCACAAA CCCAAACGCT GCAAGCTCAA ATTGCCGAAC TCGCCGACGC TAGGGAGCGC GTAACCATCC GCGAACGACA AACCCGACGC GACATCTACC GACTGGCGTC GGGATTGTTG ACGTCCGCCC AATTGGTGGC CCGCTTTGAC TCTACGGACC AAGCCGTGGA CGACGATTTG GATTTGGCGG CCCTCCAAGC GTCCGTGCGC GCCGAAACAC GACCCGCGAG TCCACCGCCG CCGACTCCGA CGGACGCTCC ATCCACTGGT ATTTCCGACG CGCAAATGGC AGCTCTGCAA GCGCAACTAC AAGACGCCCG GCAGCAAGTG GCCGCCCGCG AAACGTCGCT GCAAGAGGTA AGTTACGGTG CAAGTTTCCT CGCGACGTGT GTTGTTACAA GGCGTAGACA ACGACAAGCA GGTAGTCTCA CGATCTCACG GTGTTAATCT ATGCTATTCT AGTTGACGAC GAAATGGCAA GATGCGGAAT TGCGCGTTAA CGCATTATCG 
ACCAAACAGC TCACCGAGAC GGATACGCGG CAATTAGAAT CCCTGGCGTC GACTCTGGAA AAGTACCGGG TAGTTGAGTC CGAATCTCGG TCGTTGGAGG CCCAGATTGT GCGTTTGCGA TCGGAATGGG CGCAAGCACG AGGCAACGAC GAAGCCGCGC GGCAAAGCAT GGAGGATCTC CAAAGCAAAC ACCGTAAACG GTGGGCGGAA CTCGCAAGCT TGCTCGCGAC GGGCGCGCCA ACCAACGAGG ACGTCAAATC CCCTGAAGTG GCCGATCCCA TTGCTGAAGC CGATTTTGAG ACTCTACGAG ATATTTTGAC CGGGTCCCGC ACAATAGTGG AATTGCGACA TAAACTCAAC CAAGCACTGG GACACGTGCG GCAGGTGGAA ACGGTTCGTG ACAATCTGAG AGATGCGTTG GTGATGAACG AAACATTGAA ACAAAAAGTG GACGAGTACA GGGCGAAAGC CAATGCCGCT ATTGCTTCCC AAGCAACCAA GTCGCCGCCA TCGTCCCGAC ACGGCGAAGC CGGGATTTCT TCCGCCGCCA AGGAAAAGGA ATCGGAAAAG CCTGCGACGG AAAAACCATC GTCGTCGTCA AATCACGACA AATCCGACAA GATGCATCGG GAATTCCGTC GTATGCGCAA GGATTTGGCA ACCCTTACGA CAAGTAAAGA TGCCGCCAAG GCAAAACTGG AACGTTCCGA GCGGGAAAAG GAAACTCTTT TGGATGCAAA TAGCAGGTTT CTTCAACAGA TTGCGGAAAA AGATGAAATG AACGCCAAGT CGCTGTCTAC AATTCTTCAT TTGAAGCAGA TGACGGAGCA ACTGTCGTCC GAAAGGGACA TTTTGGAGCA GCAGGTGAAG AGCGCAAGTC AATTGGCATT GGCAGCAAGG CTGGCAACAA ACGCCAAGGA ACGTGTGTCC GAGGAACTCG TCAAAGAGCG CCTGACCTTG GACAAGCGCG TCAACGAATT AGAAATACAG CTAGCGTTGC TGAACAAAGA CCTGGCGCAG AAGACTGTGG AATGTTCGGA AGCAACAGGA AGAATGTCCA TCACCAAAGC AGAGCTCGAG AAGGTGTTGG CCAGGAACAA TGAACTCGTT GAAGAGGCTG AGAAGAGAGA GACTGACATT CGGGGACTGG TAGATAGCGC CAACAAAGCC GAGAGAGAGG CTCGAGAAGC AAAAGGGAAA CTCGATAATT TGACACAACA GTCCGGCGGA GATCTGAGCG CTGCATCTTC CTCGTCCACT GTAAATCAAT TGAATACCCA AATTTCTGTC CTGAAATCTC GATTGGCGTG TCCAGTTTGT CACTACCGAG ACAAGGAATG CATCATCATG CGCTGTCGAC ACATGCATTG CAAACAGTGC GTTGAGGAAC GTATTTCGAA CCGAAGCCGG AAGTGTCCAA CTTGCAACAA CAAGTTCAGT GACAAGGACG TGGAAGATAT TTGGTTGAGC TAAACGAGCG TTGTGCTTTG TTTTTTGGAT GTTTGAAATG GAACGCTACA AATGATCACA TCACAGGAAT TCCCGATTCT GGGTTTATAT TTTGCCGCTC AGAAATTTGA CCTTAAAACC TCCTCTTCTT TTTCCTGATG CCGCTCAAGC TCTTCGCTTC CTTGTATTTG CCTGCCATGC TTTGTTGTGT CTCCCCTTCT AGTTGAAAAA GCGATCGCAT ACGCCGCGTG GCTTTGCCAA AACTATGTGC CTTACGAAGT CGCTCCGTTT CGCCTACAAC CCAGCCATAC ATCTCAGCCT GTCCACCTTT CCAGCCAGAG TCATCTCTCC TCAAGACCAA 
ATGCCTTTCT TCGTCAATTT GAAAGGCCTC GATGGTTTCG TACGTTTTTA TTGAAGAAAA GCGACACCAG CCAACTATTT TGGGAGCGTC TGGTCGGCTG AAAGAAAATT CGTTGCCGAT TGATGAGATT CCTGGCACCC CTGGAGGAGA TTGTAGAATT GCTATTCTTT TCCCTATCAG AGATGGTGGC AGGACGTATG AACGCGTTTC CACGATCTTT CGACCTTCAA GCAACGATTC GGCCCAGGGG GACTGCATTT CCAAGGCAAA ATCAAATTGG GACCAACCGC GCCATAGTGG TGGCTTTGTA TGGGCGTCGT CGGTATCACG CTGAAGAGGC GGAGATGACG TTGGCAACGC TTTCAGGTCT TTTGATTTGT ATGCATACAG AGCTCTTCGT AAATCCTTGG AAGCAGTCCT TAGGTCTGCA GGCAATCCAG CCATTGGTGA AAATGCTCGC TCACCGCTTT GTTGAGCATC AATTGCAGCC AGGAACGAAA AAGGACACTG CATTGGACGA CATTCAACTC CCATACTGTG AAAAACATGG TCCTTTTCGC GCTTGATCCA AAGTTC
|
Protein sequence | MKRALAERAK QSLPAGDSST RSGNSRTPSP TQPPGKLARY EGNHAHHHHR NNNNNNNNSN HNLHHSNHHD VPHESDPTTI GVQNSALATP HPNPVVPEFS SRTQTPPFST TTIALPTPPM DLSSGTLHRT RSPSPRPSTT GSSSTHGIAT LKRQRLPASR HADQDDGDDD HTGSAFYLRQ QNKALATELR TLQALTRTLT VERDHRRRIC HQAVQALDSL QATWTQLETS LRINVTGTTV STHPNGWQTA LSTPSLTASG DAPPSTLPLP QHPGAEEERS PILSVEWTKA LTQALTSLGT TPSAVDTFQA SGDDWSTVAT NVAARAAVLQ EAILQKQQSR DDSGTPILSL ATAPPEFVHE LERYRAQTQT LQAQIAELAD ARERVTIRER QTRRDIYRLA SGLLTSAQLV ARFDSTDQAV DDDLDLAALQ ASVRAETRPA SPPPPTPTDA PSTGISDAQM AALQAQLQDA RQQVAARETS LQELTTKWQD AELRVNALST KQLTETDTRQ LESLASTLEK YRVVESESRS LEAQIVRLRS EWAQARGNDE AARQSMEDLQ SKHRKRWAEL ASLLATGAPT NEDVKSPEVA DPIAEADFET LRDILTGSRT IVELRHKLNQ ALGHVRQVET VRDNLRDALV MNETLKQKVD EYRAKANAAI ASQATKSPPS SRHGEAGISS AAKEKESEKP ATEKPSSSSN HDKSDKMHRE FRRMRKDLAT LTTSKDAAKA KLERSEREKE TLLDANSRFL QQIAEKDEMN AKSLSTILHL KQMTEQLSSE RDILEQQVKS ASQLALAARL ATNAKERVSE ELVKERLTLD KRVNELEIQL ALLNKDLAQK TVECSEATGR MSITKAELEK VLARNNELVE EAEKRETDIR GLVDSANKAE REAREAKGKL DNLTQQSGGD LSAASSSSTV NQLNTQISVL KSRLACPVCH YRDKECIIMR CRHMHCKQCV EERISNRSRK CPTCNNKFSD KDVEDIWLS
|
| |
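The gene sequence above is displayed in space-separated blocks of 10 bases, so it has to be joined before any per-base statistic such as GC content is computed. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence shown above (not the whole gene).
first_blocks = "CTGTCTCGTG GTAAATCCGA CACTCGTGGA"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_percent(seq), 1))  # 30 53.3
```

Applying the same two functions to the complete gene sequence would give a value to compare with the GC content field in the Gene Information table.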