Gene Information |
Locus tag | PHATRDRAFT_49830 |
Symbol | |
ID | 7198659 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 15762 |
End bp | 19814 |
Gene Length | 4053 bp |
Protein Length | 959 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184716 |
Protein GI | 219129060 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.253319 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCTGTGC GACAGGTAAA AGATACCAAC AGGAACGTGG GTGGTGATCA TGTTGAGGAT GGCGATGGAA CCGTTGAATT TGTGTCGGCC TTATTGTACG ACACGATGTA TTCGCGGGAC GATTATGGGA ATCCTTTGGA AGAGGATACA GGAAAGGATT ATACAAATGA ATGATAGGAT CCAGTCCCAC AAATTATGAG GGAGTAATCA AGGAGGGATG AATACTCGCT ACAAGATTCT GTGTGGGATT TATATTTAGA AGAAGAAGAC TCGTACATGT GAGTAAGATT ATCTGGAACA TGGTAACGGC AGTGCAGTGA ACTGGACCTC CCTAGCCTGA TGGACTAGTA GTAGAATAAT CTTCCTCTCA TTGTTTTAGA AAGTCATTGA TCCAAATATT GCATTAGCAT AGGTCCTCGT ATCCGCTGTC TAACCCGTGG GGAGCGTCGT AAGCGAGTTG AGTTTTACAG TCCCTAGGCT TGCTTAGGTG TATCGCAGGT CGTTCGTAAT AACGCCCAAC CTTCGTTTAC CAGACTTTGT CGAAACTGCG TTTTTCCTGT CTTTCGTACG TACATTGGTT GTACTACGTA ACAGATTAAC AGTCTTGACA GCACGTCAAA CCACTGCTGC TCATTCGCAC AGTCGTTACT GGTAAACACC CTTAAGTCTA CTTAAGGTCG TTCTCAATTT TCCACGAGCT CCGTGAATTT TGAGAACTAA TCGCTTACAT CCATTCGTGT TGTATAAGTG TCAGAGGTAT CACTTAAGGT GATATTTATC GGCTTGTGCG ACACATTGTT TGAAAGCCGC TTCCCTACTT CCTCACGCTA ACTGGTACCG AGGACCCACT AGTACTGCTA CAGGGTTTTT CGGTAAGTTA CTCGTGGCTT CCCTACTTCC TCGGAGTCGC CGGTGCCGAA ACCCCCTTGG AGTCTTCAGG GCTTGTCGGC AGGGCCGCTT CGTGACTACG GGTTGCTCTA GTCAGGTGAG GTAGAAATAC CAATAGGCCT GAAGACCTCA GCCCTGATTC CGGTTGAGAC GGACTTACTG CCCGGCTTAA AGCAGTGCCG GGGGGGGGGG GGGGGGCTAA TCAAACCCTC GTCAGTCCCC GAGGACACTA CCTACGGAGG TAGAAGTGAG TGTACCAGTG CTAGCTTTGT TCACTGGTGT AGGGTCATAC TAACGAAGTC CGCTATAGTA TACAGCGAAG GACCCGAACC CATAAGTAGT ATTTTTCTTG AAGAACAATC TATTGGTCCG GCTACCCAGC CCACAATGGT ACCTGCCACC CGGCAAATGA CAAGCGAGGC TGTCTATGCA CACATCTTGG ATAACATTCT CTTGTTGTCC CAAGAACACC CTATCCGTCT TAGCTTCCAA CAGCAGGGAT ACGAAACAGC CATCGACATT CTCTCTATCT TTGAGAACGA ACTCGATGCC CTCGGTTACA AGTCTCCCAC GCCTATTGAC GGTGTAGACA ACCCACGGAT CCCACTGCTC ATGGCGCATC GACAAATCCT GCGTCATTTT CTACGTTGGC AAGCATCCCT TGAACGGCAA AAGGGGAGCC CCATGAAGCC TTCGGAACTC ATTGCGTTGA ACAACGAAGA CTTTGTTCAG TATCGAGGAT CAGCACTTGG CCAGGTATCG ACAACCAATA GTCCCTCGAC TTTGGCCCCC ACCTCAACGA GCATCACCCC TAAAGTGCGA TCTGCTGCTG ACGACTTCAG GCGTGGCGTT AAATGTGACA AAACGCATTA TCCCGTACTC AAGGATGACA AGTACTGGGA TAACTTCTAC 
CGTTTGTTTG TGGTCACTGC AGTCTCGCAT AATGTTGAAA AGGTCCTTGA CCCAATGTAT GCTCTTACAG AGCCCTCTGA CAAAGCACTT TTCGAAGAGC AAAAGAAGTT TGTGTACTCT GCACTAGAAC ACACACTGCA GACAGATATG GGTAAGAACC TCGTTTGAGA ACCCAGCTTT GACTTCGATG CGCAAGAAGT GTTTTGTAAA GTTGTCAAAC ACTATACGGA ATCAGCAAGT GCGAAGATCA GTTCAGCCAC CACGCTTGGT TACCTAACTA CGGCGAAATA TGGATCCTCA TGGACAGGCA CAGCAGAAGG ATTTATCCTT CATTGGAAGA ACCACCTTTG TATCTACCAC AATACTGTAC CAATGGCTGA GCAACTCCCA AAGCAACTGT GTCTTAGTCT TTTGGAGAAT GCTGTCCACA ATGTACCGGA ACTTTGTCAA GTGAAGATCA CAGCCACTCT TGACCTAGCC AAAGGAGGTA ATCCTATTAG CTACGAGAGT TACCTTAGTC TTCTACTTGC TTCAGCGTCA CTCTACAATA AAGGAAACAA CTTCTCTAAT TCCCGTAGTC CCAAGGAAAA GCGTAGTATT CATTCTACTG ATCTTTCTTA CCATCCTACA GACTTTGACA CTGATCCGGA TGTTAACTAC AATATTGACT TGTCACCTTC CGTACTCTAT GAAGCCAATG CGCATGCCCG CAGGGAGAAT TGTACTAATA ACCGACATCA TAGTACGCCC ACTAACCGTG AACGACCTTA TATCCCTAGA GATATGTGGA ACCAGCTCTC AGATGATGCT AAGGCTATAC TTCAGGGATT GGCTGCCCCA GCTAAGGTTC TCCCGGTTAG TAATGGATTG GCACGTCCCT TGATGGCGCA TGTACATGCA ACTGGCAGTC GCGACGCAGC CCCGGACGAC AACAATGGAA CTCCCGTGGA TACCTTTCAT GATTGTGCAC CAGAAACGGA ACTCCTAGCA CATCTTTCTG ATCGGGTTGG ACGTATGGAT CCAGGCGACA TACGTAAGGT ACTTGCAGCG TCACGCAACA CCGTATCAAC GGGTTCCCTC CGTCCTCATG GTACCACCAA GTCACTTCAG TCAAATGTGT TACAGTACCA AGTCTCCCGA CACACAGTAC AAAATACCAC ATCTGCACTT GTCGATCGCG GTGCCAACGG TGGACTTGCG GGGAGTGATG TTACCGTACT GCACAAAACT GGACGTTCAG CCAACATAAC CGGTATAAAT GAACATACCT TGTCCAATCT AGACATTGTC ACGGCTGCCG GTCTTGTCGA ATCGCAACGA GGGCCCATTG TTGTTATCAT GCATCAATAC GCGCATCTTG GTAAAGGTAA AACCATCCAT TCCAGCGCCC AACTTGAACA TTGCCACAAT TGTGTTCAAG ATCGGTCTCG TACCGTGGGC GGCAACCAAC GTATTGTTAC CCTGGATGAT TACATCATCC CACTAAACAT ACGACAGGGT CTTCCGTACA TGGATATGCG CGCACCCACC AGTCATGAAC TTCATTCCCT TCCCCATGTT GTTCTTACTT CTGACGTTGA CTGGGACCCA TCAGTTCTCG ACAACGAAAT CGACATGAAG GCTGAATGGC ATACCGACAT CCATGACTTA CCTGGCATGC CATATATTGA GCCACGATTT GACAATCTCG GCCAATACAT GCACCGACAC GTTGCCATCT GTAATTCTCA ACGTCACGAT GCTCTCGACC GTATACTTAC GTGTAACAAG CATGCTGTCC AACGCAATGA ACATGATTAT GATGCCCTCC 
GTCCTTGCCT GGCATGGGTC TCCAGTGATA CCGTTCGCAA AACCATTTTT GCAACCACTC AATATGCGCG TGAGGTCTAT AATGCACCTC TACGTAAACA TTTCAAGTCT CGTTTCCCGG CTTTAAAGGT CCATTGACGC AATGAGGCCG TTGCCACAGA TACAATCTGG TCCGACACCC CTGCCGTCGA CAACGGAGCA AAGTATGCGC AACTTTTTGT AGGGAGACGT TCCCTGGTCA CTGACGTCTA CCCCATGAAA ACCGACAAAG AATTTGTCAA CGCCCTAGAG GATCATATCC GTTACCGTGG GGCCATGGAT AAGCTCATCA GCGACCGTGC TCAAGCTGAG ATCAGTAAAA AGGTTACTGA CATTACACGG GCTTACCACA TTGACCAGTG GCAAAGCGAA CCACATCATC AGCACCAGAA CTTCGCTGAA TGA
Protein sequence | MSVRQVKDTN RNVGGDHVED GDGTVEFVSA LLSFVITPNL RLPDFVETAF FLSFPLPYFL TLTGTEDPLV LLQGFSSGEV EIPIGLKTSA LIPVETDLLP GLKQCRGGGG GLIKPSSVPE DTTYGGRIYS EGPEPISSIF LEEQSIGPAT QPTMVPATRQ MTSEAVYAHI LDNILLLSQE HPIRLSFQQQ GYETAIDILS IFENELDALG YKSPTPIDGV DNPRIPLLMA HRQILRHFLR WQASLERQKG SPMKPSELIA LNNEDFVQYR GSALGQVSTT NSPSTLAPTS TSITPKVRSA ADDFRRGVKC DKTHYPVLKD DKYWDNFYRL FVVTAVSHNV EKVLDPMYAL TEPSDKALFE EQKKFVYSAL EHTLQTDMAS AKISSATTLG YLTTAKYGSS WTGTAEGFIL HWKNHLCIYH NTVPMAEQLP KQLCLSLLEN AVHNVPELCQ VKITATLDLA KGDFDTDPDV NYNIDLSPSV LYEANAHARR ENCTNNRHHS TPTNRERPYI PRDMWNQLSD DAKAILQGLA APAKVLPVSN GLARPLMAHV HATGSRDAAP DDNNGTPVDT FHDCAPETEL LAHLSDRVGR MDPGDIRKVL AASRNTVSTG SLRPHGTTKS LQSNVLQYQV SRHTVQNTTS ALVDRGANGG LAGSDVTVLH KTGRSANITG INEHTLSNLD IVTAAGLVES QRGPIVVIMH QYAHLGKGKT IHSSAQLEHC HNCVQDRSRT VGGNQRIVTL DDYIIPLNIR QGLPYMDMRA PTSHELHSLP HVVLTSDVDW DPSVLDNEID MKAEWHTDIH DLPGMPYIEP RFDNLGQYMH RHVAICNSQR HDALDRILTC NKHAVQRNEH DYDALRPCLA WVSSDTVRKT IFATTQYARE VYNAPLRRRS LVTDVYPMKT DKEFVNALED HIRYRGAMDK LISDRAQAEI SKKVTDITRA YHIDQWQSEP HHQHQNFAE