Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45660 |
Symbol | |
ID | 7200445 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 847320 |
End bp | 849224 |
Gene Length | 1905 bp |
Protein Length | 593 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179928 |
Protein GI | 219118301 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.912846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACGACAGT GACAGTTTGA CCAAAATCTT GGCAAGCAAA GCTAAGTACA AGTGCCGAAA TGGTCCATCG AACCAAACAC ACGTAGTGTA CGACATTATG GCGATCCACA CCCTGCAAGT ACAGTGCCTG TTAGTGATGC TGGTATTATT GGCATGCAGC TGGGGTAGTG CCAGTTGCGC GGGGAAACTA GCCGTGGCCT TTGCATCGAC GCGACCGATA CGGTCGACTT TTCCTTCAAC CGGATTCGTA AAGGTGAATC TTCATCGTTC CTACCAGGCA CCTCTTTACG TTCTACCACA GATTCCCGAA GAAACGCCCT CGTCGCCTTC AACGAAAAGA TTTCGCTTTC TTTCCAAATT GGGTATTGGT ACGGCTAGGA AAGATGGCCG CGCTATTCGC AATATCGCCC GGACCGAAAA AAAAGCCCGA ACCATTTACA ACATCACCAC ACAGATGGAC TTGGACAACT ACTGGAAAGA TGATCAACGT CGGTTCCGCA AGGATGAAAA GGGTACCATC GATTATGATT GGCTGATTCG GTCACTGAAC GTGAGCGGCG ATACCCAAAT CATAGGGGAC CCCTCACGTC CGGAATATGT TCATCCCGTC GCTCAACTCG TACACGAGCG CCAGCGGCGG GGTACAGCCT TGGGACAGCA CAAAGATGGT TGTAAACTCG CCTTGGCTGT CGAAGGTGGA GGAATGCGTG GCTGCGTGAC GGCCGGGATG ATTTGTGCCC TCCATCACTT GAATTTGACG TCCGTATTCG ACGTAATCTA CGGCTCGTCG GCAGGGAGCA TCACCTCCGC GTACTTTATT ACGGGACAGT TGCCCTGGTT CGGACCAGAA GTCTACTACG ATCAGCTAAC GACGGCAGGC AAGAATTTTA TTGATAGTAG TCGCTTGTTT CGGGCACTAA AGGGCTCGGT TTGCTGGATC CGCGGTTGTA CCGGGATGTA ATAACACGCC CCGACGGCAA ACCCGTCTTA AACCTCAAAT ACTTGCTCAA AACGACCGTC AAGGATACCA AACCATTGGA TTGGGACAAA TTTTTGGAAC AGCAAACGCT CCAACCTTTG AATGTGGTAA CCTCCGGCCT CAAGAGTCAG CGATCGATTG TCCTAAGCTA TGAAAACGGT GGTTTCGAAA ATTTGAATGA ATTGACAGAT TGCATGCATG CGTCCTGCCT ATTACCGGGC ATTGCTGGGC CTGTCATGAA CCTGGATATG CGGTCTACTT CTCAACGGGG GAAAACTCCC AAATTGATGC TTGGTAATGG ACGGATGGAA GACTACCTCG AACCGTTGGC GGACGCTTTG ATTTACGAAC CGCTCCCCTA TCGATCGGCC GTTGCGGCCG GCGCGACACA TGTCGTAGTC TTACGGTCGC GCCCAGACGG TACTGATGTG ACTGGAAAGG GTGGTATTTT TGAACGCATG ATTTTTCGAC GGTTTCTTTT ACGCAAGAAC AGACTTCCCC ACATGTTTCA GCGACTTTCC CAGCAACTTC ATAAGAAGTT GTATGCTGAA CAAGTGATTG AGGTTAACGA AGCAGCATAT AGTAAACAAG ATTTCAAGGA TACTTCTAAC CCACACTTAC TCGGCGTTGC GCTACCTCCT GGATCACCGG AAGTTGTCCG ACTAGAAACC GGCCGTGAGG CAATCTTTGA AGGAATTCGG AGAGGTTTCG CTCGGGCGTA CGATTGCCTC GTTGAAGACC CAAAGGAGCG CGGTCGTGGA CAAATCGTAG CCAAGGAGTA CTTTCCGGAC GAGATTTTGG ATTACGATCC TTTGACGATT AGTGAAACTG ACAGATCGGC CTTTGAAGTC TACATGAAAA AGAGTGGAAT CACACCAAAA TCATGGGGAG ACAAAGAACA CCGAGCTAGA CCAACAGTGC GATGA
|
Protein sequence | MAIHTLQVQC LLVMLVLLAC SWGSASCAGK LAVAFASTRP IRSTFPSTGF VKVNLHRSYQ APLYVLPQIP EETPSSPSTK RFRFLSKLGI GTARKDGRAI RNIARTEKKA RTIYNITTQM DLDNYWKDDQ RRFRKDEKGT IDYDWLIRSL NVSGDTQIIG DPSRPEYVHP VAQLVHERQR RGTALGQHKD GCKLALAVEG GGMRGCVTAG MICALHHLNL TSVFDVIYGS SAGSITSAYF ITGQLPWFGP EVYYDQLTTA GKNFIDRLGL LDPRLYRDVI TRPDGKPVLN LKYLLKTTVK DTKPLDWDKF LEQQTLQPLN VVTSGLKSQR SIVLSYENGG FENLNELTDC MHASCLLPGI AGPVMNLDMR STSQRGKTPK LMLGNGRMED YLEPLADALI YEPLPYRSAV AAGATHVVVL RSRPDGTDVT GKGGIFERMI FRRFLLRKNR LPHMFQRLSQ QLHKKLYAEQ VIEVNEAAYS KQDFKDTSNP HLLGVALPPG SPEVVRLETG REAIFEGIRR GFARAYDCLV EDPKERGRGQ IVAKEYFPDE ILDYDPLTIS ETDRSAFEVY MKKSGITPKS WGDKEHRARP TVR
|
| |