Gene Information |
Locus tag | PHATRDRAFT_24006 |
Symbol | |
ID | 7199199 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 176690 |
End bp | 179052 |
Gene Length | 2363 bp |
Protein Length | 749 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185286 |
Protein GI | 219130258 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
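The Gene Length value agrees with the Start bp and End bp fields when the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. The sketch below checks the arithmetic and shows one way a GC percentage could be computed from a sequence string. The helper names `span_length` and `gc_percent` are hypothetical and are not part of the source database.

```python
# Coordinates copied from the record above.
# Assumption: both are 1-based and inclusive.
START_BP = 176690
END_BP = 179052


def span_length(start: int, end: int) -> int:
    """Return the length of a 1-based, inclusive coordinate interval."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so a sequence printed
    in space-separated blocks of ten can be passed in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


gene_length = span_length(START_BP, END_BP)
print(gene_length)  # 2363, matching the Gene Length field
```

Because the gene is marked as spliced, the genomic span is expected to be longer than three times the protein length; the span check above says nothing about exon boundaries.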
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACAAACCTG TAAATGTTGG CCCCGAATGT GTGGAACGTC GTGTTGTCTA CGAGACGAGT GTGTGGTCGG ACACTTGCGA CGAACGGTCG GGTTGCCGTG ATGCGCAAGA CCCTTCGTCG GACCCGCGGG TATTGGCATG GCCCCGGCGG CGACGGCGGC GGGCCTACCC GAGTCGATTG TCGTTCCTTG TCCACGAATG TCATTCCCAC CAATACTCCG TTACACGACG GTCTTTTTGC CATTCCGGAA CTCCAAACAC CGTCCGATTT TCTGCAGTGT GCCGTCCAAG CCATGCACCA GTGCGATCAA TTGCGCGATG CACTCGGTGA CCGCCACGAC GACCACGCCA ATAGCATCAC GTCGTCGTCG TCGTTGACCA AATCGTGTGC GGTGGATCGT TTGTACCAAC TCGACGAAAT TAGTCGCGTC GTTTGCAACG TAATTGACGC GGCAGAATTG TGTCGTAGTA CCCACGCTTC GGAAGAATGG CGTGACGCCG CCCACCGGGC CTTTGTCCTA CTCAGCGACT ACATTGCTCA ACTCAATACC GATCCACGTC TTTATCGGGC ACTCACGACC ATATTCGGGA CGTCGTCGTC CTCCACCACC ACCACATCGA ACTCCTCCAC TGCCCACGTC GCACCGTCCG TCTTTCCGGA ATTGACGACG GAAGAACAGC GCTTCGCGCT TTTGTTGCGC GCCGAGTTCG AACGCGACGG CATACACTTG CCGGAACACG AACGAGCGCA CGTACGGAAT CTCCAAGCGT CTCTCACCGC ATTGGAAACC TCCTTTAGTC GCAATCTCGT TACGTCTCAC AAAACCTTTG CCGCACACGC CCAGGACGTG GCTGACGTAC TGCCTCCGCA CGTACTCCAA GCCTACCACA TTGTCGTCCC ACCTGTCACA CACGAATCAT CCACTGTCCA ACTCGGTACT TCCGAACCGC AAATATTGCA AACACTCTTG CGCTACTCAA ACTCGCCGTC GTTGCGGCGA CAAGTGTATC TGGAAAGTCA CACGGCCGTC CCGGAAAACT TGGAAATACT GGACGGCTTG GTACGGGTAC GCTACGAATT GGCCCAAGCG CTGGGCTTTG ACTCGTACGC CGATCGCTTT CTGCGCGACA AAATGGCGGC CCACCCGAGC AACGTGGCCA GCTTTTTGCA AACGCTGCAA CGCAGCAACT CTCCCTTGTA CCGACAAGAA ATGACCATGC TGTCACAAGC CAAGCAACAA GTGGAAGGCA ATGACGTGGT CGAGCCGTGG GATGTACCCT TCTATATTGG CCTTTTGAAA ACACGGGACG GCTTTGATGT GCACGACGTG TCGGAGTATT TGACGCTGCC GCAGTGCCTC GATGGCATGA AAACGCTCGT CGATAAGTTG TTCGGAATCA ATATGCAAGA ATGCGAATTG ACCGACAACG AACGATGGGA CGGGCCCACT GTCAAGAAGG AAGAACGAAT TCGAAGGTTT GACTTTTTGG AAAAGAGCAC CGGTCGGAAG TTGGGAACGA TCTACCTCGA CCTGCATCCG CGTGAGGGCA AGTACGGCCA CGCCGCCCAT TTTACCGTCC GTTGTGGCTG TGTTCTGAAC GGACCCAGTG ATCCACCCAA GTATCAATAC CCCATCGTGG CTCTGGTGTG CAACTTGTCT AGCACCAACA CAAACTTGTC CCATGCCGAG GTCGAGACGC TCTTTCACGA ATTCGGACAC GCCCTGCATT CCTTGCTCAG CCGAACTAGA TTTCAGCACA TGTCCGGAAC TCGTGCAGCC ATGGATTTTG TCGAAACTCC 
CTCACATTGG ATGGAAAACT ACGCGTGGGA TACCGACTTT TTGAAAATTT TGGCGGTGCA TCCGCAAACG GGCGCGTCGA TTCCCGACGA GTTGATACGG GCCCTGCAAA AATCACGGTA CGAGTTCGCT GCGATTGAGC GCCAAAGCCA GATATTATAC GCCACGTTTG ATCAAAAATT GTTCGGGGTA CCCACTACGG CCGATACAAC GGCTCTTTTT GGACAGCTCC ATCGAGAGAT TGGCGTCCCC TACACCGACG GTACGCATTG GCATTCGCGG TTTGGCCATC TCGTCACGTA CGGTGCGGGA TACTATGGCT ACTTGTATGC GCAAGTGTTT GCCGGCGACA TTTGGCGACA CCTGTTTCAA GGGCGATCCA TGGAGCGAAA GTCGGGCGAC GAGCTGTGGC ATAAAGTATT GATTCACGGA GGCGCCAAGG ACCCGAGTGA TATGTTGACT GATTTACTCG GAAGGCCACC GCAGGTAAAT TCGTTTGGGC AATAAAAAGA CTACCACAAA CTCATGGAAA TGACCAATAG AATAAGATTT TCGTGAGTGT GAA
|
Protein sequence | MLAPNVWNVV LSTRRVCGRT LATNGRVAVM RKTLRRTRGY WHGPGGDGGG PTRVDCRSLS TNVIPTNTPL HDGLFAIPEL QTPSDFLQCA VQAMHQCDQL RDALGDRHDD HANSITSSSS LTKSCAVDRL YQLDEISRVV CNVIDAAELC RSTHASEEWR DAAHRAFVLL SDYIAQLNTD PRLYRALTTI FGTSSSSTTT TSNSSTAHRF ALLLRAEFER DGIHLPEHER AHVRNLQASL TALETSFSRN LVTSHKTFAA HAQDVADVLP PHVLQAYHIV VPPVTHESST VQLGTSEPQI LQTLLRYSNS PSLRRQVYLE SHTAVPENLE ILDGLVRVRY ELAQALGFDS YADRFLRDKM AAHPSNVASF LQTLQRSNSP LYRQEMTMLS QAKQQVEGND VVEPWDVPFY IGLLKTRDGF DVHDVSEYLT LPQCLDGMKT LVDKLFGINM QECELTDNER WDGPTVKKEE RIRRFDFLEK STGRKLGTIY LDLHPREGKY GHAAHFTVRC GCVLNGPSDP PKYQYPIVAL VCNLSSTNTN LSHAEVETLF HEFGHALHSL LSRTRFQHMS GTRAAMDFVE TPSHWMENYA WDTDFLKILA VHPQTGASIP DELIRALQKS RYEFAAIERQ SQILYATFDQ KLFGVPTTAD TTALFGQLHR EIGVPYTDGT HWHSRFGHLV TYGAGYYGYL YAQVFAGDIW RHLFQGRSME RKSGDELWHK VLIHGGAKDP SDMLTDLLGR PPQVNSFGQ