Gene Information |
Locus tag | PHATRDRAFT_35233 |
Symbol | |
ID | 7200718 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 393598 |
End bp | 396427 |
Gene Length | 2830 bp |
Protein Length | 921 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179637 |
Protein GI | 219117693 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
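The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions (the record does not state the convention) and that the coding sequence ends with a stop codon; the function name `feature_length` is hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 393598
END_BP = 396427
PROTEIN_LENGTH_AA = 921


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


gene_length = feature_length(START_BP, END_BP)

# Coding nucleotides implied by the protein length, assuming one stop codon.
coding_bp = PROTEIN_LENGTH_AA * 3 + 3

# Bases inside the gene span that are not accounted for by the coding sequence.
non_coding_bp = gene_length - coding_bp

print(gene_length)    # 2830
print(coding_bp)      # 2766
print(non_coding_bp)  # 64
```

Under these assumptions the computed span matches the listed Gene Length of 2830 bp, and the 64 bp not explained by the 921 aa protein is consistent with the gene being marked as spliced.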
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGGGA ATTCCAATCA ACAACACGTG GGGGATTCCC GTCGCGTAGC GCTCGGTACC CGTGTGCCGT TGGGACGACG CGGAACCAAG GACGATGCGA AACCCACTCC CAAGCCGTCG TCGTCGTTGC ACGTGACGTT CCCGCGAACA ACGCCCCTCG ATAAGGAAAA TGGACACGGA ATCCGTCGTG ACGGCCGGCC GACAACAACC GCAACGAAGA CTAGTACTAC TGGTAGTGGT AGTAGTGCTC GTCTGGATGC CAAGAGCACG ACGACGACGA CGACTCCACG TGAGGTGAGC TTTCGCAGTG TGCTACGGGC CACCGTTACA CCCCGACGTC CCCACCAAAG TACCGTTCCG TCACGCTTGC CACCGCACAC TCCGGCGGTT CCCGCACCCG GACGACACCA GATCGCGACA ACGCCGCGGT CACTACTCAC CAACGAACTC GAGTTACTGG ACGGTGACGA TACCTTCTTG TGCTCACCAG CCGTCGGGAC ACCGAGACTG CCTTTACCAA CGTTTCACAT GCCGGAAGCG ACTCCAGCAA TCCAGGAATC GGCAAAAGCG GTTGAGGGTC CCGTACCGTC CATGGAACCC ACTAGGTCTA CCCAGCACCC GCGTCTCCCA CCCGTAGTTC CGGCTGTCCC GATTTCCGTT CCCAACTCCC AGGCTCCCTC CACCCGACCT CCCGTAGCGG AGCCACGCAT GCGTCCACAG GCCAGTCAAC AACTACCTAC CACTAATCGC ACAGTTGTAT CCCGTGTTGA GGCTCCTCCG CCTTCCACGC AATCGCGCTG GGCCCGGAAT CGTCTCTCTA CCTACCCGAC ACCTTCGCCG ACTCCCTTTA CGCCACGAGG CGTCTGTATG GACTTGTCCG ACATGTTTCA AGACGCTTCG TTTTCTACTC GCGCCACGGC ACACAAAAGT GCTTCTCGTG CTCACGTGCC GCACTCCGTT ACCAGTCGAC GCGCCTTGCT CGCCTCCGTC GCCAAACCCA CTCCGGCGGA TACTTCTCTA CTAGACAATG AACAGGATCA CGACTGGGCT GATCAACAAT GTCAGGCTTT TTCTTCCTGG CTCAACTACA CCTTTACTCC CTCGGAAGAC AAGGACCACG AAGCGGCCTT GGCGTCGGAA ACCACGCACG AAAGGGGTGT CGCCTTACGC ACCCTCGTAC TACACCAACG CATGGCCCAA GCCCGTCGCT CCGCCCTCGC CCTCTTTCAC ACCGATCCCG TCCTCCAAAA GAGCCGTCAA CGATTGTTGC AGGAAATCAG CAAGGGAAAA CTACGGATTC GCCCGGACCG GGATTTGGCC GTCAATCTCA CGCTCCGCAA CCAAGCCGTC GCACTCTGTT TGTCCTACTC CACACCCTGG CTCCGACTCG GGTTGGAAAC ATTGTTCGGG GAAAGCATAC TGCCGTCCGT ACCGCACCAC TTTTCGCCCC ACGGTAACCC TGTCGCCTCG CGCAAGGTGC CCACTACGCG CATGAAAGCG GCGCTCCAGA CCTTTTTGAT TCAACGCGTC TTGTCCGACG ATCTCGTCCT GGCCAAGTAC ACCAAAGGAC TCTGCAAGGT CCCGTCCGGT AGCTTCGAAA CCAAATACCG AGCCGAAATA CGCAACCTCG CCCTGTACCG TCTCTTGCTC CTTTTTCTCT TTCTGGATCG TGCGAAAGAA AATAATCTAT TGGACAAGGC ACCGCGATTG TTCGCCAAAA CGGCCTCGGT CAAATCGACC CGGGAAGTAC TGCTCACGTT TTGCCGCGAT TTCTTGTCGT CCGAAGGTGA TTTCGTTAAA 
CACTTGTCCC GCATGGGAAT CCAAGTGCAC TACAAGCAAG AACCTGTGGA CGAGTTGGAT TTCACCATTA CCAACCTTGC CGTAGATCTG CGCGACGGAG TGCGCTTGGC ACGCTTGCTC GAAATTCTCT CGCATGCACC GCGCAAGTCG CTTTTGGTGA AACTGCGGTT ACCAGCTGTC TCTCGTCTGC AAAAGCTCCA CAATGTCGGG CTGGTACTCC GACGTTTTCG GAACATGGGG GTGCCCCTGT CGGACGAAGT AGTCGCTCAC CATATTGTCG ACGGACACCG CGAAATGGTG CTCAAGCTCA TGTGGGCTGT GGTTGCCCAT TGTTGTTTGA ACGATCTCGT AAACGTTCAT GCGGTCGAAG CCGAAATTGC TCGTGTGGAA CGCGCCCACC GGCAAGCCGT TGTTTACCAG AATTACGAAC CCGACGTTAA GGTCCCTTCC GTACTGGAAG AGTTGCATTC TCTCCTACTA CGCTGGTGCC ATGCCGTCTG CTCCACCTTG GGAACGGCGG TCCGGAACCT GACGACCGAC TTTGCCGACG GCCGTGCCAT TTGTCTGTTG ATTCACTACT ATCACCCCGC ACTACTGCGC TTGTCCGAGA TTCGACCAAC GTCACGTTTT TCGCCACGAT CGCTGACGCA AGTCCGTGCC TTGGAAAATG AAATGTACAA TTCACAACTG GCCAACACAC GCATGTCCGA GCTGGGTGGT ATTCCGAGAA TCGTGCCCGA GTGCGATACG AACAACGTCC CGGAAGCTAA GTCCATGCTA CTCTGCCTTT CCTTTCTGTG TTCCCGTCTG TTGGAGTCTA GTACAGAGAT CCGTGCGATT CTTCTCATTC AGAATCGGTA CCGCGCCTAC CGGAAAGCTC GGTTGCGTCG ACGACAACGG GTAGTGGCCC GTTTTTTGTG GCAAGTCTGG CAGTCGCATA AACATAGATA CTACGCCAAT CAAGCATTCC ACTACGGCCC AGCCGTCCGG ATCATCGAAC GATTCGTACG AAATGGTAAG GAGAGACTGA
|
Protein sequence | MIGNSNQQHV GDSRRVALGT RVPLGRRGTK DDAKPTPKPS SSLHVTFPRT TPLDKENGHG IRRDGRPTTT ATKTSTTGSG SSARLDAKST TTTTTPREVS FRSVLRATVT PRRPHQSTVP SRLPPHTPAV PAPGRHQIAT TPRSLLTNEL ELLDGDDTFL CSPAVGTPRL PLPTFHMPEA TPAIQESAKA VEGPVPSMEP TRSTQHPRLP PVVPAVPISV PNSQAPSTRP PVAEPRMRPQ ASQQLPTTNR TVVSRVEAPP PSTQSRWARN RLSTYPTPSP TPFTPRGVCM DLSDMFQDAS FSTRATAHKS ASRAHVPHSV TSRRALLASV AKPTPADTSL LDNEQDHDWA DQQCQAFSSW LNYTFTPSED KDHEAALASE TTHERGVALR TLVLHQRMAQ ARRSALALFH TDPVLQKSRQ RLLQEISKGK LRIRPDRDLA VNLTLRNQAV ALCLSYSTPW LRLGLETLFG ESILPSVPHH FSPHGNPVAS RKVPTTRMKA ALQTFLIQRV LSDDLVLAKY TKGLCKVPSG SFETKYRAEI RNLALYRLLL LFLFLDRAKE NNLLDKAPRL FAKTASVKST REVLLTFCRD FLSSEGDFVK HLSRMGIQVH YKQEPVDELD FTITNLAVDL RDGVRLARLL EILSHAPRKS LLVKLRLPAV SRLQKLHNVG LVLRRFRNMG VPLSDEVVAH HIVDGHREMV LKLMWAVVAH CCLNDLVNVH AVEAEIARVE RAHRQAVVYQ NYEPDVKVPS VLEELHSLLL RWCHAVCSTL GTAVRNLTTD FADGRAICLL IHYYHPALLR LSEIRPTSRF SPRSLTQVRA LENEMYNSQL ANTRMSELGG IPRIVPECDT NNVPEAKSML LCLSFLCSRL LESSTEIRAI LLIQNRYRAY RKARLRRRQR HSTTAQPSGS SNDSYEMVRR D
|
| |
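The GC content field can in principle be recomputed from the Gene sequence field. The following is a minimal sketch, assuming the sequence is pasted as space-separated blocks of ten bases as displayed above; the helper name `gc_content` is hypothetical, and only the first three blocks of the gene sequence are used here, so the result is not expected to equal the 57% reported for the full gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C among the A/C/G/T bases of a nucleotide string.

    Whitespace and any other characters are ignored, so a sequence copied
    in blocks of ten bases can be passed in unchanged.
    """
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found in input")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First three blocks of the Gene sequence field (a prefix, not the whole gene).
prefix = "ATGATTGGGA ATTCCAATCA ACAACACGTG"
print(f"{gc_content(prefix):.0%}")  # 40%
```

To check the reported value, the full Gene sequence string would be passed to the function in place of `prefix`.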