Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50132 |
Symbol | |
ID | 7198839 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 144576 |
End bp | 149219 |
Gene Length | 4644 bp |
Protein Length | 1477 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185058 |
Protein GI | 219129779 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.408284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGTGTGAGC ATTGGAGTAT CGCCGCGACG TCGGAAACAA TGTCGGGTCG TTCGAGCGCT GCGAGTTCCG CCTCGGCAAC GGGACGCGGA GGAGGATCTG CGGGACGGGG TCGCGGCAAT ACCAAGGGAC GCGGCAAGAC AGGAGGGCGA GGTCCCAAGC CCAAGGCCAA TGCTTCCGTA TCGAAGGAAA CAGGAGCATC CACTCCGAGC TTACAAACCG ACAGCAGCGG CAACGCAAAC AAAGCGCTTC CAACGAAGGA TCGTGGTGGC TCCGGAAGAG AAAACAGGAC GAAACCAAAT CCTGTCGGGT CCAAAACGCA GGGGAACCAG AAACGTAACA CACCCAATCC ACCCACTCTA TCGCCCAAGG ATCAGCAGCG CCATCAAGAA ATAGAAAAAC ACGCACAGGA AGTGGCTGCC GAAAAAGCGC GGGCTGTAGC GTTGGCTAGA GCCCAAGCTG CGGCAATCAA AGTCCGGGAA CAGAAGCAGG CCGATGTGGA TCAGAAGGTT CGGGAGGCGC GGGATCTGTT ACAGAACGTG ATCAATGCCA CTCACGCGCA TCAAAGGACA CGGGCGGACA TGGAGCCGGA AACCCTGCAG GCTTTTCGCA AAACGTTTCA GGCCAACAAG AAGAATCTAA AAACAGATTT GAAAAAGTGT ACCACCTTTG TGAAAAAGGT CAAGAGCGGT GTCGCGTGGT CTATGAAACC AGAGGATATT GTGAAGGACG TGTCAACGCT CAATTTAAGT CGCTACGTGG AAGAAGTAGC CAACGGGATC TTGGAAAGCT CTACTAAACC CAAGGTGTCT GAGTTGCCAA GCGTGCTGTC CCTTGTCACC ACAATGCACC AACGGTACGA AGATTTCTTA TCGATGCTGG TACCGGCCTT GTGGAAAGTG GTGAACGCCA AACCAACACC CGCTATGGCG AAGCTCCGAC GCTTGTACGT CCGAATCTTG ACGGAGTTGG TCCTCAACGG AATCGTGACG GAAACCAAAC CTCTGATTGC ATGTGTCACC GACGCGAGTG GAGCGAACGC GTCACAAGGA ACGAACTGCT CCGTGCAAGA CGCCAATCTG ATGGTAGCCT TTAGCAAGGC GGCCGGCTTT GAGTTGTTAG ATATTGTCCC CACCTCGGTC CATTCCGCGA TCAAGCTGAT TGGAAATTAC AATGAGACGG AAGCCTGTAC CCCAGACGGT GCGGATGCTA TCTTTCCTTC CCAGGATGTG ATAAAGCAAG CCTATCCTGT CGTTGAACAA ATTCCGTCGG TTTTGACAGA AAGAGCCGTG GCTGCACAAG TTTCCGAAGT GTTGGTCACC CACTGTCGGG CTGCCTACCA TGCCTTGTCC ACCAATCTGA CACAAACCCA CCGTAAACTG CTGCAGTTGG AGAAACGCTG CGAGCAAGAC CGCCTCTTGT CGGGGTCCTT GACAGAAGCC CGAGAAAAGG GATTAGTGGA TGCCCGTAAA CTGCAAGAAT CGTTGCAGAA GGCGGTGGAT GCTCTGGCGG ACGTCTTGGA CGAAAGGTTG CCTATTTTGC AGCAGCAAGA AGAAGACGGT GATGCAGACG GTAGTGGTGG ACTAGAAGTG TGGACAAAAG GAACTGACGG GGAAGAAGAC TTGGGCCCGT TCGACGACGA AGAAATGCGA ACTTTTTATT GTGACGTGCC AGATCTGCTC ACCACCACTC CACCTGCTTT GCTCGGTCTG GCACCAGAAG CTATAGAGTC GCGGAAATTA GACAACCAAA GAAAATATGG CGCCGAAGGA GAAGTTGTCG ACGAAGGTGA CGACGGCACC GAATTACCGC TAGGCTCGCA AGAAGAATTC GAAAATGAAG AGGACGAGGA ACTGAAGGAC GAGGAAGGTA CGCTCTATGC TTTTCTCATC CGGTAGTCGG TCCTCGCGTC TGACCTTTGC TGTGTGTTCT TTCTAGACCC AGAGATCGAG GAGCAAAAGG ATACACCGCA CTATCGCTTA ATGGTTCTTC TTGAGGAAGA ACTACCAGAA TGCCATCGTC GTGAGCAGCT CGACGAGTTA ACAGAGAAGT TTTGCACCAA CCACGGGTCT AGCAAAAACT CGAGGAAACG TTTGTCTCGA ACACTTTTCT TGGTACCTCG CTCTCGGCTG GATCTTTTGC CCTACTACTC ACGCATGGCA GCTACCGTTG ATCGCGTTTG GACGGATATT GCATCACCTT TGGTGGCGGA TTTGGAACAT CAATTCCATG GTCAAGCAAA ATTCAAAAAA AACCAAAACA TCGAATCACG TATGAAAACA GCTCGCTATA TCGGGGAGCT AACCAAATTC CGAGTTGCAC CTCCCATAGT ATTCCTACGA TGCCTTAAAC GTTGCTTGGA GGACTTTACG GGAAATAATG TGGACGTTAC GTGCTGTCTG CTGGAATCAT GCGGACGATT TCTCTTCCGT CTCAAGCACA CGAGCGGACG TATTGCGACG CTTATGGAAA CAATTACTCG CCTGAGTAAA GCCAAGGTAA GACGCGCCAA AAATCATTCC AGAATCTGGC CATCATCATT TTCTAATACT ATTGATCATT TTCAGAACCT GGATGAGCGC CATCAGTCTT TGATTCAAGC TGCGTTTTAT GCTGTCAAGC CTCCACCGAG CGGTCCCCGG AAGCAGCAAA AGGAATATCC CCCATTGGAA GCGTACGTCA GATATTTGCT AATGGTTCGT TTGGAACCTA CAGACACGTC TGTTTCTTTT ACCGCGAAGG AGCTCATTCG ACTACCGTGG AGCGATCCAG AACAGCAGTG TGGTGCACTT GTTTGTCGCA TCATGCTGAA GGCTTGTAGA AAAGGCCGAT ACCGAACAAT ATACGCGATT GCGGCCGTGG CGGATCGACT ACGACCTCAA CGATCAGCTT GTGAAGCCCC GGTTCGACTG GTAGACGCGC TACTCGAGGA ACTGCGATGG GCTTTGGAAC ATCCAGATTT TCGAGATCAA CAACGTACAA TCACGTATGC TCGTCTTTTA GGGGAACTGT ATTGCTCGTC ACAAGTAACG GGGCAGCTTG TCATAAACCA ACTCTACGAT TTCATCAACA TCGGTCACGA GATTCCGAAT GCTTTGCGAG AAGCGAGTCA AAAGCTGGCG GCATCTCAAG GAGAGGAGAG TGGGGAACAG CTCCCATTAT TCACCTCAAT GACAGGAGTC TCCCAAACCA TTAAGGAGGA TGAGGAAATG GAGGATGACA ACCTCGATGC CGCCGAACAG GCTCCTGCTC CGCCGACACC GGTGGCAGTT TCGCAGCATT CAAAATACGA CCCACGCGTG CCGTCGTTAG AGGATCCCCC AAATGCTTCT TATCGTGTGA AGCTTGTTTG TACGGTGCTT GAAGTAGTCG CGAAGTCTAT AATGTCGAGA AGCGTTCTGC CCCGTATGAG AGGTTTTCTG GCAGCTTTTC AGAGATACCT TTTCACAAAG ACTACGCTCC CAACAGACGT GGAGTTCTCT TTACTAGATA CTTTTGACAT CCTTGACTCT CAATGGAAAC GTTTTACCAG GACGAGTCGA GACAAAGAAC TGGAGGAAGG ATCGGAAAGC GGCTTTCCAC GATATTCCAG TTGGCTGGAT GCACACAATG TGACCGTGGC GTTCGAAGAA TCGGACGCAC TCCTCGAGCT CCAAAAGCGC AACCATGTTG AAACCTTGGC GGACGAATCG AAAACACTTG GTGATCTTCA TGTGAGCATG GATGCACATT CCCTGAATGA CGATTCAGCA TCGCTGTTGG AAGATGAAGA GGACGATGCT TCCCTATCTG TTAGTGCCAA AGGTAGCGTT GCGGAAGACA CGAACGATGA TGAAACCGAG TCAGGAGACA GTGATGAAGA CTACGAAGGT GTGACCGAAG ACGATATCGA TAGCGAATCA GGTGGCAATT CCGAAGAAGA AGAAGAAGAG TTTGACGAAG AAGCTTATAT GCGTCAGCTG GAAGAGGAAG CCTTTGAACG AGAGCTCCGA CGAGTGACCA TTGAAGCCTT GGAGAAGGGC AAGAACACTT CGCGCAAGTT GGTTGCTGAA AACATGGTTT CGGGATCACA GACTTTCAAA AAGAAACCAA CAGACCTGTC CAAGCCGGTT GGCGCCAGCT CCGCGGTGGC TCCGACCTTT GGACTTGCCG GTACTCCCGG CATCACGTTT CAATTGTTGA AAAAGGGAAA CAAAGGAAAA GTAGAGGCGA AAGAAATTGT CGTGCCCTCG GATACGAATT TCGCCAAGGT TGCCTCCAAG CAAGACGACG CCGCGGCTAG GGAACGAAAC TTTATCAAGC AACGCGTGTT GCAGTACGAA GCCGACAGTG CCGAAGCGGA ATTGTCTGGG GGGAATGTGT ATCTCGAACA AGAGAGGTTG CAGGTGATTC GGAATCGGCC TCTTTCTATG GACGATATTG ATCGCAACTT TGGAACCAGA GGTGGTAATA CGCTTCAATC GGGCAAGGAC AAAGGCAGGA CGGGTGCGGC TTCCGGTGAC CGTCCGGGCA CGAGTGTGAT ATTGGGCCAA CGCGGTGGTG GACGAGGAGG CCCGGGTCGA GGCAGAGGCG GTCGCGGAAA CACCAGCGGA CGAACGCTCT TCCGAGGCTA ATAAATATTC TATAAGAAAG TACCCGTGTT GAAT
|
Protein sequence | MSGRSSAASS ASATGRGGGS AGRGRGNTKG RGKTGGRGPK PKANASVSKE TGASTPSLQT DSSGNANKAL PTKDRGGSGR ENRTKPNPVG SKTQGNQKRN TPNPPTLSPK DQQRHQEIEK HAQEVAAEKA RAVALARAQA AAIKVREQKQ ADVDQKVREA RDLLQNVINA THAHQRTRAD MEPETLQAFR KTFQANKKNL KTDLKKCTTF VKKVKSGVAW SMKPEDIVKD VSTLNLSRYV EEVANGILES STKPKVSELP SVLSLVTTMH QRYEDFLSML VPALWKVVNA KPTPAMAKLR RLYVRILTEL VLNGIVTETK PLIACVTDAS GANASQGTNC SVQDANLMVA FSKAAGFELL DIVPTSVHSA IKLIGNYNET EACTPDGADA IFPSQDVIKQ AYPVVEQIPS VLTERAVAAQ VSEVLVTHCR AAYHALSTNL TQTHRKLLQL EKRCEQDRLL SGSLTEAREK GLVDARKLQE SLQKAVDALA DVLDERLPIL QQQEEDGDAD GSGGLEVWTK GTDGEEDLGP FDDEEMRTFY CDVPDLLTTT PPALLGLAPE AIESRKLDNQ RKYGAEGEVV DEGDDGTELP LGSQEEFENE EDEELKDEED PEIEEQKDTP HYRLMVLLEE ELPECHRREQ LDELTEKFCT NHGSSKNSRK RLSRTLFLVP RSRLDLLPYY SRMAATVDRV WTDIASPLVA DLEHQFHGQA KFKKNQNIES RMKTARYIGE LTKFRVAPPI VFLRCLKRCL EDFTGNNVDV TCCLLESCGR FLFRLKHTSG RIATLMETIT RLSKAKNLDE RHQSLIQAAF YAVKPPPSGP RKQQKEYPPL EAYVRYLLMV RLEPTDTSVS FTAKELIRLP WSDPEQQCGA LVCRIMLKAC RKGRYRTIYA IAAVADRLRP QRSACEAPVR LVDALLEELR WALEHPDFRD QQRTITYARL LGELYCSSQV TGQLVINQLY DFINIGHEIP NALREASQKL AASQGEESGE QLPLFTSMTG VSQTIKEDEE MEDDNLDAAE QAPAPPTPVA VSQHSKYDPR VPSLEDPPNA SYRVKLVCTV LEVVAKSIMS RSVLPRMRGF LAAFQRYLFT KTTLPTDVEF SLLDTFDILD SQWKRFTRTS RDKELEEGSE SGFPRYSSWL DAHNVTVAFE ESDALLELQK RNHVETLADE SKTLGDLHVS MDAHSLNDDS ASLLEDEEDD ASLSVSAKGS VAEDTNDDET ESGDSDEDYE GVTEDDIDSE SGGNSEEEEE EFDEEAYMRQ LEEEAFEREL RRVTIEALEK GKNTSRKLVA ENMVSGSQTF KKKPTDLSKP VGASSAVAPT FGLAGTPGIT FQLLKKGNKG KVEAKEIVVP SDTNFAKVAS KQDDAAARER NFIKQRVLQY EADSAEAELS GGNVYLEQER LQVIRNRPLS MDDIDRNFGT RGGNTLQSGK DKGRTGAASG DRPGTSVILG QRGGGRGGPG RGRGGRGNTS GRTLFRG
|
| |