Gene Information |
Locus tag | PHATRDRAFT_28833 |
Symbol | |
ID | 7202611 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 848122 |
End bp | 850009 |
Gene Length | 1888 bp |
Protein Length | 464 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181639 |
Protein GI | 219122619 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.841033 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TCGTCGTTAC CCAAGACTTA CGCCAATTAC TTAACAGAGA CCCTGTGGGA TCGGACGATA CATAGTCGTG TCCAACGCTA TTCGCATCCA TCAGTCCAAC CAATCCACGG ATCTATCCTA TCTACCTTTC GACTACTGAC TGACTGTAAG CTTCCTCGTG AGCCAGAAGC CAGTTGAATC GCGCTCTTAT TCAAGTACCT CGCTACCCAC GGCACGTATA TATATACATA TACTCAAACG CAAACGCAAC TTTGGTTCGT TGTCATGGAT CGTTGGACGT CCTTGCCTTG GACGCCGTTC TATACGCAAC AGTCGGCAAA CGCTTCGACT TCTGCGGTAT CGGTGGACGA CCAGGATTCC CCCTTGCTCG CCTACTGGTC TCTAGCGGCC ACGGTAGCCT TTACGATTGT CGTTTATTCC TTCGAAGGAC ATTTGGACGC CCGGCAGAAG ATATCCTACC AACAGACATC GTTTCCGACG GAACTCGAAA CTACCGTACG CGCAATTGAC CAGGAACGCG CCGCTTCTGG GAAAGAATCG GCCGCTCACG GTGAGTCCAA GGACGGTAAT GGGGACAGCG ACGAAAACAA GTCCGAGGCG GCCAGTCAAC CGCTCTTGTC GCAGTTGCAG GCCAAGTTCC GGAGTGCGCA AACCTACGGT CTAGACAAGA TCAATTTTGG AATCCTCGCC GGTACCTACG ACACGGCGGA ATCGGTCGTC TTTCTATTGC TCGGATTCCT ACCCTACGTG TGGGACTGGT CCTGTCAATT GGGACAAACC TACTTTGGCT ATCACGATGA AGCAGCCTAC GAAACCAACA TCTCCCTCAT CTTTCTCGCC ATTATTACCC TCATTGGTAC CGTCACGCAG TTGCCCTTTG AGCTATATTC CACCTTTCAG ATTGAACGAA AACACGGATT CAACAAACAA ACCCTCAGTC TCTTTTTTAC CGACAAGATC AAATCGCTCC TCTTGACTTG TCTCATTGGT GGACCCTTTG TGGCGCTCTT GCTGTACATT ATCCGCGTCG GCGGGGAGTA CTTTTATTTG TACGTTTGGG CCTTTATGTT TGTCTTTTCT GCCGTCATGA TGACGCTTGT ACCCGTCTTT ATCATGCCAC TCTTTAACAA GTACGAACCC TTGCCGGATG GGGATTTGAA GACCCGAATT TACGCACTGG CCGATCGACT CCAGTATCCT CTTACCAAGT TGTTCGTCAT GGATGGATCC AAACGCTCGT CGCATTCCAA CGCCTTCATG TTCGGCTTTG GAAATAACAA GCGCATCGTC CTCTTCGACA CGCTTCTGAC TCAGGTACAG GAAGACGAAA TCCTGGCCAT TTTGGGACAC GAACTCGGGC ATTGGAAGTT GGGACACACC TTGTCGAACT TTGCCGTTAC CCAAATGTAC TTTGGCGCCG CCTTTTACTT CTTTTCCCTC ACCTACGGCT CCCGCTCACT CTACGCGGCT TTTGGCTTTG ACGACGTCTC CCGACCCGTC CCCACCATTG TCGCACTTTT GTTGTTCTTC CAAACACTCT GGGCACCCGT CGACAAGATA CTTTCCTTTA TACTCACCAT TACCTCCCGC CATAACGAAT TTGCGGCCGA CCGCTTTTCC GTCGACCTAG GAATGTCGCA GAAACTGCAG TCTGGCTTGT GCAAAATCCA TCTCGAGAAT TTGGGCGCCA TGTGTCCGGA TCCCTGGTAC TCCACCTACC ACTACTCCCA TCCACCCCTG GTTGAACGAC TGGGCGCCAT GATGGCTCTG GATAGAAAGA CCAAGTAAAT TTATACGGAC 
GCCAAAAAAG GAACGGCGTA CGGATTGTAT CGGATCACGG GTCTCCACGA AACAAGTCAC ACTATTAACT CTTGAAAGTT ATATGTTG
Protein sequence | MDRWTSLPWT PFYTQQSANA STSAVSVDDQ DSPLLAYWSL AATVAFTIVV YSFEGHLDAR QKISYQQTSF PTELETTLQA KFRSAQTYGL DKINFGILAG TYDTAESVVF LLLGFLPYVW DWSCQLGQTY FGYHDEAAYE TNISLIFLAI ITLIGTVTQL PFELYSTFQI ERKHGFNKQT LSLFFTDKIK SLLLTCLIGG PFVALLLYII RVGGEYFYLY VWAFMFVFSA VMMTLVPVFI MPLFNKYEPL PDGDLKTRIY ALADRLQYPL TKLFVMDGSK RSSHSNAFMF GFGNNKRIVL FDTLLTQVQE DEILAILGHE LGHWKLGHTL SNFAVTQMYF GAAFYFFSLT YGSRSLYAAF GFDDVSRPVP TIVALLLFFQ TLWAPVDKIL SFILTITSRH NEFAADRFSV DLGMSQKLQS GLCKIHLENL GAMCPDPWYS TYHYSHPPLV ERLGAMMALD RKTK
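The coordinate and length fields in this record can be cross-checked with a short script. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates (which agrees with the listed Gene Length) and that the coding sequence ends with a single stop codon. Under those assumptions, the part of the gene not accounted for by the protein would be non-coding sequence such as introns or untranslated regions, which is consistent with the gene being marked as spliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return 3 * (protein_aa + 1)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T, ignoring spaces and other characters."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
length = gene_length(848122, 850009)   # 1888, matches "Gene Length"
coding = min_coding_length(464)        # 1395
non_coding = length - coding           # 493 bp not explained by the protein

print(length, coding, non_coding)
```

To compare against the listed GC content, pass the full Gene sequence field to `gc_fraction`; the grouping spaces are ignored.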