Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_27821 |
Symbol | |
ID | 7201491 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 848673 |
End bp | 852172 |
Gene Length | 3500 bp |
Protein Length | 822 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180719 |
Protein GI | 219119937 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATGTAGGCA CACACATCAA TCCTGATTGG ATGATAAATT GCTGAAAAAG TAGATCACTT CGAGAAAGGA GCAGCCAAGA CAGGAAACGA GGAGATGCAC AGGGCCCGAG TCGACCAGTG TTTCAAAAAG ATATTTTCAA ATATTAAGGC CCAATGACCA ACAAAGAATT TGTGACGTTC CCCCCCGCGT CTGAAGCATT ACAACAATTG TGAGTCTGCG CAAGCCAATT TTAATAACCA CTTGCCATTC TTCCTAGCAA GTACTTTCAT AGTCAGTCGC TCTCCCACTC GCAAAAAAAC TCACAGTCAG TCCATTTGTC TCAATTCATT GATTTGATTC GGCATGCGTC TTTGAAGGTA GTTCAGAATG AAAATTGTCT TTCCGGTAGC TACGACGTAC GTACAACCCT GAACTACTTC TTACATAGGG TTCTGCGTCT GACAAATAGT CCAAATCCCT CTCAGTCTGT CTGGATATAC GGTAGAACCA ACGCAGCACC ACACAACACG CACGCGGCGA GATTGTGGAA AGCATCTGCC AATCAATTTT CCTTTTCCGT TATCGAGTTT TCTGCAGAAC GCTACGACCC GAAACACCAC ATCCTTCTAC GAGGACCCCT TACATCAGCG CTTATTGTCG TGAATTGGGT ATAGATCCAC TGATGCGAGT AAGACCGCAC AGCAGATTCA ATTGAAATCC TCTTTTTCCG TTAGTGTTTT GATCTATAAA ATACGTTCTT GATCCCTTCT GAACGACTCG TACGCGCTTT TCCTCCAGTT CAAAGAGAGA AAAGAAAGCC GCTTCCCCTC GTCACCGACG ATGAGATTTA CCCGAAACAC TGTTATTGCT GTAATTATGA CAAATTCTCT CTTTCTTCAG CGCACGTCCC GGCTCGTGGT TCGAGCCTTG ACGACGGCTG CGCCGCTGTC GACCCGTCGG TCCGCTGTGG CCTTGGTGCC CAACGCGGCT TCTGCGACAC GGGCGACTGG CTTTGTGATG CCAACCTCGA CCTCTACACC TTTTGCCCGC ATGCTGGCCA CCAAAGCAAC TGTTGAAGAA GACTTGGACG CCGCATTGGA TGATGTTTTG GCAGGAGCCT ACACGGAGGC CAAGACTCCG GCTGGAGTCG AGCCTGTCAA TCACATGAAG AATTCCCATC CGATGCCTTC GCCATTGGTA GAGCAGGTAA GCAAGAAAGA AAATATATTA TGCACAGGGC CTACTCTTAT CAGCGTAACG ATCTCACGAT ATTCTTTTTT GTTTCGTTTC TTTAAGGATA TTGATTACAA GGACCCCGAA CTCTTATCCA CGAGTAATCC TCGTTGGATC GAAGCCGGTC TCGACCAGAG GGTAATTGAC GTTCTAAGCG AGAAGGGAAT TACGTCATTC ACACCCGTAC AGGCCGAAGC CTTTGGGCCA GTCATGGCTC GACGTGACGT GATTGGTCGC AGTCGTACCG GAACGGGTAA AACCCTAGCG TTTGGATTAC CCGCATTGAC TCGTCTCGTA ACATTTACTA CAGAAAACGG CAAGCGCGAT GCCCGTGGAG TCATGAAGAG TGGACGCAAG GTATCCATGA TTATTCTGTG CCCGACTCGG GAACTGGCGC GGCAGGTTCA GGAGGAGCTT TCGCAAGTCG CCCGCCCTCT TGGCTTGTTT GTTGAAGTCT TCCACGGTGG TGTGTCTTAC GACCCTCAGT CTCGCGCCTT GCGACAGGGA GTGGACGTCA TCGTGGGTAC CCCTGGACGA GTAATTGATC ATATCGAGCG CGGAACGTTG GATCTGAGTG AGTGTGATAT TGCTGTTCTC GACGAAGCGG ATGAAATGTT GAACATGGGC TTTGCGGATG ATGTGGAAGT TGTTTTGAAG AACGTCGGTT CCAATAATCC GCAAAAAACG CAATGTTTGT TGTTTTCGGC CACGACACCG AGCTGGGTTA AGGAGATTGG CCGACAGTAC CAAAAGGACG TTTTGGCGAT TGACTCTACG GCGGATAAGG GCGGTGCTCG AGTGGCCGAG ACGGTTCGTC ATTTAGCCGT TCAGCTTGCT CCCGGCGCCG ATGCAAAAAG ATCTGTTTTG GAAGACATTA TTGCGGTTGA AATCTCCAAG GATGCTGATA TCGGCAAGAT TGAACTCGAA ATTGCCAACC CGATTGCTGC TGCTGCCCAC AAAAGGAAAA ACAAGGGTAA CCAAGCCATG CAGCAAAAGA TTTTTGGTAA GACGATTGTG TTTACCGAAA CAAAACGTGA GGCGGACGAG CTAGTATCGG GAGGAGTTTT CAAAAGCTTG ACTGCCCAAG CACTACATGG TGATGTCGGC CAGAAGCAAC GTGATTCGAC CCTTGCGGCA TTTCGAAGCG GGGCCTTCAA CGTGTTGGTG GCCACCGACG TGGCCGCGCG CGGTATCGAT ATTCAAGATG TCGATTTGGT CATTCAGTTC GATCCTCCGC GAGATGTGGA CACCTACGTG CATCGCTCTG GTCGCACCGG GCGTGCCGGG AAGAAAGGAG TCTCTGTTCT GCTGTTTAAT CAGCGACAGT CCCGAGACAT CGTCCGTATT GAGCGGGATT TGGGGCATGG TTTCAAGTTC GATTTAGTTG GACCTCCGTC CGCTGAGGCT ACTTTGAACG CCGCCGCCAA AACATCGGCG ATTGCGACGC AGAGTATTCC TGAGGAGACG GCTGAGTTTT TCAAAGAATC AGCAGCCAAG CTTCTGGAAT CGCAAGACCC AGTCGATGTG GTTGCCCGTT GTTTAGCTGC TGTCTCCCGA CGTGCGTCGG AAGTGCAATC CCGGTCGTTG CTGACCGGCC AGGTTGGCTT TGCGACGGTT GAGATGGTGA ACGAACGTGG ACGCCCGGTT GCGGCGAACG ATGTCATGTT CACAATTGGC AAGCTGTCAC GCATGAGCAA CCAGGAAGGA GATTTGGCCT TTGACAGCCA GGTTGGTAGG ATTCAGACCA ACAGCGAATC GGGCTCTGTT GTATTCGATA TGAATGTGGA AGATGCCAAA AATTTGGTGA AGTTCAGCAA GACTGTCGAT GCTGGTGGTG CCGCCTTCCA GCTTTTGAAG GCGCTTGCGG TGGAAAGGGA TCGAAACTTT GGACGAATGG GTGGAGGCCG TGACGGTGGT GGCAGGTTCA GCCGCGGACG TGGTGGCGGA GGCAGCTACG GCAGCGGAGG TAGCTACGGC GGCGGCCGTG GGGGCTACAG TGACCGCAAT GGTCGCGGTG GAGGTCGTGG TGGAGGTCGC GGTGGAGGTC GTGGTGGTGG CGGGCAGCGC TTCGATCGTC GTGACGGAGG CGGCGGCCAG TCTGGCGGCT ACTCAGGACG TTACGACGGT GGACGCAGCA AGAACTCGCG CGGGGGTAGT AGCTGGTAAT TTCAGTTCGT CGCAACAGCA ATAACTCCGG TACATAGCTC TTGGCGTTGT CTTGAGAGCT TCAAAATAAG CAGTGATCCT TAACGAAAAC CATCATTGAT AGTAACATAT TTGCACATGA
|
Protein sequence | MRFTRNTVIA VIMTNSLFLQ RTSRLVVRAL TTAAPLSTRR SAVALVPNAA SATRATGFVM PTSTSTPFAR MLATKATVEE DLDAALDDVL AGAYTEAKTP AGVEPVNHMK NSHPMPSPLV EQDIDYKDPE LLSTSNPRWI EAGLDQRVID VLSEKGITSF TPVQAEAFGP VMARRDVIGR SRTGTGKTLA FGLPALTRLV TFTTENGKRD ARGVMKSGRK VSMIILCPTR ELARQVQEEL SQVARPLGLF VEVFHGGVSY DPQSRALRQG VDVIVGTPGR VIDHIERGTL DLSECDIAVL DEADEMLNMG FADDVEVVLK NVGSNNPQKT QCLLFSATTP SWVKEIGRQY QKDVLAIDST ADKGGARVAE TVRHLAVQLA PGADAKRSVL EDIIAVEISK DADIGKIELE IANPIAAAAH KRKNKGNQAM QQKIFGKTIV FTETKREADE LVSGGVFKSL TAQALHGDVG QKQRDSTLAA FRSGAFNVLV ATDVAARGID IQDVDLVIQF DPPRDVDTYV HRSGRTGRAG KKGVSVLLFN QRQSRDIVRI ERDLGHGFKF DLVGPPSAEA TLNAAAKTSA IATQSIPEET AEFFKESAAK LLESQDPVDV VARCLAAVSR RASEVQSRSL LTGQVGFATV EMVNERGRPV AANDVMFTIG KLSRMSNQEG DLAFDSQVGR IQTNSESGSV VFDMNVEDAK NLVKFSKTVD AGGAAFQLLK ALAVERDRNF GRMGGGRDGG GRFSRGRGGG GSYGSGGSYG GGRGGYSDRN GRGGGRGGGR GGGRGGGGQR FDRRDGGGGQ SGGYSGRYDG GRSKNSRGGS SW
|
| |