Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_29029 |
Symbol | |
ID | 7202934 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 801996 |
End bp | 804219 |
Gene Length | 2224 bp |
Protein Length | 574 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181978 |
Protein GI | 219123327 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCGGTTGCTA CTGGCGGTAA CCATTTTTAA CTCAATCGTC GTAGTAAATC TAATATATTC ATACCATCTG CGACAAGTAT TCAAGATATT TCGAGAGGAG GCCGCTATAC TTAATCGACC GTAGCTCGGA CGACCTGGGA AGCTCGACGT TTCTCGGTTG GGCTCCAGAT TTAGAATCTT TCCCTCATGT GGTTCTTCAA CATGGTTCTT TTTGCCGTCC TTTGCGGATC GTTTCGACAG CGAATAGTAC AAGCTTTTGT CCAGAAAAAT GTCTTTTTTG CTTCCTTAGC GGCGCGTGCC AAGAGTTCGG GATTAGGGGC AGTGGACGTC GGGACGCCGG CAACCGCTTT TGATGATGGC AAGCGTCCGT TTCAGATTAC AACGCCGATC TATTATGTGA ACGACAAACC GCACATCGGC CATGCCTACA CGTCCACTGG TGAGTTGCTT CGTTTTGAAT GCGATGCTGG TAGCCGTATC CACGTCCCGT CATTCACTCT CACACTCGAT ATATTCTGAA CCCATAGCGT GCGATGTGAT TGCACGATTC ATGAGGCTTT CCGGACGGGA TGTCTTTTTC CTCTCTGGAA CCGATGAACA CGGCGAAAAA GTGGAGCAAT CAGCCGAAAA GCAAAACATG GATCCGCAGG GCTTCGTAGA CCAAGTCTCG GTTAACTTTC GTGAGCTGTT AGAACTCATG AATATTTCCA ACGACAAGTT TATTCGTACC ACGAGCGAGG ATCATAAAAA GTCTGTTCAG GTACGTTTTG TCGAATGAGC TACTGCATTT TCTGGTCGCT CTTTGCACTC ACAGGGATTA AATACTATCT TGTACTACAC AGCACTTTTG GAATGTTATG GTCGAAAAAG GCTACATTTA TATGGGCACG TACTCTGGAT GGTATTCCGT CAGGGATGAA TGCTTCTATA CAGAATCGGA GCTGATCGAC GGCAAAGCCC CGACCGGAGC AGAAGTGTCC TGGGTAGCCA AAGAAGAATC ATATTTCTTT AAACTCAGCC AATTTGAAGA AAAATTGTTG GACCTTTACG AAAGAAATCC TCACTTTATT GCTCCAGAAT CACGAAAGAA CGAAGTTGTG AGCTTTGTCA AAGGTGGCCT GCGAGACCTA TCCATTAGTC GAACATCATT CAAATGGGGT GTACCAGTTC CTGACGACGA GGATCATGTC ATGTACGTTT GGGTAGATGC CTTGACGAAC TACATTTCAG CCCTGGGGTA CCCAGATATG GGCGAGGGAT CTGACTTCAG CAAGTTTTGG CCGGCCTCTA TCCATATTGT AGGCAAAGAT ATTCTGCGAT TTCATGCTGT CTATTGGCCG GCCATGTTGA TGGCCGCCGA ATTACCGCTT CCAAAGCGGT TATTTGCTCA TGGTTGGTGG ACGAAAGATG GTCAGAAGAT TTCCAAGTCC ATCGGCAACG TGATTGATCC TGTAGAATTG GTAAACAAGG TACGTTCATT TAGGTTTCAA TGATGAATAT TTCCTCCTTC TGCTGGTCTC TCAATGCCTA CTGATTGTCT GATCTGTGAT TTCTCTAGTA CGGGGTGGAC CAAACTCGGT TCTTTCTCAT GTCGGAAGTA AATTTTGGGA ACGATGGAGA TTTCTCGGAC AGAGCTATGA TCCAGAAGTG CAACACAAAT CTTGCGAATG AGCTTGGAAA CCTGTGTCAA AGAACACTTT CCCTGGTATT CAAAAACTGC GAAAAGGCCG TTCCTAACCC TGTCGGCGCG TACACACCAG AGGACGAGGC TTTGCTGGAA TCGGCCCGGA ATCTACGGAA CAAGGTAGCA ACCGAGATTT CCCAACAGTC TATTTTGAAA TACGTTCATG CCATGGTCGA AATGATTTGG GAAGCAAACA AATACATTGA CGAAATGGCG CCCTGGGCGC TGAAAAAGAC AGACCCGGAA CGGATGGCCA CTGTGTTGTA CGTTTTATTA GAAGTTCTTC GGTATTCAGC AATCCTTTAT CAACCGTTAA TTCCAGAATC GGCTGGAAAG ATTTTAGATC AGTTGACGGT ACCCGCAAAT GAACGCACGT TCCTGCATCT AAGTGAAGAA TACAGCATCA AGGCCGGTGC AGCGATTTCC AAACCTGTTG GAATATTTCC ACGAATCGAA ATGCCGGCGG ACGAGTTGGT AGAAGCATAG ATAACGTCAC TCTAAAATGG TTTTGACTGT GAATCTGAGT TACAGTGAGG TAGA
|
Protein sequence | MWFFNMVLFA VLCGSFRQRI VQAFVQKNVF FASLAARAKS SGLGAVDVGT PATAFDDGKR PFQITTPIYY VNDKPHIGHA YTSTACDVIA RFMRLSGRDV FFLSGTDEHG EKVEQSAEKQ NMDPQGFVDQ VSVNFRELLE LMNISNDKFI RTTSEDHKKS VQHFWNVMVE KGYIYMGTYS GWYSVRDECF YTESELIDGK APTGAEVSWV AKEESYFFKL SQFEEKLLDL YERNPHFIAP ESRKNEVVSF VKGGLRDLSI SRTSFKWGVP VPDDEDHVMY VWVDALTNYI SALGYPDMGE GSDFSKFWPA SIHIVGKDIL RFHAVYWPAM LMAAELPLPK RLFAHGWWTK DGQKISKSIG NVIDPVELVN KYGVDQTRFF LMSEVNFGND GDFSDRAMIQ KCNTNLANEL GNLCQRTLSL VFKNCEKAVP NPVGAYTPED EALLESARNL RNKVATEISQ QSILKYVHAM VEMIWEANKY IDEMAPWALK KTDPERMATV LYVLLEVLRY SAILYQPLIP ESAGKILDQL TVPANERTFL HLSEEYSIKA GAAISKPVGI FPRIEMPADE LVEA
|
| |