Gene Information |
Locus tag | PHATRDRAFT_43674 |
Symbol | |
ID | 7197519 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 1154388 |
End bp | 1157426 |
Gene Length | 3039 bp |
Protein Length | 854 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178088 |
Protein GI | 219112673 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.300412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTCATCTTC CCACGCTTTT CATCTCTGTT AACCGTAAAG AGAGTTTGGG AATAAAGGTA AGAGAATGTT TCGATCACTT TTGAGCACAA ATGTCTTCCA ATACGCACGA CCATGTTTAT GGACGCGTAA TTGTAGGTTT TGAAGAGCGA TAGTACTGGT CTATCACAGT TTGCTTCGTG CCAAGACTTA ACTTCTCTCA GACAGGCGAC AATAGATCTC TCTGTCGACC GTCATATACG CTGACAATTT TTGCTTTTTC CTTATCCACA GTCGAAAATA TCTGATTTTA TTGACCATGC CTGATTTGAG TGACCAAGCC GAAAAGGGCG AGGTTCAGGA AGGCCCGAGC AACAGCTCTG TGTCTTCCCC GGCCTCAAAC GCGCGCAAAC GATGGTTTAT CGCAGTCGCC ACGCTTTTGG CCCTGACAGG TGTTATTCTC GCCATTGCCA TCCCAGTCTC GCGCAAAAAC GACCACGAAA AGGTCTCCAT TGCATCAGAC GAAGACTCGA ACATGGATGG AACCGTACAG CAAACTGCGA AGGCAACTGC CGACATAAGT GGATACTTCA ACAGCTCATT TTCTCTTTTT GGTGAAGACA TAACGAACGG ATACATTTCC CCCGATGAAT TTAAATCGGA CCTGCGTAAC GTCGCACGAT TTTTACTAGA TAGTGTGGTG AAGCGCAATC TCGGCTCTGA GGTTAACAAT GCTGCAGGTG GGGACCTTGA AGTTGAGCCC GGCGTAGGTG TTGCCGCTGA AGGAAGCAAC TCAGATATGC GCGCTCCCGA TGTCGGAGAT AACGTGAACG ACTTTGGAAC AAACAACCAG GAAGATAGCG TCGAAGAAGG CGACATCATC GTATCCAACG GTAGACACGG TATGTGGAAA AGCATGTGCA GGTTTTGCTA AAGAGAAACA AAGAAAGACA CACTCGCCAC TTTTGCTGAC AAGCCCTTCG CGTTCCTGGA TTATAGTGTT TGCAGTATAC GGTGATCGCG TCGTGATTTG GGATGCAACC ACTGGCGACA TGTTGTCTGA TATCAAGATG CCCACCTTTG ATGAATCTCT AAATAGCACG AAAGGAAGCG CGGCTGCTAC ATCTCGTTCG GATATCGATT TTTTCTACAG CGGGCCGTTC ATCAATGATC TTCTATTAGA TGGAGACAGA CTCGTGGTAG TAGTCGGTGG ATATGGTAAT GCTATGCGAG CAGCTCCTGG TGCTGAACAG CCCATTTTAT ACGACTACAA CGGAGCCCGC ATAGTGATTT ATGACATTTC CGCGCTCGAC AGTACTGGAA CAATTACTCA GCTCTTCTCA GAAGACATAA ACGGAAGTTA TAACTCAATG CGAGCCATCG GAAGCAATCT TCACATTGTT ACTATGTCGG GATTAGACAC ATACACTCAT TTGGTCGCGC CTTTTGAACG CTGGAACTAC CCAAATGTAA CTGACGAAGA ATACATCGCG CAAGTCCAGG AAGCTGCAGA AGGCAAGGTC ATTACAAAGT TCGTCGAGCA ACTGGCCAGT GAACTAACCT TTCACGGAAA GCTTCCTGAT TTTGCTCGGA TTAGTCTCAT GCAAGAAGAA TTTTCGGGTG GTGCGCATGA GAGGGTAACG TATTCTGACG GTGTAGCGAA CTCTGTCGTT CAAGTCTTCT CTCTTGACTT GGCTCAGGAT TTTTCAATAC TTGGAATTGG GGAGACGCCC TTTAGCGTGT CGGGTGCTTT CCTTGCCCCT TATTATGGCG AAGTTTACGC GGCAAATGGC ATGTTGATCA TTGCGAGCAA TGGATGGGGA 
TACAACAGCG AAAATGGAAT TTCTGAGGAC TACACGTACA TTTTAGCAAT GGCTCTCAGC GGCCCTTCTT CGACTCCCCA CTCTGTCGGT ACCGTGAAAG GATACTTTCT CAATAAGAAT TCAATTGATG TTGTCGGTAA CGTGCTCCGA ATCGCAACAA CAATTCAAAA CAGGTGGCGT TGGCTGATGC CTGAGCCTCT GATTCCTATC GACGGCGATG GAACGGACGG AAACGGAACC TTGTCTCGGC CGGCCGTCAT GCCTGAGCCA GTCCAAGATG AGCCTTCCAC TGAAAACTAC ATTATTATGT TGCAAATGCC GGGTGTAGAT GGCACAGACC CAGGTACGAT GCAGGAGCTT TCTCGGCTTC AGCTTGGAAA AATTAACGAG GTCTTTACAG CCGTCCGCTT CTTTGACAAT ATCGCCTATG CCGTGACATT TGAAAGAACG GATCCGTTTT ATGTCCTCGA CCTAAATGAC CCATCCAATC CCGAAATTCT CGCCGAGTAC AATATCACCG GCTTCTCTAG TTACTTGCAC TCCATGAACA CCGATAACAG TCTTATCTTG GCTATTGGAG AGGAGGCCGA CGGGGATGGA ATGCCCATTG GTCTTCAGAT CACAGTCTTT GACGTTCTGG ATCCTCGCAA TCCAGTTGCT GTCCAACGCC ACCTTATTGA GAACGATCCA GATACTTACT CGAGCACTGA TGGTGCATGG CAATTCAAAG CCGTTCGATA TGAAAAGACA TCTCAACGTC TCATTATTCC TGTGAACATC AACAACTGGA ATGATCCGAC CTCGAACTAT AATGGGTTCA TTGCATACTA TGTCAGTGCT ACTTTGATCG AAGAAAGTTG CCGCATTGAG CACGATGCAG GCTACGATGT CTTTATCGAT CCTATCTTCG TCGATCCGGA TTCCAATGAG ACCGCTGTCG AAAACGAAAC GCTTGTTGGT CCAGCTGATA CTATCGACGT TGCCCCTTCG GATTGTGTTT ACTGTGCCTC ACTCCAGCCT CGATCAATGA TCTTCAACGG AAACGTTATG ACAAGCAGTG GCCACTTTAT CCGTAGTACA GACTTGAACA CATGCGAGCA AGCTTGGAAA TTGGATATCG CCGAGGGCGA GTCAAACTGC TGCGGTGCCT GGTTCTAGAG AAATATTCCT CACGCAAAAG TTACGGCATA AAATGCCTTC ATAAAACGTA TGCATAATTA GCAAAGATCG ATGTCATCT
|
Protein sequence | MPDLSDQAEK GEVQEGPSNS SVSSPASNAR KRWFIAVATL LALTGVILAI AIPVSRKNDH EKVSIASDED SNMDGTVQQT AKATADISGY FNSSFSLFGE DITNGYISPD EFKSDLRNVA RFLLDSVVKR NLGSEVNNAA GGDLEVEPGV GVAAEGSNSD MRAPDVGDNV NDFGTNNQED SVEEGDIIVS NGRHVFAVYG DRVVIWDATT GDMLSDIKMP TFDESLNSTK GSAAATSRSD IDFFYSGPFI NDLLLDGDRL VVVVGGYGNA MRAAPGAEQP ILYDYNGARI VIYDISALDS TGTITQLFSE DINGSYNSMR AIGSNLHIVT MSGLDTYTHL VAPFERWNYP NVTDEEYIAQ VQEAAEGKVI TKFVEQLASE LTFHGKLPDF ARISLMQEEF SGGAHERVTY SDGVANSVVQ VFSLDLAQDF SILGIGETPF SVSGAFLAPY YGEVYAANGM LIIASNGWGY NSENGISEDY TYILAMALSG PSSTPHSVGT VKGYFLNKNS IDVVGNVLRI ATTIQNRWRW LMPEPLIPID GDGTDGNGTL SRPAVMPEPV QDEPSTENYI IMLQMPGVDG TDPGTMQELS RLQLGKINEV FTAVRFFDNI AYAVTFERTD PFYVLDLNDP SNPEILAEYN ITGFSSYLHS MNTDNSLILA IGEEADGDGM PIGLQITVFD VLDPRNPVAV QRHLIENDPD TYSSTDGAWQ FKAVRYEKTS QRLIIPVNIN NWNDPTSNYN GFIAYYVSAT LIEESCRIEH DAGYDVFIDP IFVDPDSNET AVENETLVGP ADTIDVAPSD CVYCASLQPR SMIFNGNVMT SSGHFIRSTD LNTCEQAWKL DIAEGESNCC GAWF
|