Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_15186 |
Symbol | |
ID | 7194700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 78387 |
End bp | 81266 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183027 |
Protein GI | 219125523 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATACCG ACACTGTTGT CTGGACCACC GACAAGGTCC GGAGTACCTT TATCGACTTT TTCGAACAGG AACCGCGATC ACACACGTTC CAACGCAGTT CAGCCTGTGC CCCGCTCAAC GATCCGACGC TTTTATTCAC CAACGCCGGA ATGAATCAGT TCAAGCCCGT CTTTTTGGGA CAAGTCGATC CCAATTCGCC CCTCGCCAAT GTACAAAGAG CCGTCAATAG TCAAAAGTGC ATTCGAGCCG GTGGTAAGCA CAACGACTTG GAAGACGTGG GACGGGACAC CTACCATCAC ACGTTTTTCG AAATGCTAGG ATCCTGGTCT TTCGGTGACT ACTTTAAGAA AGAGGCCATT GACTACGCCT GGCAATTGCT CACCGAAGTT TATCGCCTCG ATCCCGATCG ATTGTACGCC ACATACTTTC AGGGTGACGA CGCCCTCGGA CTCGCACCGG ATTTCGAAGC CCGTGATTTG TGGCTCCGTT ACTTGCCGGC AGCGCGGGTC ATTGGCTGCG ACGTCCGCGA CAACTTTTGG GAAATGGGAG AAGTCGGACC CTGTGGACCC TGCAGCGAAA TTCACTACGA TCGCATCGGC AACCGCGACG CGGCCGCCTT CGTCAACGCC GACGACCCGC AAGTCATCGA AATATGGAAC ATTGTCTTTA TTCAGTACAA TCGGGAACCG GATAAACTGC AAACTCTCCC GGCAAAGCAC ATTGATACCG GTATGGGACT AGAACGACTC GTGTCTATTT TACAAAACAA GGATTCCAAC TACGATATCG ACGTCTTTCA ACCACTCTTT GCCAAACTCG CCACGCACAC GGACCGGGGA CCCTACACCG GCCGGGTCGG CACAGACGAC GTCGATTTAA AGGACACGGC CTATCGCGCC ATTGCCGATC ACGCCCGGAC CCTCGCCTTT GCCATTGCCG ACGGCGCCGT CCCCAACAAT GAAGGACGCG GCTACGTCCT GCGACGCATC CTCCGACGGG CGACCCGATA CGGACAACAA ATTTTGCACT GCGCACCCGG ATTCTTCGCC ACGCTCATTC CCGTTGTTGT AGAAACCTTT GGAGAAACGT ATCCGGAACT ACGAGCGGCC GAATCTACCA TTCTCGAAAT CGTTCAGGAA GAAGAACAAG CCTTTGGTGA CATGCTCGAT CGTGGTATCA AGTTTTTTAC CGAACTGGAA GGAGAACTCA AGGAAAATAA GGTTCTGGAA ATATCAGGAG AAAAAGCATT CTTCATGTAT GATACCCTTG GTTTTCCAGT TGACTTGACC GAACTAATGG CCCAAGAAGC CGGCTTGTCG GTGGACATGG CTGGCTTTGC CAACGCCATG GAAACGCAAA AGACTCGATC CCGGCAGGCT CAAAAGACGG CTCGCGCTGG AAATGCACCC GTCCTCGAAC TTGTGGCCGA ACAGACGGCC TGGTTGGTGG ACCAAAATGT TTTGCCCACC GACGATGGGT TCAAGTACAA GTGGGACGTT CAGTTGCCGG CCACTGTCAT GGCCCTGTAC GGGAAGGACG GCTTTTTGAC CGGTGACTCC ACCGCAGACC AAGGCGACTT TGTCGGTATT GTGCTCGACA AATCTTCTTT TTATGCCGAA GCAGGGGGGC AGGAGGCTGA TGTGGGCACG TTGGAGTTTT TGGACGAAGC CGGCACCATT ACGGGTCGAT TTACCGTGAC GGATGTTCAA GTATACGCCG GATTCTTGTT ACACAAGGGC GCCGTGGAAG AAGGTTCCAT TGCCGTAGAC CAAGCAGTGA ATTGCAAGGT TGACTACGAG CGCCGGCGTA TGATTGCGCC AAATCATTCC ATGACGCACG TTCTGAACGC GGCCCTGCGC AACGTGTTGG GCGAAAAGTG CGATCAGCGT GGTTCGCTTT GCAACAACGA AAAGTTACGC TTCGACTTTG CCCACAAAAG AGCCATGACA ATGCTAGAAA TCAAAGCAGT TGAAGAATTT TGCCAGAAGA GCGTCGCTGA CGCCCAGCCT GTCAAATCCA AAGTGATGTC CTTGGCCGAT GCCCAAGCTA TTGACGGTGT CCGTGCCGTC TTTGGTGAAG TTTATCCCGA TCCGGTCCGA GTCATTGCCA TTGGCGACAG TAGTTCGGTT GAGTTTTGCG GTGGGACACA TTTGGAAAAT ACAGCCGAGG CGGAAGCCTT TGTTTTGGTG GAAGAAACTG CCGTCGCCAA GGGTATTCGT CGTGTGACAG CCGTAACCAA GGATGCCGCG AAGCGAGCTC TGGCCGGAGG AGTCAAATTT CAAGCATTGG TCGACAAAAT TGAACAACTA CCAGCTACGA CGTCTGGATT ATACAAACAG GCTGGGTCAG CGCGGAAAGA TTTGGATGCG GCGTTTGTTT CCGCTGTACT CAAGGCAGAG CTACGAACAA GACTAGAAGC CATTCAAAAG AAAGCCAATG ACGCGGCCAA GAAAGCATTA CAGCAGCGTG TGGATTTGGT ACTGAACGAT GTGAAAAAGG ATGTGGCGGT AGCATTGGAA GAAAAAATGC AAACGCTGGT ACTTAATGTG GACATTGCGG CCGACTCCAA GGCCTCGCAG CGTGTCATGA ATACGGTCAA GGAAATCGCC CCGGAGATGG CCTTTTTGGG CGTGAGTGAG GCCGAAAGCG GAAGTGGTGG CAAGATAATG GCGTTTGCTG TAGTGCCCGA CAGGCTAATG GAAGAATTCG ATCTCAGGGC GGACGAATGG ATTCGTGCGA CACTGGAATC TTGTGGTGGA CGCGGCGGGG GCAAACCCGG CAGCGCACAA GGGCAGGCTC AGGATTGTGC AGACGTTTCC GGTGTTATGG ATGCTGCGAA CGCATTTGCG TCGTCCAAAG TCCAAAGCAA AACCTTATAA
|
Protein sequence | MDTDTVVWTT DKVRSTFIDF FEQEPRSHTF QRSSACAPLN DPTLLFTNAG MNQFKPVFLG QVDPNSPLAN VQRAVNSQKC IRAGGKHNDL EDVGRDTYHH TFFEMLGSWS FGDYFKKEAI DYAWQLLTEV YRLDPDRLYA TYFQGDDALG LAPDFEARDL WLRYLPAARV IGCDVRDNFW EMGEVGPCGP CSEIHYDRIG NRDAAAFVNA DDPQVIEIWN IVFIQYNREP DKLQTLPAKH IDTGMGLERL VSILQNKDSN YDIDVFQPLF AKLATHTDRG PYTGRVGTDD VDLKDTAYRA IADHARTLAF AIADGAVPNN EGRGYVLRRI LRRATRYGQQ ILHCAPGFFA TLIPVVVETF GETYPELRAA ESTILEIVQE EEQAFGDMLD RGIKFFTELE GELKENKVLE ISGEKAFFMY DTLGFPVDLT ELMAQEAGLS VDMAGFANAM ETQKTRSRQA QKTARAGNAP VLELVAEQTA WLVDQNVLPT DDGFKYKWDV QLPATVMALY GKDGFLTGDS TADQGDFVGI VLDKSSFYAE AGGQEADVGT LEFLDEAGTI TGRFTVTDVQ VYAGFLLHKG AVEEGSIAVD QAVNCKVDYE RRRMIAPNHS MTHVLNAALR NVLGEKCDQR GSLCNNEKLR FDFAHKRAMT MLEIKAVEEF CQKSVADAQP VKSKVMSLAD AQAIDGVRAV FGEVYPDPVR VIAIGDSSSV EFCGGTHLEN TAEAEAFVLV EETAVAKGIR RVTAVTKDAA KRALAGGVKF QALVDKIEQL PATTSGLYKQ AGSARKDLDA AFVSAVLKAE LRTRLEAIQK KANDAAKKAL QQRVDLVLND VKKDVAVALE EKMQTLVLNV DIAADSKASQ RVMNTVKEIA PEMAFLGVSE AESGSGGKIM AFAVVPDRLM EEFDLRADEW IRATLESCGG RGGGKPGSAQ GQAQDCADVS GVMDAANAFA SSKVQSKTL
|
| |