Gene Information |
Locus tag | PHATRDRAFT_34520 |
Symbol | |
ID | 7199637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 817144 |
End bp | 819618 |
Gene Length | 2475 bp |
Protein Length | 824 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179069 |
Protein GI | 219116548 |
COG category | |
COG ID | |
TIGRFAM ID | |
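The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its terminal stop codon; neither convention is stated in the record, although both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 817144
END_BP = 819618
GENE_LENGTH_BP = 2475
PROTEIN_LENGTH_AA = 824


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus one (assumes a terminal stop codon)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2475
    print(expected_protein_length(GENE_LENGTH_BP))  # 824
```

Under these assumptions, 819618 - 817144 + 1 = 2475 bp and 2475 / 3 - 1 = 824 aa, which agrees with the Gene Length and Protein Length fields.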
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGGC CGCGACTACG CAGTGTGTGG GGCGTCGGCT GCCTACTTTT CGTCGGGGGG TCGGCCGCGG AAGCGACCCG TCGTTCGTGG ACGCAACGGC AGCCTTTCCG ATGGATCGTG AACAATCCTT CCAAGCTCAA CTTCACATCA TCACCCGCCA TGGGTGTCCC GAGTCCCCGT TCCAAGTCAT CGACGGTCGT ACCCACCCGG GTTCCCGCGA ATAGTTCCTC ACGGGTCCCG ACTCGATGGC CGACGACGCG ATTCGGAGGG TCCACCAGCG CGCCGACGGG AAGTACGACA AGAGGACTTA CGCTGCGTCC GAGTATGGCA CCTATGGCAC TGCCGACGAC GCATGGGACA CTCACGGGTC CAACGGAAAC ACCCTCGGGT CTTCTGAGAA CGAGTCCAAT GTGGCCCTCC CCGGTACCTG TTCATAGTCC AAGACGAGAC AGATCGCCAA TCACAACACG GATGCTGGGA ATTCCTACAT CGACGAGTCC GGTCACTCCA ACCGCTGTCT CTATATTTCC TTCTCGACCC GCCGGAAGTT CATCACGGAA ACCGGTAGAG GAGAATAGCA ACGACGCAAC CGGTAGGCCA TCGCTAGGAC CATCGCGTAC GCTTTCCATA CCAATGGCGT TCTCCACTGA TCTACAGCTG GTGCAATACA ACTTTAGCGG TCCGCAAAAC ATTGTGGAAA CTCTAGAATT TGCTTGGCAA GGTTATTTGA CCGCCATTTT GAGTCGCTAC TATCGAGACC GTGAAGGGGT CCAGTTTACC GGTGTTGACC TAGACGTACG GCAAGGCCGG CAACGCAGGT TCTTGTGGCA GTCGAACATC AGACCCACGA GACGCTTACA GAAACTGGTG GGCAACGCCA CGATTTTATC GTTCGAAGCT AACGGTACCG CCGTTTTGCT GGTGGATGCC AGTACACAGG ACGCAAATTC GATTGTTGCT TCCACCAATT CCTTTCTCCG CACAGCTGTC ACGATGGAGA ACTTGCAACA AGCACTGGTG GACGTTAACG AAAACCTGGT GACAGTGTCG AGTGTGAGTG TACCAAACGC CACTGGCGTG CCCGAGCCCG ATCGAGATGA TGGACCCACT ACCGTTGAGA CTGTATTCGG GTTGTTCATT GCGGCTGCGG CAATGTTGGG CTTGGCCTAT ACATGCCGCG TCATTTGCCA AAATCACAAG GAAAGGCAGG CTCGTGCTAA GAAATTGATG GCCCGTCCCA TGGTATTACC GAATGCCCCG CCGTCGCTAG CTTACCAGCC CAGACCGATG CCAAGGCAAA CTTTGGTCAC GCCCAATCAG AGTGGGAACA ATGATGACAC CTTGAGTATC CCAGGAATTC CCAGTACGGA AACCAGTGAT GGTGACCGTT TCGCCAGAGA GCTGCAAGAA GCAGCTTCGT TGGACCGGGC GGTTTGGGAA GAAAAGCAGT ATGACAACTC AAACGGAGTG ACCGCTCCTT TTACTAGAGT GCCGGAGTCC ACAGGGAAGC TACAAGTGTC CTCGTCATTC CCGTATGGGG ACGAAGCCGT CGGCAACTTC GAGTCCATGG TGCACCAGCA AGGAGGCTTC GAATTGACTC CACAAGTGGG TATGCTTGGG TCAAACGCTC GATCCGCGCC CCCAATGAAT GGAGACGACA TTCTAGATGT GCCCGACTTC GAGGCGTTTG GGGATACAAA CACGGGTCCG GAGCTGAGAC AATTTAGCGC ATCGGATCGC ATGCTCTCCG GAGGAAACCA AAAAACCCGC GGGTTGCAAA TGTTCAGTTT CACCGATCGC 
CCAGGAGATG CAACAGTGGG TGATTCCACA ATCACATCCA GAGACCCATC TCTTACCGCT ATTCCAGAAT CTCCGTCACT GTCATCATGG TCGCCAAAAG ACAATGAAGA TGACGAGGAT ACCAATGTCT CGGACATATC TCATACCAAT GCCATGCTGC AAGAAGTTGA GCGTTTGTCG ATGTTTGTTA GACAATACGA AAAGGAAAAG GAAGCTAGAA AAAGCAGCCA ATTTTTAGTC GATACGTCTA GTACGGGAAA TGCTCCAGTA CAAAGAACAC AGACAAGAGC TTTCACGGAA ATTGAGAGTT TAAAAGACGT AAGCTTTTCT CCAGGGGACG AAGATTCCAA GCGAAGACTC GGTATCGGTC AATACAGCGT ACAAGAAAAA ATTCCTGGAC CTCTAGTGGA TGACGATGGC GAGCCGCGGG GGACCTTGGA GACAGCAAAT GCTTTGCAGT ACCCCAGCCT CGCTGCGGCA ACTTTGGGAA ATACAGGTAC TCCTTTGGAG CACAGAGACG GATCAATCGA AAAGGGAAGT CTGCCAGGAC TCCGCACAGC AGTCCAGCAA GAGCGTCGTT TTGGTTTGCC TCGACCTAAC GTCCGAAGCC GTTCGGGCAG GTTTTCAACT GATAGAACGG GTCGAACACA AGCGGAGAAG AGACCCTCGC CTTAA
|
Protein sequence | MKRPRLRSVW GVGCLLFVGG SAAEATRRSW TQRQPFRWIV NNPSKLNFTS SPAMGVPSPR SKSSTVVPTR VPANSSSRVP TRWPTTRFGG STSAPTGSTT RGLTLRPSMA PMALPTTHGT LTGPTETPSG LLRTSPMWPS PVPVHSPRRD RSPITTRMLG IPTSTSPVTP TAVSIFPSRP AGSSSRKPVE ENSNDATGRP SLGPSRTLSI PMAFSTDLQL VQYNFSGPQN IVETLEFAWQ GYLTAILSRY YRDREGVQFT GVDLDVRQGR QRRFLWQSNI RPTRRLQKLV GNATILSFEA NGTAVLLVDA STQDANSIVA STNSFLRTAV TMENLQQALV DVNENLVTVS SVSVPNATGV PEPDRDDGPT TVETVFGLFI AAAAMLGLAY TCRVICQNHK ERQARAKKLM ARPMVLPNAP PSLAYQPRPM PRQTLVTPNQ SGNNDDTLSI PGIPSTETSD GDRFARELQE AASLDRAVWE EKQYDNSNGV TAPFTRVPES TGKLQVSSSF PYGDEAVGNF ESMVHQQGGF ELTPQVGMLG SNARSAPPMN GDDILDVPDF EAFGDTNTGP ELRQFSASDR MLSGGNQKTR GLQMFSFTDR PGDATVGDST ITSRDPSLTA IPESPSLSSW SPKDNEDDED TNVSDISHTN AMLQEVERLS MFVRQYEKEK EARKSSQFLV DTSSTGNAPV QRTQTRAFTE IESLKDVSFS PGDEDSKRRL GIGQYSVQEK IPGPLVDDDG EPRGTLETAN ALQYPSLAAA TLGNTGTPLE HRDGSIEKGS LPGLRTAVQQ ERRFGLPRPN VRSRSGRFST DRTGRTQAEK RPSP
|
| |
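The sequences above are printed in space-separated blocks of ten. A minimal sketch for working with them is shown below: it strips the spacing, computes GC content, and translates the coding sequence. The Translation table field in this record is empty, so the standard genetic code (NCBI table 1) is an assumption here, and the helper names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code (assumed), codons enumerated in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the Gene sequence field.
    head = clean("ATGAAACGGC CGCGACTACG CAGTGTGTGG")
    print(translate(head))  # MKRPRLRSVW
    print(gc_content(head))
```

The first three blocks translate to MKRPRLRSVW, matching the start of the Protein sequence field. Although the Strand field is "-", the Gene sequence is already listed in coding orientation (it begins with ATG and ends with TAA), so no reverse complement is applied in this sketch.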