Gene Information |
Locus tag | PHATRDRAFT_49982 |
Symbol | |
ID | 7198770 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 26465 |
End bp | 28525 |
Gene Length | 2061 bp |
Protein Length | 614 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184877 |
Protein GI | 219129398 |
COG category | |
COG ID | |
TIGRFAM ID | |
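
The coordinate fields above can be cross-checked against the reported lengths. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon (the record does not state the convention); the function names `gene_length` and `coding_bp` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_bp(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein of the given length plus one stop codon."""
    return protein_aa * 3 + 3


span = gene_length(26465, 28525)   # values from the Start bp / End bp rows
coding = coding_bp(614)            # value from the Protein Length row

print(span)            # 2061, matching the Gene Length row
print(coding)          # 1845
print(span - coding)   # 216 bp of the span not accounted for by coding sequence
```

Under these assumptions the span reproduces the 2061 bp Gene Length, and the 216 bp beyond what 614 residues and a stop codon require is consistent with the gene being flagged as spliced.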
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.806954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CATGGGGCCA TTCATGCTGT GCAATAGTTG ACTGTGATCG TGAGACTTTC GCTTTTTCAG ATTGACTGCC TGGATTGATA CGGTTCAGCA GTCGACCATG TCGGAAAGGG GTTTGTCGTC ACCCCGCGGA AAGTCTACTT TCCCAGTGAT ACGTCGCAAC AGTTTCAACG AAAGCGAGTA CAGTCGATCA TCCTACGAAT CGTTAGGTCT CGTCGATCAT GTGCATCGCC TGGAGGCGCT TTTGTCCAAT TCGCTAGAAA CCAGCAGTAC ACTGCAGACA TGTGTCGATG AAGACGATGA CTTGGCAGCG GAATTGAAAC GGCTAGAACA CGCAGAGCAG TTGCTGCGGC AAGAGTTGGA AGACCTCGAA GTAAATGGCG TCCCGAAGCC GGATTCATCG ACGCAAGGCG TTTTTTCGAG CGACAATCGA GCTCGCGAAA GGACGCTGAG CGCTTCCAGA GAGGAACTGA CAGTGAGCAA TGTAGAAGAC GACGACGACG ACGCAGACGA TATGTTTAAC TATATGCTGA ACTTTACGAC TGGAGCGTCT CCCCTTTCCA CTTCGAAAGG ACGCCTGAAA ACAGACAGCG ACAGGTTCGA TGCCATTTCA ACCCCCCCGC ACACACCGCC TCGAACCGTC CTGTCACCAC TGCGGCAACG TCGCTTGGAT GTCCTGGGTT TGGATGACAA CGCTTCTTTC CACGAACAAA GCTGCGCCCA AGAAGATGGC AACACAAGAC GGTCGTCGAG TTGGAGTGTG GAGGATTGGT CCTCCTACAA TCAACCGTCG GAGGGGAGCC TCGAGTCGGA GAGCATCGAG TCTGGATCGG TATTCTCCAA TCTAGGCGTC GCCAGCGTAG ATCGCTCCTC CTCCTTTCCA GTGGACGTGC TAGCCCGTGG CGATGTTGAC GATGAGGATA AACCTGTTAA AGCGACAAAG GTTGATGTAT TGGCTGATGG CGATGTGGTT GAAGTGGAAG AGATCGATGA AGTGCCTCTC GAGGCTGCCG ATACACTGGC ATCCTCAACA CCGTGGGCCA GGAGAAGGAT TCACAATCAA ACAGAGATGC AGCAGAGTAA ATCCACAACA CCAATCATTT CAAGTCGAAG TATTACGACT TCCGCAGACA CAGTTCAAAG CAGTTCTATC GAAGTTAGTC GGAGTCCATC ATCAGCAGCC ATACCAAAGA GAATCCCCGC AATCACCCCC CGCCACACAA AGACACCAGA ATATGATCAC TTTCAAGGCC CAGATACCTC CTCTCAACAT ACCATTCCCA CTCGAAAATC ATCGCGAAAG AAAAATGAAT GGTCGGCCAT GCTGTCCCAT TCGGGGAAAG CAAAGGTCAT ACTGCGTCCC TTGGCCAATG ATCGTCCTGC TACAACAAGG GATAATTCAA TGTCTCCTCG TCTCTTTCCA AAGACTCGAA ATTCGCCTTC GGCGTTTGGC AATGGACATG CCGTCGAGAC CCGCGGTATA CTCCAAACGG AAACTTATCC GTCGTCGATC TCCAAGTGCG ACGATCCCCA ACGATTCGAA GATTTACAAT CTTTCGGAGC GGGGAGTCAA AATCCGGCCC AGCAACAAGC CAATGACGAA AGCACATCGC GGCGGGTTGG AAGTCTCGTC TCGGCAGCTC CGAGGCAGGA TACACGCCGG CCGAAGGTTC CTCACCGTAT CGCGATGGGC TATGCTCATC ACCAACTGCG AGACGATGAC GATCAGGATG CGCAGGTGTC TACATGCGGA TGGAAGTTCG GTGGCGAAAG GACTTCGAAC GGAATATTCT TTTGCTTAGC 
CTTGATTGCA TTGGTTCTGT TAGTTGCGGT GCCCACGGCG TTCATCTTGG GAAGTCGATA CAACAAACCT TAAACCCCTC GCTGCTTTCT GTAAATTTTA ACCGAGGCGC CGGTGCACTT TCCCGAATTG ATGACAGTAC CCGCTCTCCT TGCTCCGCTG TTGCCGCCCG CAAGGCCACG TTACGCTATT CCATCGACTA TTATTCTACA GGCGGTGCAC CGCACCTTGC CGGTATCCTG CACCTTTTTC AATATCACTA A
|
Protein sequence | MSERGLSSPR GKSTFPVIRR NSFNESEYSR SSYESLGLVD HVHRLEALLS NSLETSSTLQ TCVDEDDDLA AELKRLEHAE QLLRQELEDL EVNGVPKPDS STQGVFSSDN RARERTLSAS REELTVSNVE DDDDDADDMF NYMLNFTTGA SPLSTSKGRL KTDSDRFDAI STPPHTPPRT VLSPLRQRRL DVLGLDDNAS FHEQSCAQED GNTRRSSSWS VEDWSSYNQP SEGSLESESI ESGSVFSNLG VASVDRSSSF PVDVLARGDV DDEDKPVKAT KVDVLADGDV VEVEEIDEVP LEAADTLASS TPWARRRIHN QTEMQQSKST TPIISSRSIT TSADTVQSSS IEVSRSPSSA AIPKRIPAIT PRHTKTPEYD HFQGPDTSSQ HTIPTRKSSR KKNEWSAMLS HSGKAKVILR PLANDRPATT RDNSMSPRLF PKTRNSPSAF GNGHAVETRG ILQTETYPSS ISKCDDPQRF EDLQSFGAGS QNPAQQQAND ESTSRRVGSL VSAAPRQDTR RPKVPHRIAM GYAHHQLRDD DDQDAQVSTC GWKFGGERTS NGIFFCLALI ALVLTRSPCS AVAARKATLR YSIDYYSTGG APHLAGILHL FQYH