Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41042 |
Symbol | |
ID | 7198853 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 208670 |
End bp | 211831 |
Gene Length | 3162 bp |
Protein Length | 885 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184990 |
Protein GI | 219129637 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.718194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGAAC CGCTTTTTTT GGGAAAGGCA ACTCTCTTTG ACGTAACGAG AGTCCTGATG TGCACCGTTG CGGAAAAAAT TTGGGAAAGT TCTATCGAGG AACCCATTTC CGTGACAAAA GTCAAACTTT AAAAAAGGTT GTGACAACCG ACAAGAAGTT TCTTACTGTA AATAGCAAAT GAAACCATAT TACTTGTAGG TTAAACACAT GATGATCTTG CAAACAAACA TCACTCCTTT GCCTTTTCAA CGGAACTGCA ACTCCGCAGA ATGACACTGT CTCTCAAAGT GTTATTCTTT ACGGTAGTTA TACCTGACAA AACGTTGTCG GTGATCAACT CCTTTCGCGG AACGGAAACA GGCGTCCGTA CTGCGGACAA TTTTCTGTAT TTTACGACGA AATTGCGTCG ACGAACCGGT TTCTAATACC AAGTAAAAAG GAACCGGTCC CAAGGCATCC CTTCGTCCAG AAAAGACGTC TTTAACGGAG TGGATCGGGT AGCCTTTTCG AACAGTTCCG ACTCCGAAGT AACTCACCGC AGTGCGCTCG CGTCACGGAT CATTGAAGCT GAATTAAAGT CCGCTTTCAC GACATTGTCT TGCCTCGCTA TCTGACACTC CTAGCGCTTT CTTGGCGTCG CTGATCGGAC TCGTGTTGTT GACCGGCAAC AATCTAAGAA CGACACCATT TCGGACGGAG AAACGATCGC GGCGGACGCA CGTGCCATCG CGGAGGCGTT TGGTAAACTC GAAATTGCTC CCGAGCACGA GGGCATCGAC ACTGGATTTG TGACGGACGA CGTTGACGAA CCTGTCTATA CCACGGCAAC GGCTGCCACA ACGCTTCCCG ACGCGTTGCT GCAGCAAGTC GTTTCCGAAG ACGAGGAGGA AAACGATCGG TATATTCTGG ATCCGTTGGA TCTTGACAGG ACTTCGGCAC AACTAGAACC CGAAGACGAC TGGTTTACTC AAGACGAACT TGCTGCGGCA TTGCGAGGGT TGGACCCCAG TCAAGACTTG CTGTGTCCGC CGGATTCTTC CTGGCAAAGC GACGAGGCGA GCGACGAAGA CTTGCCCGAT CAGGATCAGG AACTCCCGGA AATCGATATA TTGGATCTGG CTTACGACGG TGATTTTCTG GCTTCCAACG GAGGCGAATT TTCTTTGGAC AAAAGTCTTT TCGAACCTCT ACAAGATCTC GATTACGGTT TTGATCACGG ATCGATCGAC AGCCCTAGGA ACTGTCATAC AAGCGAGGAA TCACTGATAC CTACGAGTGA CGTGGGGGCT CCTGATTGTA ACAAAAGCAA CGCGATCGAA CTTTTGGAGC AATTGGAATC TCTTACGCTC GACTATCATC AGGATACAGA ACGAGAGGAG AACTTTGAAG AAATAGACAA CAACGAAATA AGCTTGGAGC AAGAGTTGCC ACCGTGGCTA TTCTGTCAAC CTTGCAACTC CAATTCATCG GACCTTGAAG CACTGGAGCC ACTACTAACC GCATCAACTG AAAACTACGT CGAACGCAAT CGTTACGACG AACAAGCCCT GGATTCATTG CTGGATTTGC GCGGCTTAAC ATTGGAAGAC TACTCGGAAT ACTGCCTACC TCAGGGTGAT TTAGAACAAT TGGATACGGA GGAACCGACT AGAAAAATGC TACCCTCGCT TTCGGACAAC GCGACGTCGG TTACAACAGG TGTTGATATT CGACCTGATG TGCAAAGGTT GGCACAACCC ATTTTGCAGA CTTTTGATGC GATCTATGAA TCAGAACTCA GCAAAGCGGA AACGACCGGA ACAGTAGCGA AACAAAGCAC TGCTAAGCCA CCCCGGAGCG AACGACTCTG TTTAGGGCAC AAGGAACGGA TTCTTGGTTT GGATTTGTCT CCGTGCGGTC AGTACTTGGC GACAGCCAGT CAAGATTCAA CCGTTCGCGT ATGGTCAACC GACACGAACC AATTGCTCGC AACAGTGCCG CACAATTCGG CCTATGAATG TTTGAGAGTA GTCTGGGCTA GTCCACAATG GGCTGAAAAC AATATAGATC GTAACGGTTG TGCTTGCCCT TACTTGCTGG CGACGGGCGG TGCCGATGGA ATTGTTCGAT TATTTCGGAG TGAGAAGCCG ACCGAGTGGG TATTGTGTGC CACTTTGGAC CATGCGGAAA TGAATCATTT TGAGGGCGAA GAAGAGGCCG ATACACCTCA AGTATATGCA CTTCAATTCA TTGATCATTG GAAAGCTTTG CCGGGTTCAA AAGAATCTGA CACGAATTCG TTCCTCTTGA CATCATCGGA TGACCACGTT CATCTATGGG AAATTTGTTC TAAATCCGAG GGGAAAAAAG AAGAATCCGA CAATGACAGC GGCGAATCGG GGAATCTTCG GTTGCGTGAA GTCTTTAGTA TGCACTTTGG CGATATGCAT AATCAGGCCT ATGGCGTGCA AGTCGGGCAT GTTACAGCTG CCGGGCTCGA CATTGCGGAT GCCACGACCA GCCCTATCCC TATAAGTAGC GGTGGAGATT CAGGTGTTTT TGGCGGAGAT CGTAATCCGA GAGGCCTCGT GTACGTTTTT GATGCGCGGT ATTGTGAAGC GAACGGACTT CTGGGAGCCG CTCTATCGGA TGGAACCTTG CGCCTCGTCA ATGGACGTGG GGTATGTCTT TCGTTGTTGC AGCTACCCGG TCATCGCTCG CACTTAACTT CACTTGCTTG GGATCGAACG GGAGAATGTC TCGCAACTTG CGTAGCGACA GGGCATTTAA TTACATGGGG AGTCTCCGTT GATGAATACG CCAATCGTGT GCACGCCACC TGTCGTGCTG TGATGGAAGG TGGCCACGAC AAAGGTCGAC CTCTGTTTGG TACGGAATTT GTAGGGCGTG ATGGGGATGA GGAAGATGAT CTTTTGATTT CATGGGGTGT CGATGGGCGG CTTTGTTTGT GGGATTCTTT TTCCACTGAC GAAATAGATC GACCGCTAGC CGTGCTCTTA CACAAACCGG AATATCCTAT ATACGCAGTG GACTTAATGC AAGATGCTTT TATTGCGGTT GGCGGCGGCA CGGGGGATGG TGGAGGTTTT GTCGGCATTC CTGTTCATCT TTACAAATTT CCACCGAAAG AAAAGGCATC ACCAGACCCT CTCCAAGGAT AG
|
Protein sequence | MVEPLFLGKA TLFDVTRVLM CTVAEKIWES SIEEPISVTK RFLGVADRTR VVDRQQSKND TISDGETIAA DARAIAEAFG KLEIAPEHEG IDTGFVTDDV DEPVYTTATA ATTLPDALLQ QVVSEDEEEN DRYILDPLDL DRTSAQLEPE DDWFTQDELA AALRGLDPSQ DLLCPPDSSW QSDEASDEDL PDQDQELPEI DILDLAYDGD FLASNGGEFS LDKSLFEPLQ DLDYGFDHGS IDSPRNCHTS EESLIPTSDV GAPDCNKSNA IELLEQLESL TLDYHQDTER EENFEEIDNN EISLEQELPP WLFCQPCNSN SSDLEALEPL LTASTENYVE RNRYDEQALD SLLDLRGLTL EDYSEYCLPQ GDLEQLDTEE PTRKMLPSLS DNATSVTTGV DIRPDVQRLA QPILQTFDAI YESELSKAET TGTVAKQSTA KPPRSERLCL GHKERILGLD LSPCGQYLAT ASQDSTVRVW STDTNQLLAT VPHNSAYECL RVVWASPQWA ENNIDRNGCA CPYLLATGGA DGIVRLFRSE KPTEWVLCAT LDHAEMNHFE GEEEADTPQV YALQFIDHWK ALPGSKESDT NSFLLTSSDD HVHLWEICSK SEGKKEESDN DSGESGNLRL REVFSMHFGD MHNQAYGVQV GHVTAAGLDI ADATTSPIPI SSGGDSGVFG GDRNPRGLVY VFDARYCEAN GLLGAALSDG TLRLVNGRGV CLSLLQLPGH RSHLTSLAWD RTGECLATCV ATGHLITWGV SVDEYANRVH ATCRAVMEGG HDKGRPLFGT EFVGRDGDEE DDLLISWGVD GRLCLWDSFS TDEIDRPLAV LLHKPEYPIY AVDLMQDAFI AVGGGTGDGG GFVGIPVHLY KFPPKEKASP DPLQG
|
| |