Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44026 |
Symbol | |
ID | 7203990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 729608 |
End bp | 732616 |
Gene Length | 3009 bp |
Protein Length | 384 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186405 |
Protein GI | 219113643 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAAAC TTCTTTACTT GCGATCAGGA CTGGTTTTTG CGGCCAGTGC TTTGTTAGGG GTTACGCCGG CCGATGGGTT TTCCCCGACA GCCACGCGGT CGCAGCTCCT TGCCGAGCTG CAAAGGTGTC TCACTCCGAC CGACTTGCTC GACCGGGTCG GTGCACGTGT CTCGCGGAGT ATCGATCCCG ACGGCAGCCT AGCGAGCCTC GTTCTCGTTC GTCTTTCTAA GCTTGTGATT TCTTTGGATA ACCAAGACCG AGCTTTCGTG GCGGATGAAT CATCAACGAA GACTCTGAAT GCCATCATCG AGAGTCTCCT CGATTCCGAC CCGTCATCAA ATTGTGAATC TATTGTCGAA GCAACGAAAG CTTGTTCTAC TTTTTCGCGC ATTTCTCCAT ACTCACCAAA CTCCACTTAC GAACGGCTTT TTAATTTTTG GAACGATTCA TCGGATATTG TCTCGCAACT AGAACCACAC CACACTTCCG GTGTCAAGTG GGCTTTTGAT GTTCTTACAC TTCAGTCATC GGCACCCTCA CCCACCGCGC TCGATCAGGC CTATGCCGAT CTCAATTTGC CATTCGCCAT CATACCTGGT TGTCTTGCTG GATTGGAAAG CCTTTCCGTA GATCGACTGA CTTCACAGGT ACATTTCAGG GTAGATGATA TTCGCACGAC TTCGAACAAA GTAGTTTCAG AACGCCGACA AACCGCTTGG GAAGGAGATA CTGGCGTGGC GCCATTCACT TATAGTGGCA AGGCGATGCC TCGGAGCGAC TGGTCACCAC TAGTTGTTCA AGTGCGCGAT GTCGTGCAAG CTCGTACAAA TCAGTACTAC AATGGATGTC TTTTAAATCT CTATACGGAT GGCTGTAGTG GGATGCGGTA CCATATCGAT CCAGACCAAG GGACTCTATG GGACTACGAC ACCGCGGTAG TTAGTGTGGG GGCATCGCGG CGATTTGCGT TCCGTGCCAT GTCCTCTTCT ACTTTGCAGC CACACAATTT TGTTGTCATG CACGGAGATT TGACGTACAT GTTTGGAGAG TGCCAGTCAC AGTTTCAGCA TACAGTGAAA AAGGCGGACG ACAAAACCGA CACCACTCCG CGTGCGAGCT TGGTGTTCAA AAGAACCTGG GAATGTAAGA AATGAAATTG CACATAAACT TATTCACTCT TTGTGGAATC AAAACGGAAG TGATGGTCGT CTTTGTGGGA GCGCATAAGC TCAAGGAACA CCTTCTCGTA ACCACTGTGA TAAAGGTGAT CCTCTATCAT GGTGCGAGAC ACATCGATTT CGGAAAAGAA AACATTGGGC ACGATATTCC CTGAAATACC GCTCCCGTCT CTATACAGGT GAACGCAACG ACCCGGAGGA TACAACTCCG GCTTAATCCG TTCGACCGGG TTCTGCTCGA TCGGAAGATC GTCCAGCAGT GGCTCCACCA CACCACGGAT ATGCTGAGTG ACGTCTTTGG TAAATAAGAC TGGCTGGCGG CGTGCCAACT CTTCAATCGC ATGATCTATA TCACGTTTTG CAAAGGGTAC CCAGTCAAAC TCCATAATAT CCAATAGCAA GTTGGCCAAA TGGATGCCAC TCAGGCGAGG TACTATATCG CAATCGTTTA TAACCGTCAA AATGTAGGAA GCTTGTTCAG CCAATTTCTT GCTGAGTAAA GCCGGGCAAC CAAAGCCAAT CACATTCACA TCGAACTTGG GATGATCTCT GAGTTCCATC CCTGCGATAG CTGCTGCACC AGCTCCTAGA GAGTGCCCAG TCAACGTTAG CTTGATGTTG AACTTACCGG ATTTCTCAAG AAGCTCCTCC AAAAGACCTG TATGGGTATC AGCTATATGC TGGCCGCTAC CCAAAATGAA CGAATGCGCT TTACCACCTC TGTACTCTAC CTCGTCGCAA AGTAGGCCTG TCAAGGCATC TGCAACTGAT TTTGTCCCAC GGATAACAAT CAAGACTTCC AAAAAGGGAT TAAAGGAGGA TTGATCTCGC TTGACCGCCA CAAAATTGGC GGGCTTTCCT GGCTCGCTTC GAACCTCCGC GTATACAAGC TCGTAGGGTT CCTGTTGCCT GTGAAGACCT TCACGAATAT CTTCAACACT GTCTGCGTAT GAGAGATACG CTAAGCTCAG GATGTTGTTC AGGTCCTCAA CCTCTTTGAT GTTAATTCCG GGATAGAAGC GGTGCATCCG GCGCTTCCAG CTTGGATTTT TAACTTCATC CTGGCCCTCC AAGTAGTAGA ACAAAGCACT AGGTGAAAGG TGGGAAAAAT CGAGGGAGCC CATGTACTTC TCGGCAACGC TTTCTATGTG ATCCTTGTAA TTATCCAGCA TTCGGAAGAG AGATTGAAAC GAGTGTGTGT CGACAACTTC GCCTTCGTCC GATGGTTGCC TAGCTTTCTC TACGATATCT TTCAGATAGT CTTGTTTGCT GTCACCTCCT GCAATGAGAC CTAAAAAGTC CTTTGCCATG GCCGAGAACG ATGGCTTTTC CTTATGCTGC TTCTCAGGGC CGCTTTTTGT TTCTTCAGTG AATCCAATTG AATTTTCAGC TGCTTCGAAC ATCGCCATAA GCTTATTGCC TACTGCAGCT GGCCGTAGGG CCACAGTTTT TTTTTGATCG GCGTCATGGA CCGATTTTGA TGTGGAAGAA GTTGAATCGT CCGTCAGAAT TCTTTTCCAC GCGTCTTCAG CTTTTTCAGT CAAATCTACT GTCATGTCAT AAATGGAGTC ATCTTTTTTT GAATGCTTGC CCTCTGATTT TTCCTTCTCT GTACCCTCTA GGAGCGCGTT CTCATCCGAA TGCTGGAAGG ACGTTATAGT CAGGAAAAAA GCTGCAATAA GTGTTGACGA CAGTACTCTT CGATTTACGG AAAATGATTT GAAGCGTTGG AAAACCTGTG CATTGAAAGG AGACATGGTT TCAAACTTCG CATCATTCGC AAATACCCAA ATCGACTCTC ATCAGCACTT GCTGTTACGG TGAAAGGAT
|
Protein sequence | MGKLLYLRSG LVFAASALLG VTPADGFSPT ATRSQLLAEL QRCLTPTDLL DRVGARVSRS IDPDGSLASL VLVRLSKLVI SLDNQDRAFV ADESSTKTLN AIIESLLDSD PSSNCESIVE ATKACSTFSR ISPYSPNSTY ERLFNFWNDS SDIVSQLEPH HTSGVKWAFD VLTLQSSAPS PTALDQAYAD LNLPFAIIPG CLAGLESLSV DRLTSQVHFR VDDIRTTSNK VVSERRQTAW EGDTGVAPFT YSGKAMPRSD WSPLVVQVRD VVQARTNQYY NGCLLNLYTD GCSGMRYHID PDQGTLWDYD TAVVSVGASR RFAFRAMSSS TLQPHNFVVM HGDLTYMFGE CQSQFQHTVK KADDKTDTTP RASLVFKRTW ECKK
|
| |