Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44969 |
Symbol | |
ID | 7199646 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 841999 |
End bp | 844170 |
Gene Length | 2172 bp |
Protein Length | 561 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179074 |
Protein GI | 219116558 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGAGCCATCC ATCAAATGGA GTAAAGTACG TCAACGACTC CTTTGACAGT CCGTGAGAAT CTCATTGTGA ACTTTCTCGG CCCATACAAT GATCCTCTCG CATTTACAGT GACAGTGAAA AAGACGTGTT ACCTCTACAT TGGATGGATA GTTTGATGGT TGTCTGTCTT TTATCGAGTG CGGACGGATT AAAGACCACA AAATCACGTG GCAGGCTTAG GCTTCGGAAT GGTATGAGAC CTCATTTCAT TCTTGAATGT GCTTTGGCTG CAAAAATTGT TGACGAGCCT TCTGCCTTTC CCGGTTGACC TGTAAGCGTG AAACTGTCCA CAGATTGATT GTTTATTGTT CCACTGGCTG ATATGCTGTT TGCTTGTTTG TTTGTTTTTT TGCAAAAGTG GTGCACATCT GACTATGAAG AAACAAATCG GTGTTCTGTT CGTTTTATTG CACCTGTGTA ACGCGTCGTC TAGATCCGTA GTTGCGGCTG CAAATGTTCG TACAAACCAG AAGTGCTCGA CAAAGACTCG TGAACCACCT TTCGGTAAAT GGATTGAGGA GGCGCGGCTC TTGGGATGTT GCTCGCCACT GATTTCTGCA TCAATGTTTG GTGCGGCAAG TGCAGTACTA CTTGCTGGAA CCGCAATGGA GCCAGTCTCA AGAGCTCTTT ACTTTTGGAG GACAGCCGGG CCCGCAATTT TCCATTATAA ATTTACGCAG TGGTGGTTGG AGGCATCCAA GGCTGACATA GAAAAGCGCG ATCTAGTTTA CGAAAGCTTG CACGATCGAT ATGCTGAACC CGCCTTGAAG ATGATGATTC GCCAAAAAGG TCTGTACGTG AAACTCGGAC AGGTTCTTTC CTCGCGACCA GACTTTTTAC CATCCCAGTA CATCGAACGT TTCGCAACTG TTCAAGATTC GATACCGCAA TGGCCTATCG ACCAGGTACG CGCGATAGTG GAGAAGTCCT TGATAGTCGA ACTCGGCCTG TCTTGGGGGG ACGTCTTCGA ATCCATGGAT GATATAGCAC TTGGGTCGGC AAGCATTGGG CAAGTCCACA GAGCCGTGTT GACCGAGAAG TGGGCCAAAA CTACAGGATA CAGAGGAGAT AAAGAAGCGG CTGTGAAAGT CATGCATCCC AACTCCCAAA AGCTATTTGC ATATGATTTT GATGTGTTTC GATGGGTTTG CCGTATTGCA TTACCCGGTT GGAAAGGTTT CTTGGACGAG CTCGAGCGAA GAATCATGAG CGAGTTCGAC TATCGGCACG AAGCCACTTC GTTGGATGAA GTTCGTTCTC CGTACAAATC TAGAGTTTAT ATACCGCAAC CTTTACAGGA GCTCTGTTGC CGTCATGTGC TTGTAATGGA GCTTCTAAAA GGGCGGAAAT TAGTGGACTC CTTTGAGGAC GGCCTGGCGA ATGCGATGGG AGGGCATGAT CTTGCCAAAG CATATTTGGC GAAGAAACAA AGAGAAATAT TGCTAGGTTC TTGCGATGGT ACAGATTATG ACGCCATCTG GCGGCTACCC ATAGTCAACA AACTCAAACT TTTGTGGCTG AGAAGAGTGG CATCCAAATA CATCGATCTT TTGCTAGACG TCCACGGGTA CCAAATCTTT CAGAACGGAT GCTTTAACGG AGATCCTCAT CCTGGAAACT GTTTGCAACT AGAGGATGGA CGTCTTGGCT TGATTGACTT CGGTCAAACC CGCCGTCTTA CAGAGACAGA AAGATACGAT TTGGCCCGAA TTGTATGTGC TTTAAATGAT CCTTCCACGG ATGCTATCGG AATTGATTAC GCAATGCAAA CGGCTGGCTT TCAACTCAGA GACGCAAGTG CGGAGATGAT GGTCAAATAT GCTACGATTT TCTTTGACTC TGACGAGGAT AGCAAACACC TGGGCTTCGC AACACCACAA CTTTATTTTG CTAGTTTAAT GGCAACCAAT CCTTTGGTTG TCATTCCAGA TTCAGCTGGT ACGTAGTTTG ATTGAAACTT GAGCTATGAA TTGTCGAATT GATTGCATCT GAAATTGCTT CTTTGTACAG TGTTTGTCGC TAGGACGAGC TTCTTGTTCC GTGGCATCGG TAGTGGTGTC GGCTCCGGCC CACTGAGAAC TTCACAAAGG TGGCAAGAGC ATGCTGCTGC TGCAATTAGC CAGGTCTCAT AG
|
Protein sequence | MKKQIGVLFV LLHLCNASSR SVVAAANVRT NQKCSTKTRE PPFGKWIEEA RLLGCCSPLI SASMFGAASA VLLAGTAMEP VSRALYFWRT AGPAIFHYKF TQWWLEASKA DIEKRDLVYE SLHDRYAEPA LKMMIRQKGL YVKLGQVLSS RPDFLPSQYI ERFATVQDSI PQWPIDQVRA IVEKSLIVEL GLSWGDVFES MDDIALGSAS IGQVHRAVLT EKWAKTTGYR GDKEAAVKVM HPNSQKLFAY DFDVFRWVCR IALPGWKGFL DELERRIMSE FDYRHEATSL DEVRSPYKSR VYIPQPLQEL CCRHVLVMEL LKGRKLVDSF EDGLANAMGG HDLAKAYLAK KQREILLGSC DGTDYDAIWR LPIVNKLKLL WLRRVASKYI DLLLDVHGYQ IFQNGCFNGD PHPGNCLQLE DGRLGLIDFG QTRRLTETER YDLARIVCAL NDPSTDAIGI DYAMQTAGFQ LRDASAEMMV KYATIFFDSD EDSKHLGFAT PQLYFASLMA TNPLVVIPDS AVFVARTSFL FRGIGSGVGS GPLRTSQRWQ EHAAAAISQV S
|
| |