Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45021 |
Symbol | |
ID | 7199530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 1025235 |
End bp | 1026724 |
Gene Length | 1490 bp |
Protein Length | 339 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178897 |
Protein GI | 219116204 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0104781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TAAGACAGAA CTGGTTCATA TCAATTTAAT GTAAATGTAT CTTTCATTAT CCACAGCACT TTTCGATTTA TATGGCAAAG AACGGATTCT TTACCGTTGC CATCCCTGCA TTGGACGATT GATTCGAAGC TGGAGAACCA CACTCGCTGG ATTGGGCCCT ACGCACCAAC CGCAAATTTC CTAAATTAAT TGATGCCTGG TACGGTACCG GCTGCGAGAA GGCACACTAT TTACGCCCGG CCCAACAATC GCAAATTTGC TTGCGTGTGT GCTCGTTGAG GACGAGAGAT CGAACCATCG CCCGTTGTGG AACATTGGTT TCTAACTGTA AAAGCACCGC GATACGCCGA TTCCTCCCAC CAACTGCTCT ATCGAAATCT AGATTCTTTG TCGAACTGTA AGCCAAGTCA AGCAAGCAGA GCAGTGTGGT CCGTAGTGGA TCCGCTTTCG AACCTCTATG ATACCTAAAA ATGAACCGAA ATACCAATCG TTCTAGATTC AGCATTCATG ATGCGCAGGA TGAACCACAC GGAAGGCTCG CATTGAACAG CAACAGCGCC TCATTCCTTG ATTACATGCG AGAAACGCAA AGGGACCCCG GGCTACTCGA TCCGCCTGAT CTCTTGTATC CTTCACGCTA CGAAATCGAC ATAATACCGA TGCCTACCGT GGAGGATGCC TTTCGTCACG CTTTGCAAAA ACCAAAAGTT GAGCGTGGCG ATTCGCTGAG GCAGTTTGGG TTTCCGCGGG AGGAAAAGAA ACGTGCAGCA TTCATCCCGC CGGCCAAGAA GAAACCCCCT CCCGAACACC GCTGCACTAC TCAACAGAAT CGATCCCGAA CCTCCCCTTT ACCCGAAGAT GAGGGAGCTA CGGTTATGCC TTCCTCGGAG TGGATCCAGC ATGGATCCGT AAGTAGCTGT AGCAGTATCC CAGATATGGT CAACGAATAC GACAATGCAC ATCCGGCTTT GTCCACATTA AGTGCCCCTT GCTCCTTACC CCAGACAGTC CGAACCACTA GCGCTAGGCA TCTTCACCAA AATGACACCA TCAGCAACCA CGGAATCATT CCCAAGGGCC CCAACCACTA TTCTCCACCG GCCATTCCAC GTCCCGAAAT TGAGGTCGCA CCCGGAGTCT ACATGATGCT TCACGGTAGC GCCGAGACAA TGCAAGCGCT GCAGGAGAAT CGACTCTCGC AGTGCATATG CATGGCATGT ACGATGTCCT TGTATGCCGT GGAATATGCC ACGTTTGTCC TTTGCCCAAC CTGTCGCGTC GTGTCTCCCA AGGACATTGA TTGTAGCGGT GTTGAGGCTT CGTATTTGGG TGCAGGCAGC AGCGACCGGG GGGTCGGGCT AGGACTGACG GAAGAAGTCT ATGAACAATG CACAATGCAA GAAAATGCAC AGGTAACTGA GTGTCCTGCG ACCAATCAGC ATCGACAGCT CTATAGTAGT GAAACGAAAG AGCAAGATCG ACTCTTGTAG
|
Protein sequence | MNRNTNRSRF SIHDAQDEPH GRLALNSNSA SFLDYMRETQ RDPGLLDPPD LLYPSRYEID IIPMPTVEDA FRHALQKPKV ERGDSLRQFG FPREEKKRAA FIPPAKKKPP PEHRCTTQQN RSRTSPLPED EGATVMPSSE WIQHGSVSSC SSIPDMVNEY DNAHPALSTL SAPCSLPQTV RTTSARHLHQ NDTISNHGII PKGPNHYSPP AIPRPEIEVA PGVYMMLHGS AETMQALQEN RLSQCICMAC TMSLYAVEYA TFVLCPTCRV VSPKDIDCSG VEASYLGAGS SDRGVGLGLT EEVYEQCTMQ ENAQVTECPA TNQHRQLYSS ETKEQDRLL
|
| |