Gene Information |
Locus tag | PHATRDRAFT_50148 |
Symbol | |
ID | 7198849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 194929 |
End bp | 196062 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184986 |
Protein GI | 219129629 |
COG category | |
COG ID | |
TIGRFAM ID | |
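
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the source database: it assumes the start and end positions are 1-based and inclusive, and that the CDS length counts the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp rows of this record.
length_bp = gene_length(194929, 196062)
print(length_bp, protein_length(length_bp))
```

Under those assumptions the computed values agree with the Gene Length (1134 bp) and Protein Length (377 aa) rows.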
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATTCGCA GCAGGATCCT TGTTTCACTC CGACGGACGC TCGTAGTATA CATTTTTGGC CTACTCTGGT CCTGTGTTTC CGCGCTTGGA AGGCCAGCGT ATATAGTTGC TACCGACAGG AGACCGAGCT GTATCTTTCC TTCGACAAGT CTTCGTCATT TTGCCGCTCC AATCAATCTT GCGGATGTTG GAGTGTTCAA TATTCCGGGG AGCGATCCCG AACGACCCAG AAAGGTAAAT CAAGATCGTC ATTTTTTGTT TCAGAACGAC GAAACGATCG TGGTGGGAGT GATGGATGGG CATGGAAAGG ACGGCCACAA ATTGACAGAG TTTCTTGCCA CACAACTTCC CTTTCGCTTA AAAGAAAGCT TAAACAGTAG CACACTGAAC AAATCCGACC ATGCGGCCTT ACGGGAAATG GAAGAAAGGC TCGTGACTTT GGGGAAAGCG AAATCCACCT CCATGAGCGA GGAGCACGAT GTCGTTTCTG GTGCTTTGAT CCAGGCTTTT CACCTTGCTC ACCATGATGC ATGTCAGAAT ACTGCAGTCC CCGCCGGTAG ATCGGGAACC ACTTGTGTTG CATGTGTCGT CACCGATGAT TCTATTGTAA CGGCCAGTGT CGGAGACTCC ACGCTTATTT TAGGTTTGTA TGCCGCTCCC GACGTAGCTG CGGAAGGATA CGTTCTTTCC ACGGAAGTTC TATCTGTTCG CACAACCGTT CAAATTGAAG GAGAAAGATC AAGAATCCAA GCAGGAGAGG GACGTGTGGA TGGAAATGGG AATGTCTTCT ATGGACCTGT AGGAATCGCC ATGACAAGAG CCTTGGGCGA CGCTGTCATG CTTCGAGCAG GAATTTTGCC AACTCCAATG ATCCGAAATT TCAATCGGCC TGTGAGTCAC GCCCAAGAGG AAACTGGTGA AATTTTGTCC ATGATTGTTC TCGGGAGTGA CGGAGTGTTT GACGTAATGA CGAAAGAGGA CGTTATACAA TTGGCCGGCC AAGTTATACA GGAGTTTGAA TCGACTAAGA CTGCCGCCGA AGCCATTTGT AACGAAGCCA GGCGACGATG GCTAGCAAAC TTGCCGATAG AACCGAAGGT GGATGATATA ACCTGCGCAG TAGTTCAGAT ATAG
Protein sequence | MIRSRILVSL RRTLVVYIFG LLWSCVSALG RPAYIVATDR RPSCIFPSTS LRHFAAPINL ADVGVFNIPG SDPERPRKVN QDRHFLFQND ETIVVGVMDG HGKDGHKLTE FLATQLPFRL KESLNSSTLN KSDHAALREM EERLVTLGKA KSTSMSEEHD VVSGALIQAF HLAHHDACQN TAVPAGRSGT TCVACVVTDD SIVTASVGDS TLILGLYAAP DVAAEGYVLS TEVLSVRTTV QIEGERSRIQ AGEGRVDGNG NVFYGPVGIA MTRALGDAVM LRAGILPTPM IRNFNRPVSH AQEETGEILS MIVLGSDGVF DVMTKEDVIQ LAGQVIQEFE STKTAAEAIC NEARRRWLAN LPIEPKVDDI TCAVVQI
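
Both sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any length or composition check. The following is a minimal sketch under that assumption. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
def clean_sequence(chunked: str) -> str:
    """Remove the whitespace that separates the display blocks."""
    return "".join(chunked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three display blocks of the gene sequence, copied from the record.
prefix = clean_sequence("ATGATTCGCA GCAGGATCCT TGTTTCACTC")
print(len(prefix), round(gc_percent(prefix), 1))
```

Applying the same helpers to the full gene sequence would allow a comparison with the Gene Length and GC content rows, provided the sequence text has been copied without loss.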