Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50586 |
Symbol | |
ID | 7199406 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | - |
Start bp | 210608 |
End bp | 211789 |
Gene Length | 1182 bp |
Protein Length | 362 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185543 |
Protein GI | 219130797 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.616621 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAAC ATTTGCTTGA ACGAACAACA GCGTGGTTGA CGACTTGGAG TCTCATTCTG CTCGTGGCCG TTGCTCAAGA AGATGTTCAC CCTGACGTAG TTGCCGTTGT TGATGTTGCT GGAGGTGTAC GACCGACACC ACTTTGGGCA AGCAGTTATT CGGACGGCGA GAATTGCTAC TGCCTTCCTT CGTTGGATAG CGCCATTGGG AACTTTGTCG TAGAAACGCC ATTAGGGTGG CTGACGACGC AGGAAGTGTG CGATCTGCTA GGAACAGGAC CAGGAAGACT AGGACAACCC CTTTACAATG ATATCCAATG CGGTAACGGC CCCCCAAACG CTGATGAAAA CGAATTCCTT TGTCCGGGAC GAACCGATGT AAGTGATTGC GAGGAAAAAG CCGAGCGCTG TCCACTAGTA GAAACAAATA TATCTCAACG GCAAGCTTTG TACTGATTCG CTCTTCCTCA GATTGGGGAA ACAGGTTGTG GTCAAATAGG ACCCAAATGG AATTTTGATA ATGCAAACCT TGCGGACGGC CCCCCACGAC TTCCATCGTT GCCTGAGGAC ATTCATCCCG ATATCGTTGC GGTGATCGAC GTTGTGGGTG GTGTGACGCC GAATGGAAGA TCGTGGGCCG ACAGCTATTC CTTTGGCAAC AAGTGCTATT GTGCGACAAC GTTTGATCAC GACATTGCGG ACGTGCTAGT CGAAACACCG CAGGGATGGA TGACGATCCG TCAAGCTTGC GAGTTACTTG GACCGGGTCC CGGTATTGAA GGACGACCGG TGTACAATGA CATACAGTGC GGGAATGGAC CACCTAATAA TGCAGGCGAT GAGCACGTGT GTCCTGGACG AACCGATGTA CGTCGACTGG AACGCGGTAG ACTGGCTAGT ATTTTCAGAC CGTTGTTTTC TCGGACTGTT TATAATGAAA TGCCTCACCA TGATGCTTCT CCGTTGTACA ATTCACAGCT TGGACCAGAA GGTTGTGGTC AGATTGGTCC CCGTTGGAAT TTTGATGCCA TCAAATCATT ACCGCCAGGC AGCGCCCCCA CAGCTCTGCC CTCTTCTTTA GCAGCCGGAG CAGTGCCTGT CCCAATGCTA CGGGGCTTAG GGGTAATCAC CGGATATCTG TTCTGTGTTT TGAACTGGCA ACTTTTTGAT CTCGTTCCGT GA
|
Protein sequence | MKQHLLERTT AWLTTWSLIL LVAVAQEDVH PDVVAVVDVA GGVRPTPLWA SSYSDGENCY CLPSLDSAIG NFVVETPLGW LTTQEVCDLL GTGPGRLGQP LYNDIQCGNG PPNADENEFL CPGRTDIGET GCGQIGPKWN FDNANLADGP PRLPSLPEDI HPDIVAVIDV VGGVTPNGRS WADSYSFGNK CYCATTFDHD IADVLVETPQ GWMTIRQACE LLGPGPGIEG RPVYNDIQCG NGPPNNAGDE HVCPGRTDVR RLERGRLASI FRPLFSRTVY NEMPHHDASP LYNSQLGPEG CGQIGPRWNF DAIKSLPPGS APTALPSSLA AGAVPVPMLR GLGVITGYLF CVLNWQLFDL VP
|
| |