Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38848 |
Symbol | |
ID | 7203595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 347991 |
End bp | 349646 |
Gene Length | 1656 bp |
Protein Length | 552 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182948 |
Protein GI | 219125354 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGTCC AGGGCTTGTG GCGTTTGCTG TTGCCCATCG GTCGTCGCAT CTCAATTGAA ACGTTGGAAG GCCGCGTGCT CGCGGTGGAC GCCTCCATTT GGCTCACGCA GTTCCTCAAG GCCATGCGCG ATCCGGACAC GGGCAAGGTC CAACCCGCCG CGCACTTGAT TGGCTTTATC CGTCGGCTAT GTCGTCTCCG CTTTCACGGG ATCCGCGCCG TCCTCGTCTT TGACGGACCC ACGCCCGCCA TCAAACGACG TGAAATAATG CGACGACGCA AGCAACGCGA ACAGTTCGCT ACCTTGGGAC CAGCCGGAGT CCAACGACTC GCGCGACGGC TACTAGCCCA AACCCTACAG CAGCAACAGC AAAAAAAGCC AACAGTCCCA GAGCCTCACG GTCACGACGC AGCCTTCCAC CCAACGTCGT CGACCGGAAC GGCACAGTCT TTGGCGCCCG GTTTCAACCC AGGGGGGCGG GACGGCAATC CGAAAGACGG ATCGACCACG AACGCTGCAA CTTTGGCTAC CGCTGCTTTA TCGAATGACC CCACGAATTC ATCAGAAGAG CCCGCAGCAT ATTCGGCTAC ATCCGACTCA CTCGGTACGA AAGCTGCGAC CGATTCTGCG CCTCCGGAAC AACCTGCCGC CGCATACCCT CCGGATACCG GCAGTGACGC TGCTATTGCC GCGGCGTTGG AATTCGGTTC GGACAATGAG AACAACACGA ACGACCCCCA CGTCCGCGAC GACGATGGTG ATGATCCAGT CAACGACTGG GATTTACCGC TCGACCACGA GGCGACTGGG AACGAGTCCA GCAATAGCGA CGACCCCGTA ACAACCAATA GCAGCTCCCT CGGCTTTCCT CGTTCCAATA AACGCCAACG CCGTTTGTGG GATGAGCGTC GCGGCACCAT GGACGTGGCG CAGATAGCTG CCTTACCACC CGGCCAACGT AGGGATGCTA TTGAAGCCGC CAAACGAACG CAGCGACTCC TTTCGCGACG GGAATTCATG CCGGCCGCCG CCAACCCCGA CGCCTTTTCG TCCGTCCAGG TCACCAACTT TTTGCGGTCG ATCCGTCTCA ATCAATCCAT ACACGCCATG GCGCTCCGTG TCGTACACGA CGAGGAAAAG GCGTTTGCAT CCCAACCGGG TGAATTTATG GCGTCGGATC GGAACACCAG AGTATCCCTG ATACGGGAAG ATGATCCCGA CGATAACGAC CGTACCACAC CACCAGACGC GCCAAGGGAG CGACCGTCGG CGGCTCTGCG GGCACGGCAA CAGCAAAACC GACGGAACAA TGGTACCCAT CGATTTTCGC ACCGAGATTC ATCATCCGAC GAGGATTCAC AGTCAGTCGG AATCGGAAAA TGTGCCAACC CAGCCTTTGC TGCTGCAGGG AAAAAACGAC GACGGGCCAT TTTGGATGAG GAGGATGACT ACGGCGATTC TTCAAAAGAA GAGGATCGCG TCAATCAAAA GTCTACGCTT TCGACAAACA GAGCATGGTA CAAGGATCGC CCGCAAGCGA CATCGCATTT ACAACCTCTG GAATTGGACG ACAGTAGCGA AGACGATGAC AGCGTCCCAA ACGCAGAACT TAACAAAATG GAATTAGACT ACAGCCAACA ACGGCTTGAC TCGTCG
|
Protein sequence | MGVQGLWRLL LPIGRRISIE TLEGRVLAVD ASIWLTQFLK AMRDPDTGKV QPAAHLIGFI RRLCRLRFHG IRAVLVFDGP TPAIKRREIM RRRKQREQFA TLGPAGVQRL ARRLLAQTLQ QQQQKKPTVP EPHGHDAAFH PTSSTGTAQS LAPGFNPGGR DGNPKDGSTT NAATLATAAL SNDPTNSSEE PAAYSATSDS LGTKAATDSA PPEQPAAAYP PDTGSDAAIA AALEFGSDNE NNTNDPHVRD DDGDDPVNDW DLPLDHEATG NESSNSDDPV TTNSSSLGFP RSNKRQRRLW DERRGTMDVA QIAALPPGQR RDAIEAAKRT QRLLSRREFM PAAANPDAFS SVQVTNFLRS IRLNQSIHAM ALRVVHDEEK AFASQPGEFM ASDRNTRVSL IREDDPDDND RTTPPDAPRE RPSAALRARQ QQNRRNNGTH RFSHRDSSSD EDSQSVGIGK CANPAFAAAG KKRRRAILDE EDDYGDSSKE EDRVNQKSTL STNRAWYKDR PQATSHLQPL ELDDSSEDDD SVPNAELNKM ELDYSQQRLD SS
|
| |