Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40022 |
Symbol | |
ID | 7195496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 618369 |
End bp | 619814 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184032 |
Protein GI | 219127624 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGGTA CTGAACACGT TGTGATTAAG CAGTCGGTGA AGATCGAAAA CATCACACGC AGTGGACTTC AGTATGGCAC AGCGACCGGA CAAAATCATT CGAGACACAG CAAAACAGCA GAATCGCCTT TTCAGTCTTT GCAAACGTCA CTAGAATCTC AGCGACTTAC ACCTATCGCT ACCAGCTCAA CCGTTAGGGG CAATCGCTCC GTCACTGATC TTCCATACGC AGATGCTCGC GACGAAGATG GTTCCTGGGG GTACATCGCC GACGCAACCC AGGTGAGGAG TCGAGTCCTG GCGCTTCTAC CCTCAAACCA TACACTGCAC AACAATGTCA CCAGTTTCAT ACCCATGACG GAATCTGAAC AAGAAGAAAT ATGCCAAAAG CCACCCGGAA GCGGACCGGA GCAAGAATTG GGCTGGAAAC TGATGCAGCG TGTTGTCGTC AATGCGCCCG AGCCGAGGTA CGCCAACGAG TCTGCAGTCA TTGTCGCCAA CGAATCTGCA GTCATCGTCA CCAACGAGTC TCCAGTCATC GTCACCAACG AGTCTCCAGT CATCGTCACC AACTCGTCCT CCATCTCAGT AAGCCACCAC ACAGAAACAG TAGCACCCAA AATTCTTTGT GTCGTCTACA CGTATGATGC TCATCACGAT CGAGTTGCGG CGATTGGTGA TACCTGGGGT TGGCGCTGTG ACGGCTTTTT GGCCGCCTCC AACCGAACTA TTCCGGAGCT TGGGGCTGTA GATTTGCCCC ACGTTGGACC CGAAGCTTAC GGCAATATGT GGCAAAAGAC GCGTTCTATA TTGGCGTACG TGCACGAACA CTATATTGCG GAGTACGATT ATGTGCATGT GGCAGGAGAC GACACGTACG TGATTGTGGA AAATTTGAGA AATTACTTGG AGTTTACGGT AGAGGCAAAA CATGGTCGAG ACAAAATACC ATTATACTTG GGTCAGAGAA TTGTTGCGGG GGCTGGTTTC GCATTTGTTT GTGGAGGAGG GGGTCACATT TTGAACCGAC TGGCTTTGGA CCGTTTCGTC AAAGAAGCAC TGCCAACGTG TGAGGCCGAC AGAGAAGACC CTGCCGAAGA TCGCTGGCTA GGATATTGCT TGAGAGAATT GGGTATTCAT CACACGGACA CAGTTGATGG TTTCAATCGA CAACGATTTC ACAGTTTCGA TCCATACGAT TTGGCTTCAA GGAATCCGCA GAGAGGCTTC TGGAAACGGC AGTACAAATT GTGGGGAGAG ATGTACGGCC TCAAGTGGGG CATTGACTTA GTTTCGACAC AAACCATAAC GTTTCATATC ATAAGGGGGG CGACTTGGAT GAAGCGGGTG CACGCTCTAC TTTATTGTAC ATGTCCTGTG GGTACAGTAA TGGGCAATAT CCTCTCGCAG GTAATGGATG CAAGTGTAAG CAATAGACGT ATATAA
|
Protein sequence | MDGTEHVVIK QSVKIENITR SGLQYGTATG QNHSRHSKTA ESPFQSLQTS LESQRLTPIA TSSTVRGNRS VTDLPYADAR DEDGSWGYIA DATQVRSRVL ALLPSNHTLH NNVTSFIPMT ESEQEEICQK PPGSGPEQEL GWKLMQRVVV NAPEPRYANE SAVIVANESA VIVTNESPVI VTNESPVIVT NSSSISVSHH TETVAPKILC VVYTYDAHHD RVAAIGDTWG WRCDGFLAAS NRTIPELGAV DLPHVGPEAY GNMWQKTRSI LAYVHEHYIA EYDYVHVAGD DTYVIVENLR NYLEFTVEAK HGRDKIPLYL GQRIVAGAGF AFVCGGGGHI LNRLALDRFV KEALPTCEAD REDPAEDRWL GYCLRELGIH HTDTVDGFNR QRFHSFDPYD LASRNPQRGF WKRQYKLWGE MYGLKWGIDL VSTQTITFHI IRGATWMKRV HALLYCTCPV GTVMGNILSQ VMDASVSNRR I
|
| |