Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37779 |
Symbol | |
ID | 7202761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 47592 |
End bp | 48662 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181991 |
Protein GI | 219123354 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.648541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGTA TTCCACTTGA GCTCCAAGCC ATCTTTGTTG CTTCGAACAA AGGAGAGGAA GTACACAAGC CGCCTTCCAA TAGAGCTGCT TTCAAGGTGT TCAAGAGCAT CTTCGGAGGC TTTGGGCGAC GTAAGCGATC CACCGAGAAC AAAAGAGAGA ATTTGAAGCG CATTGCAAGC CAGAGCTCAT TTGCCACAGT CAGTACTCTC GGGGTGGATG ACTTTAGCGG CAGCTCCCAC AATGAGAACA ACGAATGTGC CCCCGTTACG CCGCACGAAC ACCTGGAAGT TTTATTGAAG GCACGTGGCT ACTGTACCGA ACGCTATTCT GTTCTTCAAA CCGCATTCTT CAATCGACCT ACACCCCTTC AACTCGCTTC ATACGATACC AAGCTTATAC AGCTCATCAA GAGCCAGGAT GAGCAGAAAG TTCGAGAAAT TCTTGCCAGT GGTATCTCAC CTAACGCTTG CAATATTCAT GGCGAATCTT TGATTCACAA GGCGTGTCGA TTAGGATATC ACCGTCTCGT CCGGGCGTTC ACCGACTTTG GCGCAGATCT CGCAATCTCT GATGCCCAGG GACGTACGTT GCTACACGAC ACCTGTTGGG GTGCCCGACC TTCGTTCCAA ACTTTTTCTC TTATCGTTGA TCGCCAACCA GAACTTCTTT TTCTAGCTGA TTGCCGTGGT GCCTGCCCCC TCGAGTATGT TCGCAAGGAT CACTATGTTT TCTGGATTGA GTACTTGGAC CAAATAGCGG ACAAATATTG GCCTTCAACT CAGTCCACAC CCAAACTTTC GTATCTGGTA AAGCAAGAGC CGCACTCAAG ACAGATTGGA GAACCAGGAA ATGCTCTCTC GTTGGAACTT GCCGCAATGG TTGCGTCGGG AAGACTGAGT CCAGAAGAAG CTACATACTT GGCAAAAGGA GACGAAGAGG ATTCAGTCAG CGGTGAGGAG GATTCATTGA GTGACGACGA CGAATCTACG TGGAACGAGG AAGACAACGA AGACGACGAG CTACTTGCTG ACCTGTGTGG AATTCACAGT CTATCAAGCA TTCCCGTCTA A
|
Protein sequence | MSRIPLELQA IFVASNKGEE VHKPPSNRAA FKVFKSIFGG FGRRKRSTEN KRENLKRIAS QSSFATVSTL GVDDFSGSSH NENNECAPVT PHEHLEVLLK ARGYCTERYS VLQTAFFNRP TPLQLASYDT KLIQLIKSQD EQKVREILAS GISPNACNIH GESLIHKACR LGYHRLVRAF TDFGADLAIS DAQGRTLLHD TCWGARPSFQ TFSLIVDRQP ELLFLADCRG ACPLEYVRKD HYVFWIEYLD QIADKYWPST QSTPKLSYLV KQEPHSRQIG EPGNALSLEL AAMVASGRLS PEEATYLAKG DEEDSVSGEE DSLSDDDEST WNEEDNEDDE LLADLCGIHS LSSIPV
|
| |