Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47992 |
Symbol | |
ID | 7203233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 644230 |
End bp | 645600 |
Gene Length | 1371 bp |
Protein Length | 432 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182267 |
Protein GI | 219123928 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00155367 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCGTTTCA TAATAACAGT AAACAATACA CACCAAGTCA GCAGATAATA TTCTAATCAA TTGGTGGTGA CCATGGCTTA CTCTGAAGCC GAGCGTGATA TATCCGCCAA AGATGCCGCG GTTCGGAGGC TTCTTTCCCT GGCTAAAATA AAAGTCGCCC ACCCCGCGCC CTTTTCAAAC ATTACCAAGC TGGATATCTT TGACTGTGGC ATTTCGTCCC TCCCAGATAG CTTCGCCGAA GCATTCCCTG AACTTTCTAT TTTATTCTTG TCCAAAAACA AATTCAAGGA GATGCCAAAA ATGATTGGTG ATTGCCCCAA GCTACAAATG GTTTCATTCA AGGATAATAT GCTTGCGACG ATTCATCCTG ATGCATTGCA ACCACAAATG CGATGGCTCA TCTTAACAAA CAATCGTCTA TCTAACCTTC CGGAATCAAT TGGTCGATGT CAAAAGCTGC AGAAATTCAT GTTGAGTGGA AATCAAGTGG AATCTCTTCC AGATACTATT CGAAACTGTA TTAGCCTGGA ACTAATCCGA TTAGCTTCCA ACAAATTGAA GGAGCCACCC ACAGCTCTTC TTGATATTCC AAGTCTTCGC TGGGTAGCTT TATCGGGAAA TCCATTTCTG CAGCATCTGC AACCATCATC GGAAGCATTG GACATTCTGG AAGAGGTGGA AGAAAGCATT GGCGAAGTCT TGGGACAGGG TGCTGGAGGT GTTACTCGCA AGGTACTCTG GCGGGATCGT GTTGTTGCTG TGAAAGAATA CAACGGTGCC ATGACCTCCG ACGGTCTTCC TGAGGAGGAA CGTCGCATAT CGTGTGCAGC ATCGGCACTC AACTCCGCTT GTTTTATTGA AGTTCTAGGC GAAACGCAGG CAGGTTCCTT GGTGATGGAG TATTTGGATC AGTATTCAGC TTTAGCCGGC CCGCCAAGCT TTGAAACCTG TTCGCGAGAT GTATATACGG ACTCCGTATG TATTGTTCAT GATGAGCAAG CCGAGAAAAT CTTGTCTCAT TTGTTGGAAG CCCTGGCTAA GCTCCACAGC GTTGGTATAT GCCATGGAGA CTTTTACGGA CACAACATTT TGGTCTCTCA AGATGGATCG GACGTACGAT TGAGCGATTT TGGAGCGGCA TTCTTTTATG ACAGAGAGCA TGAATACGGC ACTTCTATAG AAGCGATTGA GCTACGGTCA TTTGCAGTTT TGGTTGAAGA AGTTAACTCT TTGCTCAAGC AGCAGAGTGA GCGATTAGAC AAGCTCGTAA GAAAATGCCG AGAGCAGGGT TGTTCGTTTG CGAAACTTCA CATCTGGTGG AAACAACTAC AACTGGCTGG GCTCGCATCT GCCTTCGCTG TTGACGCCTA A
|
Protein sequence | MAYSEAERDI SAKDAAVRRL LSLAKIKVAH PAPFSNITKL DIFDCGISSL PDSFAEAFPE LSILFLSKNK FKEMPKMIGD CPKLQMVSFK DNMLATIHPD ALQPQMRWLI LTNNRLSNLP ESIGRCQKLQ KFMLSGNQVE SLPDTIRNCI SLELIRLASN KLKEPPTALL DIPSLRWVAL SGNPFLQHLQ PSSEALDILE EVEESIGEVL GQGAGGVTRK VLWRDRVVAV KEYNGAMTSD GLPEEERRIS CAASALNSAC FIEVLGETQA GSLVMEYLDQ YSALAGPPSF ETCSRDVYTD SVCIVHDEQA EKILSHLLEA LAKLHSVGIC HGDFYGHNIL VSQDGSDVRL SDFGAAFFYD REHEYGTSIE AIELRSFAVL VEEVNSLLKQ QSERLDKLVR KCREQGCSFA KLHIWWKQLQ LAGLASAFAV DA
|
| |