Gene Information |
Locus tag | PHATRDRAFT_47316 |
Symbol | |
ID | 7202484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 274347 |
End bp | 275622 |
Gene Length | 1276 bp |
Protein Length | 414 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181521 |
Protein GI | 219122374 |
COG category | |
COG ID | |
TIGRFAM ID | |
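The length fields in the table above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates, and that the coding region is the protein length times three plus one stop codon; the helper names `span_length` and `cds_length` are hypothetical and not part of the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def cds_length(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length_aa + 1)


# Values copied from the Gene Information table.
gene_len = span_length(274347, 275622)
coding_len = cds_length(414)

# Prints: 1276 1245 31
print(gene_len, coding_len, gene_len - coding_len)
```

Under these assumptions the computed span matches the listed Gene Length of 1276 bp, and the 31 bp difference corresponds to the bases that follow the TAA stop codon at the end of the gene sequence given in the Sequence section.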
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00838049 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCCCA GTGGGTCGGT AGGGGCGACT GCGGGTGCTT CTTCCTCCAC TAGGGCCGTG GAAATGCGCT CACGGGCTTC CGAGATGGAC CTCGAAACGA GACCCTTCGT AACGGACCGG AAAGTTCACC GCACACCCCA TTCCACCGGT AATTCGTTTC GACGAATAGG ACAACAATTG CGTTTGCGAT GGAGTCTATT GTCGATGTCG GCCCGCATTA TTTGTGCCCT TTGCCTACCT CTGTTGCTGA TACACGTCAC GGTAGGGACG GCCGATTGGC TCTGGGGAGG CTACCAGAAG ACCTTACCGA ATGTCGCCAC GTCTTCCGCA ACCGACTTTG CCGTGGTAAT CAATACCTAT AAACGGCCAA ATATGTTGCG TGAGGCGGTC CAGCACTACG CACAAATCTG CGGACCGCGG TTTGGCGTCG GACAAGTCTT TGTCGTGTGG GCGGAACTCG ACGTGGTGCC TCCGGAACCG TCCACTTTCT TGGAGTCGGC CGGAACGCGC GGCCTTAAGA CAACAGCCGA AGTGCATATG GTAGCGGTGG CGAAAGATTC TCTGAATTCA CGTTTTTTAC CGATCGAACG ACTCCGGAGT GACGCCATTT TCATGGTGGA CGATGACGTG CGCGTCGATT GTCAATCGCT GCGTCAAGGT TTCTGGGCCT GGAAAGCCTC GCCCCATTCC ATGGTGGGTT ACTACCCCCG GCTTGCGCAA GCCCCTCGCC GGCGGCAATC GGTAGATACC GTCGGGGCGG AGTACGTCTA CCACTCTTGG CCCATGGTCT TTTGGAAATC GCGCCTCAAT TTCATTCTCA CCAAGGCAGG ATTCGTACAC CGACGCTACC TGACCATATA CTCGGACCCT TCCCAACATC CCGTGGAAAT CTTGGACTAC GTAGACCAGC ACTTTAATTG CGAAGACGTC GCCATGAGTC TGTTGGTTGC CAACGTGACG CGCGCGGAAA CGGGCATCCC GGCACTCCCG GTTTACGTGG AAGCGTCCGT TTCCGACCAG GGTTTGTTCG GCGGCATTAG CACCGGATCA GGTCACATGT CTCAGCGATC GCGCTGTTTG ACCGACTTGA CCAAGGTGTA CATCCAACAC GGATGGTCGC CACCGTTGGA CCGTACCTTT GCGCTGGCCG ACGCCTCGTG GGTCCAACAC GCCCCGGGGA CCTGGTGGCA ACACCGACCC AGCAACGTGT TTGAATGGTT CGCCCTCGAA AATGTATTCC AGTAACAGAA CGTGTAAATT AAAAATTATT GCATTT
|
Protein sequence | MHPSGSVGAT AGASSSTRAV EMRSRASEMD LETRPFVTDR KVHRTPHSTG NSFRRIGQQL RLRWSLLSMS ARIICALCLP LLLIHVTVGT ADWLWGGYQK TLPNVATSSA TDFAVVINTY KRPNMLREAV QHYAQICGPR FGVGQVFVVW AELDVVPPEP STFLESAGTR GLKTTAEVHM VAVAKDSLNS RFLPIERLRS DAIFMVDDDV RVDCQSLRQG FWAWKASPHS MVGYYPRLAQ APRRRQSVDT VGAEYVYHSW PMVFWKSRLN FILTKAGFVH RRYLTIYSDP SQHPVEILDY VDQHFNCEDV AMSLLVANVT RAETGIPALP VYVEASVSDQ GLFGGISTGS GHMSQRSRCL TDLTKVYIQH GWSPPLDRTF ALADASWVQH APGTWWQHRP SNVFEWFALE NVFQ
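The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before their length or GC content can be compared with the Gene Length and GC content fields. A minimal sketch follows, using only the first three blocks of the gene sequence for brevity; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the record itself rounds GC content is not stated.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not sequence:
        return 0.0
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First three blocks of the gene sequence, copied from the record.
fragment = clean_sequence("ATGCACCCCA GTGGGTCGGT AGGGGCGACT")

print(len(fragment))                      # number of bases after joining the blocks
print(f"{gc_fraction(fragment):.1%}")     # GC content of this fragment only
```

Passing the complete gene sequence through the same two helpers gives values that can be compared with the 1276 bp and 56% listed in the Gene Information table.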