Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46302 |
Symbol | |
ID | 7201496 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 867824 |
End bp | 869837 |
Gene Length | 2014 bp |
Protein Length | 504 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180722 |
Protein GI | 219119943 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0148963 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TATCCTGTTA CCTACCAAGT CCGTCGACCA CCTTCTTCTA TCCGTTCGAT TCCCCTTACA GTTAGCTTCT TCCACTGCGA TACCGTTACT GCGTACTCTG ATTTGTCGTC GTCCGTTTCC TCTAAAGCTC CGTCGGATTC GCACTGGTAG AACAAACCGT CCAGCATGAG TGACACGGCC GATGCACACC GCGGCGGTAT CCACCAGGAC ACGATCCGGG AAGAACCAAC GACGAGTTAT TCCTACAGCG TGACTCACAG CCCGTCGGGC CAATCATCGT CGTCGTCCCG ACCTCCCTCG CACATGAAAA ATGAAGTGTC CGGCGATCAC CGACTCTCCA GGTATGTACA CAACATACCA CTGTGGAGAA GCAAAATCCT GAGTATCTAC TCCGAGGCAC CGTAACGTAC CGTACCGCCA CTCACACGGT TGGCTTTCCT TTCTGTCTCT TCCTTTTATA ACACCCCCCC ATTCTCGTGT TGTTTCTCTC TCGAATGCAG TATACACCGC GGCGAATCCG AAGTGGAACA AATGGCGGAA CGAATCGAAA CACTCCGGGC CAATCGCGTC GTCATTACGT TTCCCGTCCG TGTCCGCTTT CTCACCGAGT GTCGCCAGAT TGCCGTCGAA CAGGGCTACG ACTCCACCAA AATTAGTCGG ACCGATCTCG AAGTCTTGCG CTCCCGGATT GCCGAAGAAG CGTCGCGGAC ACCGGAGGTC TGGACCCGCG CGTTGGGCGT TTTGGGTTTC TACAACGTGG CGCCCGCCTT GTCGGTCGTG CTGGATCGGG CGATGGAAAC ACTACTACTG CACGACGACG ACTGGTTGCA CTACCAAGAC TTTTCCTTCT ACTATACCCG GGTGGTGGCC CGTTGGCCCT GGTTGCGTTC GGAAATGCGC GTCGCTTGTC TCATTATTTT TGCCTACTAT TTTTTTACGC CTATTCTGTT CTGTCACATT GTCCAAGAAG ACACCATTTG CGTGGATCTC GGCCGTTCCT ACGACGGGTG GTTATCGAGT CTGTACTTTG CGTCGACGAC GCTTTCCACG GTCGGGTACG GAGATTTAAA GGTGGAGCAA TCCCCACGGT GGCGGTCTTT CATCGGATCG CTCTACATGA TCGTGTCGAT TGTCGTGGCC GTGGTGGCCT TTTCGGCAGC CGCCGGCAAC GCCTTTTCGC CATTGAACAC CTACGACGCT TGGATCGCCA ATTTCTTCAT TGGTGATCCA CACCCCAACG ATTTTCTCTA CACCCGCGTC GCCCGCGTTA AACTCGTCAA GATAACGGAG ATTGTCGTCC AATTCGTCCT CCTCAACCTG ATTGGCGTCT TTGTGACGCG GTACTACGCC CGGCATTCCG AGGTGGAGGA GCAGCAGTGG ACGTGGATGA CGTCGTTGTA CTGGGCCATC CAGACCACCA CGACAATTGG ATACGGTGAC TTGGATATGC CCTTTCAGTT ACGATGGTTC CAAATCTTTT ACCTCACACT ATCGACCTAC TTTGTGGGCA ATTGTCTGGG CAAACTGGGT GCGTTACGTG CCGAATTGGC GGAAGTCCGA CGACGGTACG TGTGGGAACG CCGCAAGGTA ACCCGTCGGT TCATTGACGA ACTACAGGCG TACGAACACG ACGACAACGT GGATCAGTTC GAATTCCTGG TGGCGTCGCT GTTGATGCTC AACAAGATTT CGTCCGCGGA CGTGACACCC ATTATGGACA AGTTCCGGGA ACTGGCGGGG GGCAAGGGCT TTATCAACGC GTTGGACGAC GCGGAGGAAG ATCCGATTGC GGACGAAGCG GAAGCGGTCG ACGCCGACAT TCAGGGTCAG CAAAGCTAAC GAATGGCAAA CGTACAGCAA TCCGGTTTTG GGGCGACGCG TGGAAAACCA TACCGACGAC TCTTGGTTGG TTTGGGGTTG AGTGCTACTG TTTATGTACG TACGTACGTA CGTATGAATG AGTGTATCTG TGTGTATATA AAGAGATAAC GTATAAAGAT GCATTAGGTA TGCA
|
Protein sequence | MSDTADAHRG GIHQDTIREE PTTSYSYSVT HSPSGQSSSS SRPPSHMKNE VSGDHRLSSI HRGESEVEQM AERIETLRAN RVVITFPVRV RFLTECRQIA VEQGYDSTKI SRTDLEVLRS RIAEEASRTP EVWTRALGVL GFYNVAPALS VVLDRAMETL LLHDDDWLHY QDFSFYYTRV VARWPWLRSE MRVACLIIFA YYFFTPILFC HIVQEDTICV DLGRSYDGWL SSLYFASTTL STVGYGDLKV EQSPRWRSFI GSLYMIVSIV VAVVAFSAAA GNAFSPLNTY DAWIANFFIG DPHPNDFLYT RVARVKLVKI TEIVVQFVLL NLIGVFVTRY YARHSEVEEQ QWTWMTSLYW AIQTTTTIGY GDLDMPFQLR WFQIFYLTLS TYFVGNCLGK LGALRAELAE VRRRYVWERR KVTRRFIDEL QAYEHDDNVD QFEFLVASLL MLNKISSADV TPIMDKFREL AGGKGFINAL DDAEEDPIAD EAEAVDADIQ GQQS
|
| |