Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_11009 |
Symbol | |
ID | 7197628 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1082486 |
End bp | 1084393 |
Gene Length | 1908 bp |
Protein Length | 535 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178359 |
Protein GI | 219115127 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCCGGCAGT ATCAACTGGA GGGAATCGCT TGGTTACGTT TTCTACATAC ACTCAGACTG AACGGCGCTC TGTGCGATTC TATGGGACTT GGGAAAACAC TACAGGCTTT GATTTGCGTT GCCATTTCTC ACGACGTAGT CCACCATGCC GCACCAGATA GCAAACCAGT CTCCATCGTT GTCTGTCCTT CTACTCTGGT TCGACACTGG ATCGCTGAAA TCAACAGATT TTTCAAAAGC GACGATCCGG TTTTCTTTCC TCTCGAGCTT TCTGGTAGTA GCACGAGTCG TCGAGCAGTA TGGGAAAAGG GCTTAGTATT TTGCAACATA ATTGTTACAA GCTATTCTGT TCTACGGAGC GATATACGAA TGCTTGCATC GCAAAGCTAT CACTATTGTG TGCTCGATGA GGGCCACCTC CTCAAAAATC CAAAGACAGG TACGCTGGAT TATCTCGAAT TTGAAAACTT GACAACCACG TTCTGACTTC CGCATCGTTC CCTCGTTGCA CAGAGACGGC GAAGGCGTCT CGTCAGCTGC GGTCGAAGCA CCGTCTCCTT TTGTCAGGAA CGCCGGTGCA GAATCATGTT CATGAACTGT GGGCTGTATT TGACTTCTTA ATGCCAAATT TTTTGGGATC CTCGGTATTT TTCTCAGAAA AGTATGCCAG AACTATATCG AAAGGACAAG CTCCTGGCGC ATCTGTGAGA GAGATCAGCG AAGGAATAGA AAAGTTGAAG ACGTTACATC AGCAAGTACT ACCGTTCATA CTCCGGAGAG AGAAACAACA AGTTCTTAGA GAATTACCAT CAAAATTAGT CACTCAGATT GAAGTTCCCA TGAGTGATCT GCAGCGAAGG CTCTACACTG ATTTCTGCTC GTTTGCAGAT GTCCAACAGT CACTCCGGGC TCTAGATCGC GCTGCGAAGG ACGATCTCGG CGATAGATGC CTGGAGCAGG CAGGGCGTAG CTCGCTACAG GCCCTTTTGT TTTTAAGACT TCTCTCAACT CACCCATGGC TTGTCAGATC CGCCATACCA GTCGCCTCGG AAATCAGCGA CAACGATTGG CTTCGTTTTG ATACCTCCGG TAAAATCAGA GCGTTGGCTG ACTTACTCCG AGAGCTTAGT ATTTTCACCG ACGACTTAAG CGCTGCCGAT AACGATTCGT CACTTTTGTA CTGCGAGGAC GACCATGTTG ATGTAGATGT TTATTCCAGC CTCGTCAATT CATCAGACAA TCACATGCAA CCCGCACCCA CAACCTCCGA AGTCCAATCG CAGACAAAGT GCTTGATCTT CGCTCAGTTC ATTCAAAGTC TTGATGTTGT GGAAAAGCTT TTATTCAAGC CTCACATACC ATCGCTGAAA TATCTTCGAT TGGATGGAAG AGTTCCTGCC AGAAGACGCT ATGCCATTGC CGAAGAGTTC AACCGTAACG ATGAGATCAA GGTTTTGCTG CTAACAACAA GGGTCGGTGG TCTTGGACTA AACTTAACAG GTGAGTAAGG GTCACGAGCT TCTGTCTCAC TGTCACGGGG ATATGTTTTT CTCATGGACA TCGTACAATA TAGGAGCGGA CACTGTAATT TTTCTCGAAC ATGACTTTAA TCCTTTTGCT GATCTTCAAG GTATGCAACA ATTGCGAGAC ATGGAGATAT TTGCATAGCA CTGGCTCAAC CTTCTATATC ATTTTAAAGC AATGGACCGG GTCCACAGAA TTGGCCAAAA GAAGGCTGTA TGCGTTTACC GGTTAGTTCT GGTCGACTCA ATTGACCAGA GAATTATGAA GTTACAAGAA AAGAAGTTGG CTATGAGCGA GGCGATAGTG AACGCCGACA ATTCTACTAT GTTCAGCATG GGGACTGATC GATTGCTTGA CATTTTCACG ATGAGAAGCG ACCAAGAG
|
Protein sequence | LRQYQLEGIA WLRFLHTLRL NGALCDSMGL GKTLQALICV AISHDVVHHA APDSKPVSIV VCPSTLVRHW IAEINRFFKS DDPVFFPLEL SGSSTSRRAV WEKGLVFCNI IVTSYSVLRS DIRMLASQSY HYCVLDEGHL LKNPKTETAK ASRQLRSKHR LLLSGTPVQN HVHELWAVFD FLMPNFLGSS VFFSEKYART ISKGQAPGAS VREISEGIEK LKTLHQQVLP FILRREKQQV LRELPSKLVT QIEVPMSDLQ RRLYTDFCSF ADVQQSLRAL DRAAKDDLGD RCLEQAGRSS LQALLFLRLL STHPWLVRSA IPVASEISDN DWLRFDTSGK IRALADLLRE LSIFTDDLSA ADNDSSLLYC EDDHTKCLIF AQFIQSLDVV EKLLFKPHIP SLKYLRLDGR VPARRRYAIA EEFNRNDEIK VLLLTTRVGG LGLNLTGADT VIFLEHDFNP FADLQAMDRV HRIGQKKAVC VYRLVLVDSI DQRIMKLQEK KLAMSEAIVN ADNSTMFSMG TDRLLDIFTM RSDQE
|
| |