Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46603 |
Symbol | |
ID | 7201871 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 870057 |
End bp | 871165 |
Gene Length | 1109 bp |
Protein Length | 275 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180912 |
Protein GI | 219120344 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.283462 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTGCG AGGCCGCGCA AATACATCAG GTCGAAGGAT CTCACAATCT TGATTTCAGA AAAACGCACT GGTTCTCCTT TTGCGATATA CTGCTCCTAC AAATTAAATG TCTAGCTCTC GTAGTCAATG GTAAGGCAAA CTGAATAGAA AAGTGATATT GCCAGAGTCG TTTACTGTAT ACTTTAAAGA ATTGTTTGGG ATCACCTGAT TTGACTTCGC TGAATTACTA TGGATGATGG TCCTCTCACT TCAAGTTGAC CAGATGTTTC CAATTACCAT AACCCTAACC TTTTTCTCAC CAACGGCTGT ATGTGTTTCT TATGTTCGCC TGAACACGCT TTTACCAACA CAACAAAATT GGCAGTTGCT TGAACCTCCT CCAAGCACTT TTGGCCGTCT TTGCCCTGAT CACCGCTACC CTGGCGAGCT TCTGGTGTGA GACCATCAAG TTCATCCCAA GTAGTAGACG TAGACTTTCG ATACCTTCTT TGCACTTCGG ACTATGGTCT GTACGATCAA CAACGATAGA CAATCTTGGG ATTGCAGGAA ACGTCCGTAT CGTGGTACGG AACACTTGTG TCAGCTATCC TAGTTTGCTA AACATCGATT CAAGCTGGAA GGCAGCTCGT GCCTTTTCCG TCGTGGCCTT GATTCTTGGT TGTGTAGTCA CGCTTACTTT GGTTGTTCTG TCGTGCATTT CCTCCGACTT TACAAAACGG AGCTGGAGCC TCCTAGTCGT GCTTTGTTTG GTTATTCTTC CATTGTTTCA AGGATTGACA TTTTTAATAT TGCAGTCCAA CGCTTGTCGT GACAATGCAG TAGTTACAAG CATTGGAATT TTACAAGGAG TATCTTCATA TTCCGATGAA TGTGATTGGG ATGCCGGCAG TTCGGCCAAC GTTGCCGCCG TTGTTCTTTG GTTCGCTGCC GGAATGGTAA TGCTCACGTT GGGCCCCCCG GAACGCGATG AACGTTCGTC CAGCCGAACC CAAGAAGTTA AAAACAATAA CACAGGCGCG GCAGCTATGA CCGAAGCAGA CGGCGCCATT GGTGGCTCGG AAATCCCGAA TTTGAACGTC TAATGGAATA AGGAAGCAAT GGATCCATT
|
Protein sequence | MVCEAAQIHQ VEGSHNLDFR KTHWFSFCDI LLLQIKCLAL VVNALLAVFA LITATLASFW CETIKFIPSS RRRLSIPSLH FGLWSVRSTT IDNLGIAGNV RIVVRNTCVS YPSLLNIDSS WKAARAFSVV ALILGCVVTL TLVVLSCISS DFTKRSWSLL VVLCLVILPL FQGLTFLILQ SNACRDNAVV TSIGILQGVS SYSDECDWDA GSSANVAAVV LWFAAGMVML TLGPPERDER SSSRTQEVKN NNTGAAAMTE ADGAIGGSEI PNLNV
|
| |