Gene Information |
Locus tag | PHATR_46797 |
Symbol | |
ID | 7204666 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 474661 |
End bp | 475959 |
Gene Length | 1299 bp |
Protein Length | 322 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185894 |
Protein GI | 219121337 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.147983 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAATTAGCGT GAGATTTTCA CTGGCTAGGC CTGACGGACA TACGATTTAC TCGAGAAAAC TCGAAGTATT TTTCGAGTTT GTCACCCCCT CTATACTTTG CTCTTACTGT TAGTGTTGAT ACCAGAAGCG GAGTACATGG CAATATAGAC AGCATCCTGT ATTGAGTGGC GAGATGGTAG ATGACCATCA CGACAAAGAC ATGACGTTGG AGGAGCGCTT GGCATGGCTT TGGGACCGGG TAAGTCGTCG ATTGGTACTT TTTTGGGAGG ACGGTGTTGC AACAACGAGA TCATAAGTTT CTTGGCTGAT GACAATTTCT TTGCCGTCAC ATTTCGCAGG GTATCCAGGT GACAACTCCT GAGGAACGAA AAGTTTCTCA AATGGCCAAT GTGATGCGCC AGGCTGATAC GGTATCGAAC GCGGAGGTTT CGTATGTGCT TATTCCAGCA GATACTTCTA AACCTCTGCA AGAGTTGAAT TTCCGACCCG ACACTTTACC GGGTGATAGA CTTGTGGAAC ACCTGAAACC GGCGTTCGCT CGAGAAGCGC AAAGTGTTGA CATTGGCTTA CTTCAACAAC AAGCGACACA AACGCTGGCT GCTTCCCCGG ATACACCTAG CACGGTGTCG GATGCCACCC TACGGCAAGT CGCTTCCGAA GCCAACGTCG AGACTTTTCA TTTGGTACAT GCGTCGGCTA CGAATCAATA CACTGACGTT GTAATTTACT TGGATGAAGT CGGTATGTTG AAGCGATTGA GTTTGAATAC CCGAGCAAGT AAATATGCAT ACCTTGCAGG CTTCAATCCA GCACCCCAGT TTTACGGCGA CATATTTATT GGTCGCTTGC AGCGGAAGCC TACTTTGCGT CATAAATCGT TTGTGCTGGG AATCGATACT GCGCCGGATG CACCGTGGTT GCAAGCAGCG ACCTTGGAGA ACCTCCAGCG TCAAATGGAA CTCAACCGAA TGACGGGACG CAGCGATACA CAATTAGCAG TAGCCGGCGA CGGCCAAATG AAGCAAGAGG ATGGCTTTTC TTGGGTGCAA ACGGAAGAAG AATTGGAAGT TGTGATTCCA CTTCCAGTGA AGACCCAGTC GAAAGACGTC AATGTTGTAT TTCGACCTGA ATCTCTCAAA GCTACATCCT TTGGCAAAGA CCTTGTTATG GTGCCATTGT TTGAGCGAGT CGATGTCGAT GGTTGTACAT GGACGCTGGA ATCGAATGAT GAGTGCAGAA AGCTGGTAGT TACTATGGAG AAGGTCGAAT CCGCTTTCTG GCCGCGCATC AAAGATTAG
|
Protein sequence | MVDDHHDKDM TLEERLAWLW DRGIQVTTPE ERKVSQMANV MRQADTVSNA EVSYVLIPAD TSKPLQELNF RPDTLPGDRL VEHLKPAFAR EAQSVDIGLL QQQATQTLAA SPDTPSTVSD ATLRQVASEA NVETFHLVHA SATNQYTDVV IYLDEVGFNP APQFYGDIFI GRLQRKPTLR HKSFVLGIDT APDAPWLQAA TLENLQRQME LNRMTGRSDT QLAVAGDGQM KQEDGFSWVQ TEEELEVVIP LPVKTQSKDV NVVFRPESLK ATSFGKDLVM VPLFERVDVD GCTWTLESND ECRKLVVTME KVESAFWPRI KD