Gene Information |
Locus tag | PHATRDRAFT_48486 |
Symbol | |
ID | 7203768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 638273 |
End bp | 639867 |
Gene Length | 1595 bp |
Protein Length | 528 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182999 |
Protein GI | 219125459 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.178439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATCCACCAT GACGGAGGTA CGCGCTGAAT CGTTGGGGGG ATCTGGACTT TTTGCTACTC AGGACTATCA GATTGGGGAT CAAATTCTCC AGGAAAATAC TCCTTTGATT CATCTAGCAC CAACTACAAA GGAATTGGAA AGCGCCATCG TTGGAGCCCT CGTGGAACAA ACGATTGAGA CGAACCCAGC AGGGACCACG GACGACGACA AACGCAGTTT ATGGAGGGCC ATTCAGGTTC CTGCTTCCAT CAAGGAGGCA CATTACGGAA AGTTTAAGGG CATGGTACAA GCAGCGCTTT GTTTTGCCGC TTCGGAAGTG TCCGTTGATT CCAAACAGCG CTTTCTCAAG TTGTATTGTC CACCGCTCAC GACTACGGAA AAATCACCTT CCGCCAACAG CGAAGCGGAA GCGGACATCA TAGAAGTGTC AAAGCAAGCA TTGCAATACG TAAAGGAAAA CATTCGGATC GATTCCAATC TGGCGCAAAA TGGGGCGATG GATGATGGCA CTCTACAAAA GGTCATGCTT TTGTGGTCGG GGAACAGTTT TGAAGGTGGC CGCGTTTACG ACAGCATAAG TCGGATCAAC CACTCGTGCG ACCCCAACGC AGTCGTTCAG CTTGGGCTTG GAACGGAACA AGACCGCCAA AGCATTGTGG CTTGTGCGCC CATTGCGAAC GGCGACGAAA TCACGATATC CTATTTAGGG CTTTTGCTCT ATGCGGACCG ACCGACGCGC CAGGCGTCGT TGTTAGGCAC CAAGCACTTT ACGTGTGCGT GTGATCGCTG TAAAACGAGC CTGCCGGACA ACGCGAGTGC TATCCCTTGC CCCATTTGTC ACCCAAGAAG AACGGGACAG CGCCAACTAG ACGAAGATGT TCAGTACGAT GACGAACAGA GCGTCCACTA CGCAATGATC CGTCAAACTC CCGACCATAA CGCGGCCCAG AAAAGGATGG AATGCGAGCA TTGCCACGCG AAAATATTTC CTAGCGACTC TAATCATGCC GTCTTGTGGA AAATAGCAAC CGCTGTGACG GACAAAACGG TTACTTTTCT GCGAGACCAC GCCGCGATGG AAAAGAATAA ACTCAACGAT GATGGCGACG ATGAGGAGGC CGAGCAAGTA CGAGAGGAGT TACTGGAGCA GCAACTTCAG ATATGCAGCA GCGTTTTGGG TGCCCAACAC TGGACGACGA ATATACTGTT ACTACTACTG TTGGATCAAA AGTTACAAGC GCTGCACGGC GGTTTGCTAA GTGACAACCA AGGGGAGGCC GATCTGGGAT CGATTGCGGA AGCCATGGAC ATGCTGGAAC GGTTGTTTCG GTACGTGAAT CGGTTGGGCC TACGCTTACA CCGGGGACAC CTACTTGCGG ATGTCACTAT GGGAGCAGCA CGAGCCTTGG TGAGTCTAGG CGACACCGGG AGTCAAAAGT ATGCTGCCGA ATGGATGGCC AAAGTCGACG ACTACGTTCA ATCATTCGAA CCCGAGGATG TGCAAAAGGT GGCGCACGCC CTGAAAGCAG CTTGGACGCG GGCCGATCTA AATCCTTCTC CCAGGAAACG AGCAAAGGGC GACGTGAAAA GATGA
|
Protein sequence | MTEVRAESLG GSGLFATQDY QIGDQILQEN TPLIHLAPTT KELESAIVGA LVEQTIETNP AGTTDDDKRS LWRAIQVPAS IKEAHYGKFK GMVQAALCFA ASEVSVDSKQ RFLKLYCPPL TTTEKSPSAN SEAEADIIEV SKQALQYVKE NIRIDSNLAQ NGAMDDGTLQ KVMLLWSGNS FEGGRVYDSI SRINHSCDPN AVVQLGLGTE QDRQSIVACA PIANGDEITI SYLGLLLYAD RPTRQASLLG TKHFTCACDR CKTSLPDNAS AIPCPICHPR RTGQRQLDED VQYDDEQSVH YAMIRQTPDH NAAQKRMECE HCHAKIFPSD SNHAVLWKIA TAVTDKTVTF LRDHAAMEKN KLNDDGDDEE AEQVREELLE QQLQICSSVL GAQHWTTNIL LLLLLDQKLQ ALHGGLLSDN QGEADLGSIA EAMDMLERLF RYVNRLGLRL HRGHLLADVT MGAARALVSL GDTGSQKYAA EWMAKVDDYV QSFEPEDVQK VAHALKAAWT RADLNPSPRK RAKGDVKR
|
| |
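The numeric fields in this record can be cross-checked against each other. The sketch below is a minimal illustration and not part of the source database: the function names `gene_length`, `clean_sequence`, and `gc_percent` are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive coordinates (an assumption consistent with the listed Gene Length of 1595 bp) and that the spaces in the sequence fields only separate 10-letter display blocks.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that splits a sequence into display blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Coordinates taken from the record above (Start bp / End bp).
length = gene_length(638273, 639867)  # 1595, matching the Gene Length field
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the listed GC content, and `len(clean_sequence(...))` can be compared with the listed Gene Length.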