Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48646 |
Symbol | |
ID | 7194893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 471693 |
End bp | 473335 |
Gene Length | 1643 bp |
Protein Length | 516 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183230 |
Protein GI | 219125946 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.555867 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAACGTGTAC GCATACATCT ACCCAATTAA CCACCTTTGC ACTACCCGAC CAGTCGAGAA AGTGGACACT TTCACGGGTT GCTCGAACGG GCATGTCCGT CCGCAGGTTT TTTGGGTGGA AGTCACTCGA GGAGGACCTC GCATCTGCCC CGGAACGCAT CGACGATGCG GACAACGCAG ACTCTGCCCA GGATCCTGAT CCCTCGGATG GAATAATCAC TACCACGCAT AACGACAGCG ACACCAATAC ACTGGTTGAT TGGGACACGG CGCACGACGT CACGTGTCGA CAACAGCATT CATTGATGGC AACCCCGACA CAAGTGGTCC TCGCCTTCCC TCACGGAGTC ACGCCTCCAC CATCTGCCGA AGACTGTCTT TGGAAACCAT GCCACCGACG TACACTCCAA CACGACGTGG TTTCTCGGTA TCGACTGCGG TATCCATCCC TATCCACACC GGATGCATCC GACCCGGACT CGACCACGAC ATCGTCGTCG TCCGCCACTA CCCTCGGGCG GGCCGCCCGG TACCTGTACA CGTCCCTGAC GCATACACTC GGGGCGACTC CTACCTCTGC AGATCCGGTC TGGCTCCGTT ACAATCCCAA CGACGATTTT CTGTCGGGGT CTGAGCAATC CTTTCCTTCG AAAGGACTCG AGGAGGACGT CCATGATAAT AATCACGAGG CGCCCTTGCT TCATCGAACG CTCGCGGTGG AATGCCTCCA CGTGTTTGTT AACGCACTCG CGCACGACTC GGACTCCATT ACGGTGAAGG TCGTCCAACG ATACGGGCAC GATCGCCAAG CGTGGCCCGG ATACCTCTCT ACACGACTCG CACCGTCGCA TCTCTGGTGT GCTCAGCTGC CCGACGACCA AACTGATTTT CTCCTCGATT GTTTGGAATT GATGGGTCTT GTCCAAATCC GGCATCGGGA GTTGCAGCCC GATTTGGTTA TCCTCCACGG CTCGGTCAAC CGAAGCCACG GCAACGGCAA CAACAAGAAG AGCGACAACA ACAACAACAT TGACGCCGTC GCGGTTGCCG TTGCCCGTTT TGATGTGCAA CAGGCTCAAG CACGGGTGCA GGTGCAGATT CAGTCCTGGA CGACCCAAGC TGACGCTTGC ACCGTCCGGG CCCGACAGGC CCAACGGAAC GGCCGAAAGT CCCAGGCCTT GTGGGAAATG AAGCGGCGTC ACCTTTTCAC GCAACAAATC GAACGCCAGT ACGGAATCTT GCTCAATTTG GAAACGGCAC AACACGCCAT CGAGTCAGCC TCCCACCAGA CCGCTGTTGT GGCGGCGTTG TCGCAGGCCT CGCACACACT CCGGCTATTG CGCATCACCG TGTCTATGGG GGACGTCGAC CGCGTTGCTG ATGAGTTGTT CGAAGAATTG GAAGAAATGC GGATACGGGA CGACGCGCTG GTGGATTCAA CGGCGACCGA CACAATGGAT GACGACGAAT TGTTGGAAGA ATTGGCCGCT TTGACGTTGC AGGATGCGGT TCCGGGCGAA TCAAATACGC CACTCGTCCC GGCTGTTCGA GACGCCTCTT CTATCGAAAA TCAAGCGAAC CGGAAGGTGA ACAAGGAACC AACTACTAAG GCCGGGATCA CGGAAACTTT GGCGTCATGC TAG
|
Protein sequence | MSVRRFFGWK SLEEDLASAP ERIDDADNAD SAQDPDPSDG IITTTHNDSD TNTLVDWDTA HDVTCRQQHS LMATPTQVVL AFPHGVTPPP SAEDCLWKPC HRRTLQHDVV SRYRLRYPSL STPDASDPDS TTTSSSSATT LGRAARYLYT SLTHTLGATP TSADPVWLRY NPNDDFLSGS EQSFPSKGLE EDVHDNNHEA PLLHRTLAVE CLHVFVNALA HDSDSITVKV VQRYGHDRQA WPGYLSTRLA PSHLWCAQLP DDQTDFLLDC LELMGLVQIR HRELQPDLVI LHGSVNRSHG NGNNKKSDNN NNIDAVAVAV ARFDVQQAQA RVQVQIQSWT TQADACTVRA RQAQRNGRKS QALWEMKRRH LFTQQIERQY GILLNLETAQ HAIESASHQT AVVAALSQAS HTLRLLRITV SMGDVDRVAD ELFEELEEMR IRDDALVDST ATDTMDDDEL LEELAALTLQ DAVPGESNTP LVPAVRDASS IENQANRKVN KEPTTKAGIT ETLASC
|
| |