Gene Information |
Locus tag | PHATRDRAFT_54883 |
Symbol | |
ID | 7203722 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 11991 |
End bp | 15092 |
Gene Length | 3102 bp |
Protein Length | 980 aa |
Translation table | |
GC content | 60% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182883 |
Protein GI | 219125219 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0342548 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACTGTGAT CTCCATCGTC GTTTTGTGTT CTTCAATAGA ACTACGAAGC AACCCCACCG TCTTTACGCT CCTAACCCCG CCCTACCGCC CATCCGCAAC CTCTTCGGTC CCGATGTCGA CCTCGGCTCA TTTCAAACTG AGCGACTTTC CTCACAAAGT CCTCGACCCG ATCGCCACCC TCACCGTCCC ACCGACCTAC GCGACCATCA AGCGTGCCCA ACGCCAGCTC ATGACTAACG CCGCCGCCAT TCCCACACTC AACGGTGGTG GCGCCCATGG CCATATGGCC CTGACCTTGA CCGCCCTTGC CTACGCCGAC ATCAGCAACG TCCCGTTCGT CATTCCCGTC GCCCCTCCGG CCAATCCGCC TCCTGGTGCC ACGCAACCGC AAATCACCGA AAACAACCGC ATTCATCAAC ACGATGCCGA CATCTACAAC CTTTATGTCG CCGTCAACAA CGCGCTTCGC CAGCAACTTC TCGACGCGGT TCCCCGCATT TATGTCCGCG CCCTCGCCCA TCCCATGTTC GAGTTTAGCA ACGTCACGTG CCTCGACTTG CTCTCGCACC TCTGGACCAA ATACGGTACC ATCAAGCCCG CCGAGCTCCA GAAAAATTTC CAGTCCATGT ACACCCCTTG GAACACGACC GAGCCGCTTG AATCAGTTTT TCTTCAGCTC GACGAGGCCA TCGCTTTCTC TGTTGACGGT AACGACCCCA TCTCGGAAGC TGCTGCTGTT CGCGCAGGCT ACGAAGTCAT TGCGCACTCG GGCCTGCTCC CCCTGGACTG CAAAGAATGG CGCAAATTGC CTACTGCTGC TCACACCCTT GCCCATTTCC AGCAGCACTT TTCCCTGGCC GACGAAGACC GGCGCCTCAC GGCAACCACC GGTTCCCTCG GATACGCCAA CGTGCTTGCT GCTGCCCCCT CTCTCGCTCT TGCCACGACC TCCGACACTC TTAGCCTTCC TTTCTCCGCG CTCTCTGTGT CCCAGACTTC TGTCTCTTCG CCGGACATGA CCTACTGCTG GACCCATGGT ACCAGCAAAA ACCGACGCCA TACAAGCGCC ACGTGCAAGA ACAAGGCCCC TGGCCACCGC GACGACGCGA CCGCCACCAA CACTCTCGGC GGCTCCACCA AGGTTTGGAC GGCTCCCAAG CCCCCTGAAT AGGAAAGAGG GACGGCTACG CCGATGGTTA ACTCTAGTAA TACCGATTAT TTAAATCATA TTACTAGTCT TAATTCATCT GTAGTCCCCT CCCCGCCTAG TCCCCATACC TCGGCCATTG CCGACACCGG TTGCACCGGC CATTACATCA CCATCAACTG CCCCCACACC GACAAACGTC CTGCGAATCC CAGCCTTGCC GTCCGTGTCC CTAACGGCGC CGTCCTCCGC TCAAGCCACA TTGCCACCCT GGCCCTCCCT GGCTTCTCCC CTTCCGCTTG CCAGGCCCAC ATCTTCCCCG GGCTTGCCTC GCACCCACTC ATTTCGATTG GGCAACTTTC CGACGACGGC TGCACTGCCA CTTTCTCAGC CACTAGCCTT GAGATCCACC GCGACACCAC ACTACTCCTC TCCGGCACTC GTGCACCCAC AACCGGCCTC TGGCACCTCG ATCTTACCCC CGCCAAGCCT CCCAACACGG CCCATGCGCT TGTTCCGCAC ACACCCCTTG CCGACCGCAT CGCTTTTGTC CATGCCTCGC TCTTCTCCCC GGCTCTCTCC ACATGGTGCC AGGCCCTCGA CTCCGGCCAC CTTGCAACTT TCCCCGACCT TTCCTCCCGC CAAATCCGCA 
AGTATCCACC TAGTTCCCCT GCCATGGTCA AAGGTCACCT TGACCAACAA CGCGCCAACC TTCGCTCCAC CAAGCTTCCC CCTGTCTGTC CCCCCACCAC GACGGAACCC CCAGCCGCCG CTGTGCCCGA CTTTGATCCT CCCGACGCCC ACCCTATCGC ACGCACACAC CATGTCTTTG TTGCCCACCA ACGGGTCACC GGTCAAATCT ACACGGACCA ACCGGGCCGT TTCCTCACTC CCTCAAGTGC CGGACACAAC GACATGCTTG TGCTCTACGA TTTTGATAGC AATGCCATCC ATGTCGAGGT CATGAAGAAC AAGTCCGGCC CCGAGATTCT TGCCGCCTAC AAACGCGCAC ACTCTCTCTT TACCCAACGC GGCCTCCGTC CCCAGCTCCA ACGCCTCGAC AACGAAGCCT CTACAGCCCT CCAATCCTTC ATGACCTCGG AACACGTCGA CTTTCAGCTG GCACCTCCCC ATCTGCACCG TCGTAATGCC GCCGAACGGG CCATACGTAC CTTCAAAAAC CACTTTATTG CTGGCCTCTG TACCACTAAC CCGGATTTTC CCCTCCATCT TTGGGACCAC CTCCTCCCAC AGGCCCTTAT CACCCTAAAT CTTCTTCGTC GCTCCCGCAT CAATCCCAAG CTGTCCGCCC ACGCCCAGCT TCATGGTGCT TTCGATTACA ACCGCACCCC GCTTACTCCT CCCGGGACTC GCGTCCTAGT CCACGTCAAG CCGTCCGTCC GCGAAACTTG GGCCCCCCAT GCTGTCGAAG GTTGGTACCT CGGCCCCGCC CTGAACCATT ACCGTTGCCA CCGCGTCTGG ATCACGGAAA CACGTGCCGA ACGTGTTGCT GACACCCTTT CCTGGTTCCC GACCCGCATT CCCATGCCTG CCGCTTCGTC CACCGACCGC GCCCTGGCCG CCGCCCGTGA CCTAGTCCAT GCCCTCCAGA ATCCTTCCCC TGCGTCTCCG TTCGCCCCCC TCGATGCCAC CCAGCACAAG GCACTCACCG ACCTTGCCAA TCTCTTTGCC ACCGTGGCCG CCCCGGCCGC CGACGTCCCT GCACCTGAAC CTGTGCCTCC GGTCCGTCCT CCTACCCCAG CACCTCCCCC TGCTCAGGTC CGTTTTGCCG TTCCTCTTGT CACGGCCGAA CATGCCCCTG CACTTCCGAG GGTGCCCATT CCGGCCACAG CACTTCCGAG GGTGCCCACC ACGGCCACCT ATCACTCTCG CACCCGCAAC CCCGGCCGCC GCCGCCGCAC AGCACGCAAC CAACCGGTAA CCCCAACCCT AG
|
Protein sequence | MSTSAHFKLS DFPHKVLDPI ATLTVPPTYA TIKRAQRQLM TNAAAIPTLN GGGAHGHMAL TLTALAYADI SNVPFVIPVA PPANPPPGAT QPQITENNRI HQHDADIYNL YVAVNNALRQ QLLDAVPRIY VRALAHPMFE FSNVTCLDLL SHLWTKYGTI KPAELQKNFQ SMYTPWNTTE PLESVFLQLD EAIAFSVDGN DPISEAAAVR AGYEVIAHSG LLPLDCKEWR KLPTAAHTLA HFQQHFSLAD EDRRLTATTG SLGYANVLAA APSLALATTS DTLSLPFSAL SVSQTSVSSP DMTYCWTHGT SKNRRHTSAT CKNKAPGHRD DATATNTLGG STKERGTATP MVNSSNTDYL NHITSLNSSV VPSPPSPHTS AIADTGCTGH YITINCPHTD KRPANPSLAV RVPNGAVLRS SHIATLALPG FSPSACQAHI FPGLASHPLI SIGQLSDDGC TATFSATSLE IHRDTTLLLS GTRAPTTGLW HLDLTPAKPP NTAHALVPHT PLADRIAFVH ASLFSPALST WCQALDSGHL ATFPDLSSRQ IRKYPPSSPA MVKGHLDQQR ANLRSTKLPP VCPPTTTEPP AAAVPDFDPP DAHPIARTHH VFVAHQRVTG QIYTDQPGRF LTPSSAGHND MLVLYDFDSN AIHVEVMKNK SGPEILAAYK RAHSLFTQRG LRPQLQRLDN EASTALQSFM TSEHVDFQLA PPHLHRRNAA ERAIRTFKNH FIAGLCTTNP DFPLHLWDHL LPQALITLNL LRRSRINPKL SAHAQLHGAF DYNRTPLTPP GTRVLVHVKP SVRETWAPHA VEGWYLGPAL NHYRCHRVWI TETRAERVAD TLSWFPTRIP MPAASSTDRA LAAARDLVHA LQNPSPASPF APLDATQHKA LTDLANLFAT VAAPAADVPA PEPVPPVRPP TPAPPPAQVR FAVPLVTAEH APALPRVPSE GAHHGHLSLS HPQPRPPPPH STQPTGNPNP
|
| |
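The coordinate and composition fields in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates (which is consistent with the reported Gene Length of 3102 bp) and that sequences are stored in space-separated blocks as shown in the Sequence section.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates sequence blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(grouped: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Coordinates taken from the Gene Information table (Start bp, End bp).
length = gene_length(11991, 15092)

# Only the first two blocks of the gene sequence, as a short demonstration input.
first_blocks = "CAACTGTGAT CTCCATCGTC"
demo_gc = gc_fraction(first_blocks)
```

Passing the full Gene sequence field to `gc_fraction` should give a value that can be compared against the GC content row; the result of that comparison is not stated here because it has not been computed for this record.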