Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50572 |
Symbol | |
ID | 7199396 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | - |
Start bp | 158070 |
End bp | 161111 |
Gene Length | 3042 bp |
Protein Length | 860 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185533 |
Protein GI | 219130776 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATTCCAAAA ACGGTCAAAG CGGCGAGTCA TATGACTCTT AGATAAAATT ACTAATTATA CATGTTCAAG ATTGTGACGA CACTGAAGAA AGTAGAAAGC CAAGATGACC AGACAGTTTG TTTTCGCTTC TTTCCGTTCG CGAGAGTCAC TTCTTTCCTG GTCCGTCAAT TGCGATTGAT CTCTTGGTGA TACAAAGTCA CGTGGTATGG AAACTGAAGA ACATAGTAGT AGCCATGTTT ACGACGACGA GGAAGACGAA GAGCTCTTTA TGTTGATACA AGAGTCGAGC GGAAAAGAAG GAGACGTGAT CGAGAGATTC CCTGTTCGCA ATATTTGGTA CCTCGTCGGC GCAGGACTCG TGGTATTAGC AGCGTCTTCG TTTTTTACCG ATCAGGTTGA AATAAAATTT GATTCGCGGC TAGAGGACGT TGAAAAGAAG AATCTATCTG CTGTGGACGA TGCTCGGGTC GAAAACGAAT CGGATATTGG CAACGATAGC GGATACGGGA CTGTGGACGT CAAAGCAGAC TTGCATGGCA AATCCAACGC GCTCGATGAT TCGAACAAAA GCGCTAGCAC AAATTTCGCT GAACATTCCG TGTCCATAAA TGATGTGGAA GATTTTTATG CCTCCATCGA GCCTGGCTTG AAACCCCGTT TCAATGTGCT CTCCCCATCT TATATTCCGC GAGGCCAGCC GCTTGCGCAG GAAAAGCGAA ATGAAATCAA AACGAAGTGG GGGTCTTGGA CCCTTGTCGA CGAAGTGAGC CGGCCGGAGG AAGATTTTTT TGCACGGTAT CCTTATCGCG ATGTCCCGAG GAATGATTTT CCAAGTAACG CCTGGCAAGT GGACGTCGCA TATTTATCGC GATTTCTGCC AGAAGCTGAG GCCTTAGTGA CGAGGTCGAT GGAAGCGATT CTCGCGGAAT TGGCCCATTC TCCAATAGAA GAGCCAGGTA TGACAATAGA GGAACGCTCC AAACTCTTCG AACTTGAACA GACTGATCAT GATATCAAGT TGAAAAACGT CATGTTTGAT GGATGCGGCT ACACTTCTCC AGATAGTGAA GGTTTGCTAG CGAGACGCAT ATTACACGCC GTCATGACCG AAGATAATTT TAATTTTGTG ATGGGAGGCC ATTCCGCTGC TGCTGGTAAG AAGCTTTTCG TTTCCCCGTG TGCACATAGA AGGATTGATA CTTACTTCTT CATATTTTTT CAGGGCATGG CAACAACTTT CAGCAATCCT ACACATTGCA GTTTCAACGT ATTTTAGAAC CCATCTTGGC ACGGTTGGGT GTACGACTAC AAGCACATAA TTTTGGTATG GGGGGACTAG GCACGAGCCA AAACGCAATG GCCGCCAGAG ACCTTTACGG CAATGAGATC GATATTTTAA TGTGGGATTC TGGTATGACA GAGAAATCGA ATTGGCACCA GAACTTGTTC GTCCAGCAGA GTCTACTGGG AGCCGAGGGG CGTGTTCCGA TGCTGTGGAA CACTGCCAAT CTATTTACCT ACCACGATGA ACTAGGTGTG GATATCATGC AGGTTGGGGG GTTCTGGAAA GTGGGAGATA AATATTTGCC ACTTTCAGAC GACCCAGTCC AGGTCAAAGG GCTTCCTTAC GCAGTGCAGT ACCTGAAGCC GTCTTCTGGA ATGCAAGGTG AAACTCGAGC TAACCGTTAC AATGGTACGT GTTGGATCGA TCGACCAGAT ATCGATCCGC CTACTAAGCA AGCCGACTTC CCCGGGGGTC GAGCGAGCTG GCATCCGGGC AACCGTGAAC ATCAATACGC AAGCCGTCTG ATGTTGTTTC AGGTGCTGAA GGCTTTGCAA AAGGCTATTC ACATTTGGCG AGATGCAGAC GACTTTGCCT TGCCTGACAA GGCTTGGCAC ATGGGCGAAC ATTACGACTC CATCAAATCC AAGCTGCAAT CGTGGAACAA TACAGCCTGC AAAGCGAATG CCGTTCTGCC ACCTCGATGG TGCGAAGTGG CTTTTCAAGG GCGATCCGAA TACCTTCCAC GGGCCAATCC AAGCGAAACT AGCATTCGGT CGTTGGTTAA AGAGAGTATG ACAATACCGG AAGTGAAACG CAACTTGTAC GATCCTCCGG ATGTTTGGAT GCCCGTGCTG GATCCTCCTG ATGGAGACAT CGATGTGCTG TCAATAGTCG AAAATGGGGT TGAATTCGCT GCCAATCGTC AGCGCATAAA ACATGCTCTA GACTATATTC AAAAGTCGAA ACATCGGGTC GCTTTGTTGA ACAACAATTC TGAAATCGTA CCTGGAAAGG GATGGGTACG TAAAGTAAAC GATGACGAGG CTCACTCGAT TCGTTGCATC AAAGTATCTC GACTCACAGG TAACCTGCTC TCATTTGTAT CGGCAGTACT CTCACACAAA GTCGGCTCCC GACAACTGCG ATGGAACTTA CGATTCCTTT TGCGGGCATT CTGCGGATGA ACGGTGCCTT TTGATAGGTC ATAACGACGA CCGCGGAGGG CTGACGTACA ATAGCTTGAG TGGCTGGCTA ATTCTGAACA TCGAAAATGT CCGTGAAGGG ATCATTGCCA TTAAGGTGTT TGCAAACGCT GACAACCCCT TGACGAAAGG GTGGTGCTCC GTGAACAATG AAGAACCGTG CACAGGCACT GAAGTCAACA GCGGAGTAGA GGACAATCCT AGCGATCGGC GCCTGGTCGA TGCTCCACTG TGTGAGGACT TTCGGTTCGA GTTCGCCATT GATGGTGAAA TAACGAGTTG GACGAGAGTC GAATGGGAAG AGAAAAAGAT GAACGTTGAC CGGGTTGTTG CCGTATGGAC TCTACTGGAT GATCCCGGCT TTTCCAACGG TAGTTCTGTT GATGTGGAGT TAGCGATTCG AATGATGGGT TGCGACCAAA AAACAACGCA TGACTTAACC CATGTGTACT GGGCATAGGA TGTATCGTCT CTCTCTAGAC CCTTACACCC AACCCAAACT ACCGACGATC TCCTAACTTT AAAGTAGGAG TATAGGCCAT GCGCAGCACC AG
|
Protein sequence | METEEHSSSH VYDDEEDEEL FMLIQESSGK EGDVIERFPV RNIWYLVGAG LVVLAASSFF TDQVEIKFDS RLEDVEKKNL SAVDDARVEN ESDIGNDSGY GTVDVKADLH GKSNALDDSN KSASTNFAEH SVSINDVEDF YASIEPGLKP RFNVLSPSYI PRGQPLAQEK RNEIKTKWGS WTLVDEVSRP EEDFFARYPY RDVPRNDFPS NAWQVDVAYL SRFLPEAEAL VTRSMEAILA ELAHSPIEEP GMTIEERSKL FELEQTDHDI KLKNVMFDGC GYTSPDSEGL LARRILHAVM TEDNFNFVMG GHSAAAGHGN NFQQSYTLQF QRILEPILAR LGVRLQAHNF GMGGLGTSQN AMAARDLYGN EIDILMWDSG MTEKSNWHQN LFVQQSLLGA EGRVPMLWNT ANLFTYHDEL GVDIMQVGGF WKVGDKYLPL SDDPVQVKGL PYAVQYLKPS SGMQGETRAN RYNGTCWIDR PDIDPPTKQA DFPGGRASWH PGNREHQYAS RLMLFQVLKA LQKAIHIWRD ADDFALPDKA WHMGEHYDSI KSKLQSWNNT ACKANAVLPP RWCEVAFQGR SEYLPRANPS ETSIRSLVKE SMTIPEVKRN LYDPPDVWMP VLDPPDGDID VLSIVENGVE FAANRQRIKH ALDYIQKSKH RVALLNNNSE IVPGKGWYSH TKSAPDNCDG TYDSFCGHSA DERCLLIGHN DDRGGLTYNS LSGWLILNIE NVREGIIAIK VFANADNPLT KGWCSVNNEE PCTGTEVNSG VEDNPSDRRL VDAPLCEDFR FEFAIDGEIT SWTRVEWEEK KMNVDRVVAV WTLLDDPGFS NGSSVDVELA IRMMGCDQKT THDLTHVYWA
|
| |