Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47943 |
Symbol | |
ID | 7203132 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 512733 |
End bp | 516915 |
Gene Length | 4183 bp |
Protein Length | 1167 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182242 |
Protein GI | 219123876 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACAGTGT GTGTGTCAGC TTTCTGGTCC ATTCACTACT CTCGAAAGGC TAGAATTGAT TGTATTGTGT ACAGCGAGTA TCTCCTTTCC TTCTCTATTT GCCTCTGGTC TTCGTTCGGA AGGAAGAGGA ATTGAAAAAA AGGAATCGTT GTCGTTTCCT TCGTCGAATC GCATCCCTCC ATTCGATTGG TGGTTTCGGT AGCGTATCCC CCGTTTCCGG TTCTCGACAC ACAGCTACAT CCACGTACCT ACAGATACCG AGTATTGACA CGAATACTAT TCTCATGGCG TCTCCAAATC CTGCGTATGC GGGACGTGGA GGTCGTGGCG GACGGGGTGG ACCTCCGCCC TATTACAACA ACAACAATAG TGGTGTACCG GGTCCCCGTC CTCCCACCGG AGCCTGGCAA CCCGCACCCA CTCATACGAT GCCGACACAG TCTACCGCCG TGCCGAATCC GTCATCGCCC AATCCTCACC ATCTCACACC CCAACAACAA CAGCAACAAC AACCACCTCT CCTGAACAAT CCCTACGCGC CAAACCAAGC AGGACCCCGT GGAAACCAGC CTCCTCCATA CCCACCCCCG GCCTACTACA ACCAGCACTA CGCAGCTGCC GCACAACAGC AGCAGCAGCA GCCGAATCCT TACGGCGCCA CGGGTCGGGT TCCGTGGCAG CCCGTACCGA ATCAAGGCTA TCCCTCCACC ATGAGGGGTG GCGTCTTTTA TCCACCTCCC GCCGCTGCCG CTGGACCCGG TCGTCCTTCC GCTACCAACG CGGGGACCAT CCCAGCCTCC GTCGTGGTGC CCCGCGAACG CAAAGCTCTC GTCATTATGG TACGTCCCGA GTGCCATGGA CCAGCGTCCC CCGCTTTGCG CTCGTCTCAC ACCGCTCTAC CCGATTACTC GTGTTTGTTT CTGCCGTTGG CGACTTTTTT ACACCAAAAC AGGATAAAGA CGGCAACGTG ATGGATTTCA CCAAAACGGC TCCCAAAAAG ACTACGGCCA CGACTTCGGC GACTCTTGCC CCGTCGGCGG TCCCGGCCAA AACGGAATCG AGCAGCCAAG ACGCTGGCGC CAAGTTGCGA CAGGCCGCAC TCGAACGCAT CCAAGCCCAA GACAAGGCCA AAAAGGAAGC CGAAGAAAAA GCAAAGGAAC AATTGTTACA GGAAGAACAA GCCAAGAAGA AAGCAAGTGA CGAGACCGAA GTACAAGGTA AAAACACGTC CGAGGCCCAA AGCAAGGTCG CCGAAGAGCC AACCAAGTCT CTGGCGGATC GTTTGAAGCA AGCACCCAAA CCCGCCACTG TCAAAACGAC GACTTCTTCT TCCGGTCGTA TCGTCTTTTC CAAGTCGGCG TTGCTCAAGT ACGTACCGTC CTTCGTGGCC TGGCGGAACA CACTCAGCCT TTCGCCTAGT TTTTTACGTT TTGGATCTTT CCCTTTTCTC ACCCGCTCGT ATTTCATTTT TTAAATCTCT CTTGCAGATT TAAGAGCACG GAAAGATGTA TGCAATGTCC CGAATCGCTC CCTGACATGA CGATCGTCAA GGGACCCGCC CGTGGCAAGA GCGGGGGCAA CCGCGGTGGT GGTGGTGGAG GAGGCGATCG ACGCAACGAC CACGACAGCG GTGGCAGTAG TTGGAAACGA GGCAACGCAC CCCCACGTCG TCAAAGCGCC ACCAATAACG ATGGCGGTGG TGGTGGCGGC GGCGGCGGCA GCTACTGGAG CCGCGGACAA GCTCCTCCGC CACAGCAACA GCAACAAAAC AACGACAAGC ACCGGAACAA TCGGGGTGGA CGCGGTCCTC CGCCACCTCT GTACGACGGC CCCGTCGCGC CTCTAGTGAA GTCGGAGAAT CACTGGCGTC CACAAAAGAA CGCGTCAGCT TTTATTATTG CCGAGAAACA GGTCAAGGCT ATTTTGAACA AGATGACCAA AGAAAAATTC GATAAACTCG CGACACAGAT GCTGGAGATT CCTTTGACCT CTTCTGATAT GTTGAAAATG ATGATTAACA ACGTGTACGA TAAGGCCATT GACGAACCTA CTTTCGGAGA TATGTACGCT GATTTGTGTA AGAGACTTTC CAAGATTGCT ATCGACTTTA TTAAAATTAT CGAGTCGGAC GAAGAGCCCC CGACGGACGA CGATACGACC ACGGAACCGG CATCTCCAGC TGACGATAAA AGTAGCCACC ATACTGTTTA TCGCTGGTCG AACGACGTGA GTACTACCGA TAGTGAGATT GTTGGACCGT TTGAATCCGA GGAAGAATGC ATCGAGGTTG CTCTTGGTCA AGCAAATGAG CCTACTCCGG TTGAGCGAGG CGAAATGTCT CTGGAACTGG TCAGCGCAAC CATTAGGAAA GGCGTGTTTA TCAAAATAAT GAAGCAAAAA AAGGACAAAG ATGGCGAGGA ACCAAAGCTC TACACCGTAT ATTTCCCAGT AAAGGAAGCA AAAGAGTGTG GTCAGCAGCT GTCAAACATT TTCCTGAGCA AAATGGAATG TGTTTCAGAT GCGACCAAAC TTAATAGCTT CAAGCGCTCG CTACTCAACA AATGTGAAGA AGAGTTCGAC AAACAGGATA TTTATGTAGA CTGGAAGAAA GAAAAGAATG ATTATGAGGC AAAGAAGGCT ACACTTACTG CTCAAGAACG GGCTGAAACA GAAGCTGAGC TGGACTTTCG TCGCATTCGA ATTAAGAAGC AGATGCTCGG AAACGTTAAG TTCATTGGGC AGCTGTACAA GAAGGGTTTA TTGAAGGAAA AAATTATGCG GTACTGCATT GCCAGTCTGC TAAAGCTGGA AGCAAACGAC GCCAAAGCCA AGAATCCGTT GTACCGAGAC ACCGGTGATT TTGATATCGA CGAGGAAGAC CACGAAGCGA TTTGCAGCAT GTTCACGACT ATTGGTCTGA CTATTGACAC TCTTTCGACC TCTGATTTCA TGAGCGTTTG TTTTGAGAAG ATTTCAAAAC TGAGCAACGA ATCAAGCCTC CCCGCCCGAT CTCGTTTCAT GTACAAAGAT TTGCTTGAGC TACGTGACAA TAGATGGGTT CCGCGGCGCA AGGAAGAAAA GGCGAAAACG CTCGAAGAGA TTCGAAAAGA TGTGGAAGAG GAAGAGCGAC GACAGGCTCA GCAGTCGATG CGCAATAATA GCAACCGTGG AGGTGGCGGT GGCAGCCGAG ACTTTCGTGG TAGTGGCATC GGCAATGCAT TCGGAGGAAG CAACCGATCG CGTCCACAGA AGCTATCTAG CGAAACGGAC GCTGACGGGT TCGTCGCTGT CCCCACCAAA ACTGGCTTTG CGTCCCTGAG AGGGCCAGCA GGGAAACCCA GGCACCAGCC TAGTGAAAGT ACCAGCACGG TCAGTACGAA ATCGTCAGCT TTTTCTGCGC TTGCTAAGGA TCGAGATCCG CCTTCATCGA AGAAGTCTCC CGCCCCACTC GATAGCGATA CCTTGGAACG CCGGATTAAG CGTATACGAA CGGATTTTAT GGGTGATGGA GGGAACGTTG AGGAGCTTCT ACTAAGCTGG GACGAGATAT GTGGCACACC AAAGGCTGGC GTCACCCTTG TTCAAAAGAA TTCCGATAGA ATGATGGACT GCAAAGAGGA TGAAAGGCAA GCCATCGTGA GAATTGTAGC GATACTAGCC GAGAAAGGCA AGCTCACAAA AGAGGACGTT CGCACTGGTC TGCAAGACGC GATTGAATTT ATCGACAGTT TCATACTTGA TTCACCACGG GCTTACGAGT ACTTGGGTGA CATGCTTGGG GAAATGCTGA AGTTGAAAGC GATTGATGTC GCATGGCTTT GCAAAGAGAC CGAGAAGACG AAAGTGGATC AAAATAGCGA GGCACCGATC CGACTAACAC GCGCAACCAT TTCAGCCTTA AAGGCAGCCG CGGGAGTAGA CTTTGCAAAG AAATGTTTTT CTGGATCCAG CGAGAAGGAT TTGATTGGTT TGATTGGTTC CGATTCCTGG AAATCTATGG CGGCTGATCA ATTTTCCTAA CATTCAAGAG TATAAATCTG AAGTTGAGAA TTGCAGTAAC CTGTATATGG CTGTATGATC TGAACCGTCT TCCGGGCAAG CGAAGAGCTT AACCTTTCTT GTCACAATGT GTGATCAATA ATACAGTTAG CAATAGTCTA CTTTGAAATT AATAGTGTTA TCG
|
Protein sequence | MASPNPAYAG RGGRGGRGGP PPYYNNNNSG VPGPRPPTGA WQPAPTHTMP TQSTAVPNPS SPNPHHLTPQ QQQQQQPPLL NNPYAPNQAG PRGNQPPPYP PPAYYNQHYA AAAQQQQQQP NPYGATGRVP WQPVPNQGYP STMRGGVFYP PPAAAAGPGR PSATNAGTIP ASVVVPRERK ALVIMDKDGN VMDFTKTAPK KTTATTSATL APSAVPAKTE SSSQDAGAKL RQAALERIQA QDKAKKEAEE KAKEQLLQEE QAKKKASDET EVQGKNTSEA QSKVAEEPTK SLADRLKQAP KPATVKTTTS SSGRIVFSKS ALLKFKSTER CMQCPESLPD MTIVKGPARG KSGGNRGGGG GGGDRRNDHD SGGSSWKRGN APPRRQSATN NDGGGGGGGG GSYWSRGQAP PPQQQQQNND KHRNNRGGRG PPPPLYDGPV APLVKSENHW RPQKNASAFI IAEKQVKAIL NKMTKEKFDK LATQMLEIPL TSSDMLKMMI NNVYDKAIDE PTFGDMYADL CKRLSKIAID FIKIIESDEE PPTDDDTTTE PASPADDKSS HHTVYRWSND VSTTDSEIVG PFESEEECIE VALGQANEPT PVERGEMSLE LVSATIRKGV FIKIMKQKKD KDGEEPKLYT VYFPVKEAKE CGQQLSNIFL SKMECVSDAT KLNSFKRSLL NKCEEEFDKQ DIYVDWKKEK NDYEAKKATL TAQERAETEA ELDFRRIRIK KQMLGNVKFI GQLYKKGLLK EKIMRYCIAS LLKLEANDAK AKNPLYRDTG DFDIDEEDHE AICSMFTTIG LTIDTLSTSD FMSVCFEKIS KLSNESSLPA RSRFMYKDLL ELRDNRWVPR RKEEKAKTLE EIRKDVEEEE RRQAQQSMRN NSNRGGGGGS RDFRGSGIGN AFGGSNRSRP QKLSSETDAD GFVAVPTKTG FASLRGPAGK PRHQPSESTS TVSTKSSAFS ALAKDRDPPS SKKSPAPLDS DTLERRIKRI RTDFMGDGGN VEELLLSWDE ICGTPKAGVT LVQKNSDRMM DCKEDERQAI VRIVAILAEK GKLTKEDVRT GLQDAIEFID SFILDSPRAY EYLGDMLGEM LKLKAIDVAW LCKETEKTKV DQNSEAPIRL TRATISALKA AAGVDFAKKC FSGSSEKDLI GLIGSDSWKS MAADQFS
|
| |