Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46523 |
Symbol | |
ID | 7201600 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 597089 |
End bp | 599378 |
Gene Length | 2290 bp |
Protein Length | 715 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180865 |
Protein GI | 219120244 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGCTTTGGG AAAATCGTCG GATCCTTACT CTTTTGCAGC GTCGCATTTT TGTCTTGTAC AGTTCATCTC CTTCCCATGA GATCTATTTT GCGTCAACAA GATCAACCCC GACTGTTACA CCGCCTTCCC AGCCAGCGGC AGTCGATGCA GGCATCATCA TATTTTCCCC TGGACAACGT GCTCGACGTA TCCTATCTCG ACTCTTCGCT GCCAAATTCG CGGGAATGCG ACTTTTACAC GAATCCCACT ATTCTCAGTC GTCTGATTTT GCACCAAAAG TACGAAGCCG CAATGCGCCG TTCCTCTACG CACAGCGAAG AAGCAAGGAC TTGGGTGGTC GTGCGGCGCC AGACTAGTCC GGCTTCTTCC GTGAGCAACC AAGCCGCGAC CCCATCATCC CCCGCCAAAA CGACATCGCA GCGTTCGAAC GTGACTTCCC TGAGCTCGTT GAGTGAAGAT GACGTCAACA ACACAACGCT TTCTTCGAAC AGAAATGGTG ACGTGAATTG TGAGTATTAT TCCTGTCGTC AGCTCCCCAT TCACATGGCC TGTGGAAACT TGTTTCGTGT AGTAGATCCG GCTTTGAAAG CTCAGCTCGA AAAGCTCATT GCCACGCTGG TGGTGGCCTT TCCCGAAGCT TGTTCCCAAC GCGATCACCA ACACCGGATG CCTTTGCACG AAGCGATCTG GTACCGAGCC GGTCCCGAGA CAATTTCGGC CTTGTTGATT GCCTATCCCG ACGCAGTTTC CATGCGGGAC AAGTACGGCC GCTATCCCAT GGCGCTCAAT GAGTGCCGAG ACAGCCCGTA TCGTACACAG ATTCGGCATA TGTTACTACA AGGTCGAGAC TTTTGGAATA CGGCCCGCAC GGAGGCCAAG CTGCGACTCA AACATCGCAC CGTGCCTGCC GATTTGCAGA GCGTTGCTTC CCAGAGCGTT TTGGCAGCCA GCGTAACCAG TACCGACGAC AATTCAATGT ACACACGTGG TGATAGTGTA CGGCAAGGCC GGGCGGGCCC GGGATATCAG CAACTATCGC CAACGATAAC TGGTTGTGGG GAATACAAAC GTACCATTAC TTCTTGGTCA CAGTTGGAGC ACCGCACGAA TACACTGGAA GAGAAATTGG CCGAGTCGAT GCAAGAAAAC TACGAAACTG GAAAAGAAGC AACCAAGTTG CGGGTTTCCA AAGCGAAATT ACAAGCAAAG TATGACGTAC TGATGGGGAC GGGCCTCGGA AAACAGATAG AACTTTTGCA AAATGAAAAG ATCGCACTGG AAGTTCAAGT TCGCGACCTC CAACAGCTCG CTCCACTTGC GGAAGTGCGA TCTTTTCCTC TCAACCAAGA CCACCCTATG GATAAACTTG TCCCACAAAA TATCGTCCTT CACTGTCAGC CCGAAGTGAC GGTAGACGTG GAATTGCAAG TCTTGAGGGA AGAAAATGTC CGGCTCAAAG CCGGTATGGG GCTGCTGTCG CAGAAGCATA AAGATTACCA ACGCCGCTTA GACTTTGCAG AATCTTTGTT GGACGACATG GAAGATCCCG AAGACTTCCC ATTCTTTGAC GACAACGCAA CCGATTACAG CACGATCTTT ACAATTTCTA CAGGAACGCC GAAAGAGAAA AGGATACACA CGCCGCGCCG TCCGGAACCG GAGAAGCGGG TCCTGTCGCC CGCACTCCAA CAAGTACGTC CCAAGTCGGC AATGAAAAAT TTCATGCCGG TGGAGGATGT CAGTCAGAAC AAGCCCGGTT TCATGGACCC ACTTGATCCA TCGCTGCTGG AAATGTCGCG AGAAGACGAC CTGGAGTCTA TCCTCAAAGG AGCACAAGAA TATTTTGATA AGTCGTCGGG CCTCACTCGT CGCCTTGCCG ATAGCATGTC ATTGCCTACA TCACGCATGA CGTCCCAGAT CACATTACCA AGCATTCTTG AAAAGCCCGA TGGGCAAAGC TCCCGTGACA GCCGGTCTGG AAGCTTCAAT AGTGGCCAGT TTAGCACGGA GGAAAAGGAG AGTGCGCTCG ACGACGAGAT CATCGAATCT AGTCTGCATA CTGCAATAGC CGGCAGCGAC AACTTGACAG TGCTTCTGCA AGAAACAGCC CGGCTCTACA GCGCTCTACC CCGTGATGCT TCCATCCCTC CTTCGCTATC TCCAAATGTA TCGGATGTCA CGTTGCAGCT GGCACGTATG GAACTGGAGC ATGGCTCGGT AAACTTCGAA GACCTTTTGG CAGAGGCCGC TCAAATCTAC AGCGGAAGCA GCAATGTGTC CCACTTCTAA
|
Protein sequence | MRSILRQQDQ PRLLHRLPSQ RQSMQASSYF PLDNVLDVSY LDSSLPNSRE CDFYTNPTIL SRLILHQKYE AAMRRSSTHS EEARTWVVVR RQTSPASSVS NQAATPSSPA KTTSQRSNVT SLSSLSEDDV NNTTLSSNRN GDVNYPALKA QLEKLIATLV VAFPEACSQR DHQHRMPLHE AIWYRAGPET ISALLIAYPD AVSMRDKYGR YPMALNECRD SPYRTQIRHM LLQGRDFWNT ARTEAKLRLK HRTVPADLQS VASQSVLAAS VTSTDDNSMY TRGDSVRQGR AGPGYQQLSP TITGCGEYKR TITSWSQLEH RTNTLEEKLA ESMQENYETG KEATKLRVSK AKLQAKYDVL MGTGLGKQIE LLQNEKIALE VQVRDLQQLA PLAEVRSFPL NQDHPMDKLV PQNIVLHCQP EVTVDVELQV LREENVRLKA GMGLLSQKHK DYQRRLDFAE SLLDDMEDPE DFPFFDDNAT DYSTIFTIST GTPKEKRIHT PRRPEPEKRV LSPALQQVRP KSAMKNFMPV EDVSQNKPGF MDPLDPSLLE MSREDDLESI LKGAQEYFDK SSGLTRRLAD SMSLPTSRMT SQITLPSILE KPDGQSSRDS RSGSFNSGQF STEEKESALD DEIIESSLHT AIAGSDNLTV LLQETARLYS ALPRDASIPP SLSPNVSDVT LQLARMELEH GSVNFEDLLA EAAQIYSGSS NVSHF
|
| |