Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48471 |
Symbol | |
ID | 7203699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 592640 |
End bp | 594609 |
Gene Length | 1970 bp |
Protein Length | 495 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182868 |
Protein GI | 219125187 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00174392 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTTTCCTT GTCTCATCAA GCAGTGTTGT AATTTGTCAT TTGAACGTTG ATCCTCTCAC TGTCAGTGCC GGCCGAAACA ATCGTAAGCT TTCCGGTGGA GGGTTTAACG TTTGATAGTA TTGAAAAGTC CGTTCGCGGT CCTTAAGGAG ACGCGGAGAT ACGGAGATTG AAGATCATCA TCATGTACGC AATTTCTGAT ACCAGTTCAC AGCTGAACAG CGATCCAGCC ATGGGGACCA AAAGTCTCAA GCTCTTTCAA CCAAGAGCTC CTCTTTCTAG GCCTCCGCTG GCTCCGCCTG TTTCCGCTTC ATGTCAACCT CCGAACTTGG AAGACCCGGC TCCGGAACAT TTAAACGTGG TTGAGGAGAA CGATCTTTCC CATCACTTCA TCCCAGATTT TAACCTAGTC TCGGCGACAA ATTTTTCGCA GGAGAGTTCG AGCTGCTGTG TCTCCTCACT GGGAGGATTT CTTGAAGAAG AGCAGAAATC ATACGACGAG AGAGAATGCA TTCCTGTGGG ATTCTTTTGG CGTAACAAGC AAAAGGTTTC ATCCGCAGAT GTGGCGTCGG TAGGTAACGC CAGTGACACC GTGGTGTCGG TGGACGACCA TCAACTCCAT CTGTCCGGTC GCGATTTACA CGAATCCGCG AAGATGGCCT TGAATGCTGG AGACTTCACC AAAAGTCTGT CCATGTTCGA AGCGATTCTA ATGGCGCAAG CCCAGCGGTT TGGTCCCTGT CACCCATCCG TTGCCGCTGC TATGCACAAT GTCGGAGGTA TGCATATCAT GAATCGTTTG AACATGTTAT TTCAAGCTGT AGATAGCTCA TCCTTCCTGT GCTCACTGCT TTTAGTCTGT CGACAGCGGA TGGGGCAACA TGATACAGCG GAGAATCTTT TTGCTGAAGC TGTTCAAGTG CGTCGGCAAA CTCTCGGCAG CGATCACCTG GAAGTTGCTG CTTCGCTTTC TAAGCTAGGA TCAACAAGAG TGGCGCTGCA AAAGTTCGAT TTGGCCTTTG GCGATCTCCG AAACGCCTCC AAGATTGCTA CCAAAAATCT TGGCCATGAA CACAAGACAG TTGCTCAAAT ACAGTCCCAC CTCGCATGTT TGTATTTCGA AGGCGGCGAG CTTTTTGCTG CTCAAGCAAC TTTTGAGGAC GCTCTAGAGA TCTACCGCGC TGTTTGGTCT AGTCAAGAGT CGAATCGCGA TACCACTATG ATGCAACTTA CAGATACACT TTGTAACATT GGATCTATTT TGAACCGACG CAAACGTTTT GGAGATGCTA TTCACTCCTT TTCGGAAGCT TTGGATCTCC AACGTGGTAT TTTTTCCCAC AACCACCCGC GCATTGTGCA GACTCTGGAC AATTTAGGAT ATTCATACTC AAAAAACAAG GAATATGGGA GGGCCTTGAC CTGTTACAAG ACTATGCTTC GCATGCAATT CTCTCATTAC GGGACCTTCA ACAACTTTTG CCTCGAAACA TTCCGCAAAG AAATTATAAT GTATGAAAAG CTCAAACGTC TTCCCGAAGC AGTAAACGAA ACGAAGGAAA CACTCAAACT CGAAATTTCG GTCTTGCCTA GAGATCACAC AATCGTGGTC CAAACAAAAC AGCTATTGGA AGACTTGGTA AAGCGTTGCA AACGAAAGTC TTCGCCTTGA CTTACACTTG CGTCAGGTTA ACCTTCATAG ATATCATTGC TACTTTGAAA GCTGTTCAAT GCTATTGATC CAATGAGAGA CATCTCAGCG AGATGATCTT TGGATTTACT GTCGAGTAAG TGGTCTAGTA TAGGGGCAAA TCTGAATGTC TTGACATTCC CTATGGATCG TCTCGAGAAT TATTTTGGAA CGCTGACTAA CAGTGACTTT GTTTTTCAAC GGAACGCAGC TGAAGGGCAT ATTTTGGTGG TTTCGTGAAA GTAATGGCAA TTAATAAATT TAATGGAATG CGACATAGTC
|
Protein sequence | MYAISDTSSQ LNSDPAMGTK SLKLFQPRAP LSRPPLAPPV SASCQPPNLE DPAPEHLNVV EENDLSHHFI PDFNLVSATN FSQESSSCCV SSLGGFLEEE QKSYDERECI PVGFFWRNKQ KVSSADVASV GNASDTVVSV DDHQLHLSGR DLHESAKMAL NAGDFTKSLS MFEAILMAQA QRFGPCHPSV AAAMHNVGGM HIMNRLNMLF QAVDSSSFLC SLLLVCRQRM GQHDTAENLF AEAVQVRRQT LGSDHLEVAA SLSKLGSTRV ALQKFDLAFG DLRNASKIAT KNLGHEHKTV AQIQSHLACL YFEGGELFAA QATFEDALEI YRAVWSSQES NRDTTMMQLT DTLCNIGSIL NRRKRFGDAI HSFSEALDLQ RGIFSHNHPR IVQTLDNLGY SYSKNKEYGR ALTCYKTMLR MQFSHYGTFN NFCLETFRKE IIMYEKLKRL PEAVNETKET LKLEISVLPR DHTIVVQTKQ LLEDLVKRCK RKSSP
|
| |