Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48267 |
Symbol | |
ID | 7203370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 684636 |
End bp | 686836 |
Gene Length | 2201 bp |
Protein Length | 570 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182584 |
Protein GI | 219124593 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000901628 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTTGCGCAG GGTTGCTTCC TTGCTTGTTT GCACGTCCTA AAATCAATCT CTCTAGAGTA GCATTCATTG CTTACAGTTG CTTTGTTAGT AGAGAGTTTC CTTGGTTGCG ATCGTGCGAG TGCCAATCCT GCCGTACAAT CCCAAACCAA ACTGTACCAA TTCACCTTCG TTCCAACTCC AACGCTTCGA CAGACCCATC CGTCCCTACC AACATGACGG CGGCGAACAG CGTTCAACCG CAGAGCGATC ACGCTCCGAG CGAAGAGCGT CGAAAGCTAC TCAACGAGGA ATTAGCGGTA CCACTGGTGT CGAATTTCTT CCCCATTGAA GGCTACTACA CCGCGGCGCA AAAGGTTTTC GATAGCTTCC AGGATGCCTT TGAACACCGA CAGATCGACA ACGCCTTTGT CTACGGGAAA CGATACTGTC TCTTTGTAGT AGACGCTATT CCACAACACA ATTACTTCAA CGCTACAAAA ATCAAAAAGA TGCAGAACCA GCATCATCGG CAGGTCGACC TCGTCATTGA CCAACTCGAC GTCGTGGCGA CGTGGATGGA CGAGTCGGAA ATGGAGCGCC AAACGCGAGA GCGAGAAGAA GCCAAACGAC GACGACAGCT CGCTATTCAA AGAGCCAAAG CGGAAACTGT GCGCTACCAA CAACAGGAGC AGGAACGCTA TCGCCAATTG CAGCTACGGT TCGATCAACA CAAGACACAC AAAGAGAGTC TCAATGAGGA TCCGGAGCAC GTACAGGCAT CGGCGATGGA AAAGTTGGAA AAGCTTCGAG CTCTCCAAAA TGGTGTCGAT GTGGCGGCAC GCATTCCCCA AGATCCTTCC GGCGAAGAAG CGGGCAAACC TGGATCACGA TACCGCCTTT TATCGGATTC CGAAGAAGAC CACGCCGAGC AACAGCAAGG CGATCAAAGG AATCCACCCT ACGATACAAT CATTAGTGGG ACCGTTCTGC CACCTCCTTT ACCCCTTCCA AGTGCACCGC CATCCTACGA TGCCATAGTT ACATCTCGGT CGTCCCGCAA CTTTTTGGGC CCGGCAGTGC CCTCGGAACC ATTCCCCAAA TCGACATTTT TGAACGGCAA CAAGTTTGTG GACGAAACGA CGGTAGCATT GCCCGCAACC CCCGAAACAC CAGCACGCCG CCAGCGCGTT CCTATGCGAG AGCTTCAGCA CCGGTACAAG CAAACATACG TAAAATACCA ACAGGCGGGG AAAATCAAGG TCTCCGGTAT CAACACTTAT CAAGGGCGGT TAATCGAATC GACCAACGGA TGTACAGTCA TTTCAGCTTT AGTCGCCGCG CATCAATTAT CCTCTCGATC GGGGGCAGTC ACGGACGCAA CCGTCATCAA CGTGATCGAT CGGCAATGTG GACCACTCTT ACGTGAAATA CGGGGCAAAC TTGGCTTGGG TGGTCACGCT CTGATTATTC CATCTGATGT ACACGATCAT CTTGTCGATC ACAACATATT GTCACAAGAG ACATTCGTTG GTGCCGCCGG CGGCAACATA TTGGACGAAG GCCACATGAA CGAGTTTCTC AAACTTTTGC AAGGAGATAG CGCACAGCAT GCCAAAGCTG CCGCCACACT ATTTTTTCGT GAGCACGTCA TTTCAATTGT TAAATCACAG CATGGCAAAG CCATTAGTGG TAGCCTTAGT AACCAGGGCT TGTGCTGTTA CGAATTAATC GATTCAATGC CGGGAATGTT TGATGGTGGA CGAGGCATGG CGACTCGTAC ACGTTGTACG GATATGGATT CTTTGCAAGT TCTATTGCGG TGGTACGCCT CGCGAAAATT TTCTGACTCC AACTGTTCCT ACATTGACAA AACCATTTGG GATGACAGCA TGGCTGACTT TGATCCCCGT GTCTTTCAAG GATTTGTTTG GGCGGCAGCC TCTTGAACGA ATAAGTTTTG CAAAAGATGT CTCGGCAAAA GATATACGTG ACTTTTTTCC ATTGTTCGGC TATCCGCGAA TGCGCGGCCC TTGCTTGTAC AACAATCCGC ATCAGAATTG AAATTAGAGC CTACCAGAAA CGAAGCTCCA AGTGGTACAT TGATAATAAA ACCATCTTCA ACGGAACAAA AAGGAAACTG CTGGAAAACT TTGTTTCGCC GTCATTCAGG AGTCCAGAAT CTTGGGAAAT TTGCGGAGTT CAGTTTTTCT ATCTTGATCT C
|
Protein sequence | MTAANSVQPQ SDHAPSEERR KLLNEELAVP LVSNFFPIEG YYTAAQKVFD SFQDAFEHRQ IDNAFVYGKR YCLFVVDAIP QHNYFNATKI KKMQNQHHRQ VDLVIDQLDV VATWMDESEM ERQTREREEA KRRRQLAIQR AKAETVRYQQ QEQERYRQLQ LRFDQHKTHK ESLNEDPEHV QASAMEKLEK LRALQNGVDV AARIPQDPSG EEAGKPGSRY RLLSDSEEDH AEQQQGDQRN PPYDTIISGT VLPPPLPLPS APPSYDAIVT SRSSRNFLGP AVPSEPFPKS TFLNGNKFVD ETTVALPATP ETPARRQRVP MRELQHRYKQ TYVKYQQAGK IKVSGINTYQ GRLIESTNGC TVISALVAAH QLSSRSGAVT DATVINVIDR QCGPLLREIR GKLGLGGHAL IIPSDVHDHL VDHNILSQET FVGAAGGNIL DEGHMNEFLK LLQGDSAQHA KAAATLFFRE HVISIVKSQH GKAISGSLSN QGLCCYELID SMPGMFDGGR GMATRTRCTD MDSLQVLLRW YASRKFSDSN CSYIDKTIWD DSMADFDPRV FQGFVWAAAS
|
| |