Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41351 |
Symbol | |
ID | 7199161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | - |
Start bp | 249747 |
End bp | 251786 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185347 |
Protein GI | 219130385 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGCA CTTCTAATAC CGCAACGACT ACCGTCGCCT TGCCTCCCGC TCTGTCCTTG GAAAGCTTGT CTTGCTCGCA CAATGGCGGA GAAACCTGGC AGCTCCGGGA CGTTTCGTAC GTACTTCCCC GGGGTGCCAA GGTGGCACTC GTCGGACGCA ACGGGACGGG CAAGTCCACC TTGCTCCGCA TCTTAGCTTC CCGGGCGTGT GCGGACGCTG CTGACGAAGC ACAGAATATC AAATATACCG GACAAGTCGT GACGCCACGG GACGTCAAGG TTGCCTACGT CGAACAGGAG CCGCACCTGT CCATGGACTT GAACGTCGCC GACGCCCTCT TGGGGTTCCG TGGCGACGGC ACCGTCGAGA CCAGTAGTGC CAAAAATAAA TACGCAGCCG TCCGGAAATA CCGTCTAGCC GTCCAGGAAG CCGAAGTTAA GCCGGAAGCC TTTGCCCAGG CCTGCGCCGC CATGGATGCC TTGGAAGGAT GGAACGTCTG GACAAAAGCC GAAGAAGTCG CCACTAAGCT CCGCGTACGG CACCTACAAG ATCAGCCACT GGCCAAATTA TCCGGCGGGG AACGGAAGCG GGTCGCATTG GCCGCTGCGT TGGTCCAAGA ACCCGACGTA CTTTTGTTGG ATGAACCGAC AAATTTCTTG TCCCTCGCGG GGGTTCAGTG GCTGAGTGAT TTGCTGCTCG GTGACAAGAA GCTTACCATT CTCATGGTGA CGCACGACCG AGCCTTTCTG GACGAAGTGT GCGATCGAAT ACTGGAACTG GATCAAGGAT CCGTTTACGA ATACGTCGGA TCGTACGCCG ACTACTTGGA AGGAAAACAG GAACGGCTCG CCGTGGAAGA CGCCGCCTAC CAGTCCGCCA AGGCCAAGTA CGCGGTCGAG CTCGATTGGA TGCGCCGACA ACCACAGGCC CGTCAAACCA AGGCGAAAGC TCGCATCGAC GCATTTTACA AGCTGGAGCA AGCGACCAAA CCGAGACCCC GCGATCCCAC TCTCAATCTA GCCAGCGAAT CCCGACGTAT CGGTGGCAAA ATTATTTCCA TGAGAAACGT TTCGCTGAAA TTCGGAGACC GGACCATGCT GAAAGATTTT TCCTACGATT TTTGCAAAGG CGACCGGATT TGTCTGAGCG GCGGCAACGG CATTGGAAAA ACCACATTTT TACGCGTGTT GACAGGCGAG CAACCGGCCG ACGCTGGCGA TATTGACATT GGCGACACGA TCGTGCTCGG CGTATACGAA CAAAACGGCA TCGAAATCGA GGACCCCGAG CAGACCGTGC TCGAATTTGC CGTCGAGCAG GCCCGAGCCC GGGACGGAGC CAGCGCCGAC GAAGGTCCGG ACGACGCCCG AAGGTTGTTA CGGCAATTTG AATTCCCTCA AGCCCGTTGG GCCGAACGCA TTTCGGTCCT CTCTGGTGGA GAAAAGCGAC GGCTACAAAT GCTTTCAGTC TTTAGCCAAC GGCCAAATGT GCTGATTATG GACGAACCGT CGGTAGATTG TGACTTGGAT ACGTTGACGG CTCTCGAAAA GTATTTGCAA GAGTTTGATG GGGTGCTGCT GATCGTCAGT CACGACCGCG CATTCGCCGA CAAGGTCACG GATCACTTGT TTGTCTTTGA AGGACACGGG GAAATTAAGG ATTTTCAAGG AAGTCTTTCT GAATACGCGA CCACCTTGAT TGAATTAGAG AACGACCGTA TTGCGGAAGG GTCCCGCGGA CAGGCTGATA CGGAAGAGAA AAAGGGAGCC TACAAAGAAG ACAAGGCCAA ACGGAACGAG CAACGCAATC AAGTGCGCCG GGCAAAAAAG GATATGGTCA ATGTCGAAAA GGCGATTGAA AAACTAAAAG AAACCGCAGC CTCGTACGAG AAAGAAATCG ACGTTTGTAG CGGCGAAGGC TGGACCATTC TGGCCGATTT GACTGACAAA TTGAACAAGG TGAACGAGGA AATCGACGAG AAAGAAATGC GGTGGATGGA ATTGGGAGAG CTAGTGGAAG AGAGTGAGGT CGAAGCGTAA
|
Protein sequence | MSSTSNTATT TVALPPALSL ESLSCSHNGG ETWQLRDVSY VLPRGAKVAL VGRNGTGKST LLRILASRAC ADAADEAQNI KYTGQVVTPR DVKVAYVEQE PHLSMDLNVA DALLGFRGDG TVETSSAKNK YAAVRKYRLA VQEAEVKPEA FAQACAAMDA LEGWNVWTKA EEVATKLRVR HLQDQPLAKL SGGERKRVAL AAALVQEPDV LLLDEPTNFL SLAGVQWLSD LLLGDKKLTI LMVTHDRAFL DEVCDRILEL DQGSVYEYVG SYADYLEGKQ ERLAVEDAAY QSAKAKYAVE LDWMRRQPQA RQTKAKARID AFYKLEQATK PRPRDPTLNL ASESRRIGGK IISMRNVSLK FGDRTMLKDF SYDFCKGDRI CLSGGNGIGK TTFLRVLTGE QPADAGDIDI GDTIVLGVYE QNGIEIEDPE QTVLEFAVEQ ARARDGASAD EGPDDARRLL RQFEFPQARW AERISVLSGG EKRRLQMLSV FSQRPNVLIM DEPSVDCDLD TLTALEKYLQ EFDGVLLIVS HDRAFADKVT DHLFVFEGHG EIKDFQGSLS EYATTLIELE NDRIAEGSRG QADTEEKKGA YKEDKAKRNE QRNQVRRAKK DMVNVEKAIE KLKETAASYE KEIDVCSGEG WTILADLTDK LNKVNEEIDE KEMRWMELGE LVEESEVEA
|
| |