Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42557 |
Symbol | |
ID | 7196096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 413686 |
End bp | 415771 |
Gene Length | 2086 bp |
Protein Length | 692 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176580 |
Protein GI | 219109652 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGTGG ATTCTTCCGT CGCTTCTTTC GACGACACGA CCCGAACCGA TGGTTCCTCT TTTGAAGCCG ACGATGTCTT GGCTTTTCTG GAAAACAATT TACAAAACGT TCGTGACCTG TGTGTCTCGA AACTTTGGAA CGACGACCGC CTCGCTGATT TAGACGATTT CTTACGGAGC CATCACGCTG CTCTGCACCT ACAAGCCTTG CGCTTACCCA ACAACGGACT TACATCCCGT TCCTCGGTTA ACCTTGCCAA CATTCTTTCC ACAACGCAAA CCCTGCGCGA ACTCGATTTG TCCGACAATC AAGTGGAGTC CCAGGGACTG CTGGCGCTGT TGCCGGCGTT GACGCACGAA ACCTGCGCAC TCCGGCGCCT CGACTTGTAC AATAATAAAC TCGGCGCCAC CGGGGCCACA CAAATTGCCG CCATTCTACG GGACAACCGA TCTCTTCGCG AACTCCGCAT TGGCAAAAAC AATCTCGGCC GGAAAAAGTC CCTCAAAGTA ATTTCCACGG CGCTGCAACG GAACGCAACC CTGCGAACGC TTGATCTGTC GCACAACCAA ATTGATGACG GCGGCGCCAT TTTATTGGCG CCCGTTTTGG ATCCGGAAGT CTCACAGTCA CGTTTGCGCC GCTTGGACTT GACCTACAAC AAAATTTGGC CAGAGGGTGT CCGAAACCTT ACCGGAGCCC TGCTGGAAGG CAACCGGACC TTGCGATGTT TGAACTTGAG TATGAATCAC GTTGGACCCG AGGGGGCGGA GTCATTGGCG GTCCTATTGA AGTTCTCCTT CACGTTGCAG GAACTTTTAC TGTCGCGCAA CGCTTTGGGA GACCATGGCG TCAAATTATT GTGCCAAGGG CTAGACGAGA GTAAATTGTT GAGTGGGACA GGCTTGCAAA GATTGGATTT GGACTGGAAC GAGATACACG ACGACGGAGC CAAGGAATTG GCGACAATGC TGCTAGACAA CGCTATACTG GAGTCCCTCA ACTTGGCGAG TAACGCTATC GGTAGCGATG GAGCCAAGGC TCTAGCGAAT GCTCTGCACT CCAATCAAGC TTTGACATTT TTGAATCTAA TGGGAAACCA AATTCGAGAT CCTGGTGCGT TCTCTCTAGC CGAGAACCTT TGCCGCCCGT CGTGTCGAGT GGAAACGTTG CTGTGGGAAA AGAACAATTG TTTGACGCCT TTGGGAGAAG AGCGACTCAT CGCGGCGTTT GACTTTCGGA AGAACCGGAG AACGTGGCTA GGTCAGATAC TTCGTGAAAT AGAAACATGC CAAAGTGTCA ATTTCAATTT GTTGTCGTGC AAACTCAGCG ACGAGGAAAT TATGGCGTTA GCGAAACATC TCGCTCAGTA CCGGCCTCGA GTTTCGACCG CGTATCTGGG TGGACACGGC GTAACAGTTC GAAGCATGAA AGTTTTGGCC AAGGACGTGC TTGCCAACAA CCACGTCAAT CTTCAACGGT TACACTTACA GCATACTCGT GTTGGGGATG AAGGGGCAGG AGCATTGGCG GAAGCACTGC TGTCTAACTC CAATTTGCGA ACTTTGACAT TATTCGATTG TAGTATAAGC CCAGAAGGAG CAAAGTTGTT GGCGCATACG TTGGCTCAGA ACAAGTCTTT GACACAACTA AATCTTCACA AGAATGCAAT CGGGAACCGA GGGGCACAGG AGCTTTTTAC GGCTTTAGTT GACCCACCGC ATCCCTCACT GGTTGTGTTG AATTTAGAAC AGAACGAAAT TAGCGACGGT GCACTCTTGC AGTTCCAATC GTTTGGCAGA CTGCAGCAGC TGAACATTGC TTCCAACAAT TTCACAGACC GTGCCGCCTT GGATTTGGCC AAGGCATGTT TCAACTCTTT GGCCAACGGC ACCCTTCAGC TAAGCTGGCT GACGGTGTCG AACAATTTTA TCTCAAAGAA AGGCTTGAAG GCCTTGGCAT TATTTCTTCC GGACGGGTTA GTCCTCGAAA ATGATGGCCA ATTAGAAGCG CAAACGGTAT TACCAATTAG AAGCGCAAAC GGTATTGACG CGTTGTACAA CAAAGCTTCT CTTAGCTAGC AATAAG
|
Protein sequence | MAVDSSVASF DDTTRTDGSS FEADDVLAFL ENNLQNVRDL CVSKLWNDDR LADLDDFLRS HHAALHLQAL RLPNNGLTSR SSVNLANILS TTQTLRELDL SDNQVESQGL LALLPALTHE TCALRRLDLY NNKLGATGAT QIAAILRDNR SLRELRIGKN NLGRKKSLKV ISTALQRNAT LRTLDLSHNQ IDDGGAILLA PVLDPEVSQS RLRRLDLTYN KIWPEGVRNL TGALLEGNRT LRCLNLSMNH VGPEGAESLA VLLKFSFTLQ ELLLSRNALG DHGVKLLCQG LDESKLLSGT GLQRLDLDWN EIHDDGAKEL ATMLLDNAIL ESLNLASNAI GSDGAKALAN ALHSNQALTF LNLMGNQIRD PGAFSLAENL CRPSCRVETL LWEKNNCLTP LGEERLIAAF DFRKNRRTWL GQILREIETC QSVNFNLLSC KLSDEEIMAL AKHLAQYRPR VSTAYLGGHG VTVRSMKVLA KDVLANNHVN LQRLHLQHTR VGDEGAGALA EALLSNSNLR TLTLFDCSIS PEGAKLLAHT LAQNKSLTQL NLHKNAIGNR GAQELFTALV DPPHPSLVVL NLEQNEISDG ALLQFQSFGR LQQLNIASNN FTDRAALDLA KACFNSLANG TLQLSWLTVS NNFISKKGLK ALALFLPDGL VLENDGQLEA QTVLPIRSAN GIDALYNKAS LS
|
| |