Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48422 |
Symbol | |
ID | 7203671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 416852 |
End bp | 419069 |
Gene Length | 2218 bp |
Protein Length | 702 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182831 |
Protein GI | 219125110 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATGCA AGTCTTCGAA AGCCGCACCG TCAGAACGGG GCTCGCAAGG CGCTGCTAAA CGCACTAGTA TGGACTCCAG AACCAAACGG AGCAGTCGAG GCATCTCCCC AAAGCCATCG TCGCCGTCGT CGTCACCGTA CTTGGGAACC AAGTCGACGG CGTCCTACGA CACCACTTCG GTCCAGAACG ACAAACTCTA CAAGCTCCTC GTCTCGGCAC AGCAAGGCAA AGTCGACGGC GCCGTCCTCG TCCAGAAAGT GCGCAAGTTC ATTACCGTAA ATCCTGCATT CGCCGCGTAC CAAAACCCCA ACACCAAATC CACGCCACTG CACATGGCTG TACGTTTACT GGATCTAACG GATCTCTCGT CTGCCGCCGC TGCGATTGTT CCTAACCTCG TCACGGCGTA TACGGCGGCC GTCGCGACAC GGGACGCGGC TGGCAATCTA CCACTCCACT ACGCCCTCGC TCCGACCTCG CACTTTGGTG GCGGACCCCG GACCAGTTTG ACCCCACGGA CCGCCGTCGT GCAGGCGCTC TGGACCGTCG CACCGAAATT CGCCCTCGCG TACTGGAATC GGAACGACGT GGTGTACGAA TCGGGGGACG CCACCGGTGG ATGCTCCCCG CTATACAGAG TTCTGCAAAC TCTACCGGAC GACTTTGTGG CGCATGCCGC CACCACGGTC GAATACGTAC ACGTTCTCTG TCATTTGGCG GCTGGAACGG AACATTACAA TACCGCGAGC AAGGCCTTTG TCAGTCTCGG CAACACCAGT GACGCCGACA AGCCTCTCGC CCTGCTCTAC CGACGATTCA CGCGACAGTT CGATGCCGCC GAAAAATTCT TCGACGGAGA CAACTCGCGT GACGAAGTAG TGCAACACCG ACGACGGTAC AAGACGGCGG CAGGGAATAC CTGGAAGATT ATTGAATGTC TTTTGCAGCC CCACGCGCCC GCTTCTTCGT CCTGGCAAAT TGTCCACCGC GCCGTACAGG TGGAAACCCC ACCCGACCTC CTGAGATATA TTGTGGAAAC CAACGCTGAA GATTTGACCA CGGCCGACGA AACGGGAAAT CTGCCCCTGC ATCACGCCGC CATGTCCAAG CCTCACTCGT TGTCGGGTGC GGCTTCGTCT GGCGGATTTC CCGCCTTTTA CACGAAATTC ATTGTGGATG AACTCTTGTA CAAGTTCCCC GACGCGGCCA GTATACCCAA CGCGGACGGT AAATACCCTC TCACACTGGC TGTTCAAGCG GGTAAACAGT GGATTGGTGG TGGCATCAAG AGCCTTTACG AAGCCTATCC GGAAGCCCTC ACACAGGTTA AACTCAATGA ACATAAAGTG TTACGCCAGG CGCTTTCGAT GGATGTCGGA AGCAGCAACC CCAATTCACC AACAGTCGAC GACAGGGAAG AAAAACTCGA CTCAATCATC CGAGACGAGC AGCACGACGC CATTATGCTC GTTCAGCAAG AAACAGTGGA CGTCCCGGAG GTAGTCTTTT CTATGTGGGC TCATGAAGAA GATGCCGGCG TGCAAATGCT CGGTTGCGTG GCACTCCACC GCATGGTCCG GAATGTGAAC GACCCCGCCG ATACCTTACG AATAGCCCTC TCCGCCGTAG CAGCGATTGT CAACGCCATG AAGGCGCACC CTAATGAAGT CATTGTGCAG GAAAAAGCCT GCCAAGTATT GCAAAGTCTC GCCGTCGTCG ACGGTCACCG GGAAGTATCC TTTGTCGCAT CCGGTGCCGT TGCGTCTATT GTTGGCGCCA TGCAGGCCCA TGTCAGCGAC CCCGGCGTCC AGGAAGAAGC CTGCACCGCG CTCGCCGCCG TGATTCGTGT CGGTGGCGCC GAGCGCGCCA CCATTGTGGC CAGCGTTAGC GGCTTGACCG CCATGCTCAA CGCCCTCGCA GCCCATCCCG ACGTGGTCGG CGTCCAGGTC GCGGCACTTA ACGCCCTCGT CATGCTCACA TCCTTTCCCC GCGCCAACCT ACCCGATGTA CCGCGATCCC AAACGGAAGC TCTCTTGTTA GCGGCCCGCG ACAAATTCCC GCTCGAATGC CACGCACACG TCGAAACGCT CCGGTCACGA CTGTCTTGAA ATCCGTGGCG TGGGAGATGG CGATTTGGTG CCCACCCACC GGAACAAATG GATAGTGTAC GTCGGCTCAA TGGCTGCTGG GCGGAAGTTT CCAAGAGGGG TGTGACTT
|
Protein sequence | MGCKSSKAAP SERGSQGAAK RTSMDSRTKR SSRGISPKPS SPSSSPYLGT KSTASYDTTS VQNDKLYKLL VSAQQGKVDG AVLVQKVRKF ITVNPAFAAY QNPNTKSTPL HMAVRLLDLT DLSSAAAAIV PNLVTAYTAA VATRDAAGNL PLHYALAPTS HFGGGPRTSL TPRTAVVQAL WTVAPKFALA YWNRNDVVYE SGDATGGCSP LYRVLQTLPD DFVAHAATTV EYVHVLCHLA AGTEHYNTAS KAFVSLGNTS DADKPLALLY RRFTRQFDAA EKFFDGDNSR DEVVQHRRRY KTAAGNTWKI IECLLQPHAP ASSSWQIVHR AVQVETPPDL LRYIVETNAE DLTTADETGN LPLHHAAMSK PHSLSGAASS GGFPAFYTKF IVDELLYKFP DAASIPNADG KYPLTLAVQA GKQWIGGGIK SLYEAYPEAL TQVKLNEHKV LRQALSMDVG SSNPNSPTVD DREEKLDSII RDEQHDAIML VQQETVDVPE VVFSMWAHEE DAGVQMLGCV ALHRMVRNVN DPADTLRIAL SAVAAIVNAM KAHPNEVIVQ EKACQVLQSL AVVDGHREVS FVASGAVASI VGAMQAHVSD PGVQEEACTA LAAVIRVGGA ERATIVASVS GLTAMLNALA AHPDVVGVQV AALNALVMLT SFPRANLPDV PRSQTEALLL AARDKFPLEC HAHVETLRSR LS
|
| |