Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48806 |
Symbol | |
ID | 7195112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 286524 |
End bp | 288834 |
Gene Length | 2311 bp |
Protein Length | 675 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183460 |
Protein GI | 219126429 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0022357 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGTTTGCC TTAAAGTTTT CTGCCGGAAC CTGCCTAAAT ATCCGACACG CCTCCTACTT CTTAGAGTAT AGCAATCAGA GGTCCTCAAA AATTTAGATC CACAACCAAA AATGGCAAAG GATACCATCG AGCCTGGACC CAGTCCGTTG TTGATTGCGT TTTCGGATCA CCATGTCTCC GCTTCAGTGA CAGAGCTTGA CCACTACCGT ATTCGTGGAG GAATGGCTGT TGAAACGGAT TACATTGTGA CCATCACACC CTTGGGAGAA GAGTCCTGCT TTGACAGCTT TACGATCAGC AAAACCTACT CGGCCTTCCG AACCTTATCT CACGCGCTCA AAAAAATTGT AGATAAGCAC ACGAGCGCCA ACGAAATTCT CCCACAGAAT GTGATCAAGA CGGCCAAGTA CTGCGAACTC GTCAAGCATC TCATTGGCTC GCAACGAACA GAGTATTTGG GTAAGGTCAA CTACATGTAC GTCAAGATTC AAGCCAAGCA GCGATCAAAA CTTCTGGATG ATGTGTTAGA AGCTACGTGT AATTACTTCC CTGCGGATCC CAGTGTACAT CCGCTCGTGT CCGAAGTCGC CACTTTGCTG GAAACCTTCT TTTTGACGGA TCACTGTGAA GCAGACGAAA CTGCTGATCT TCATCGAGGC GCGAGCACGA GTTCTGTAGG AAGTAAAAAT GCCGAAAAAG GTAAGAAAAA CAACCCACTC GGTTTCCTGC CGACCCTACC GCCGTTCTTG GGCAAGACCA AGAAGAAGTC GGCAAGAAAC AACATGGCCA GTATGGTGGT TCCAATTACG CGAAAAGTCC GGCGTCCTAA GGCGACCCGC GACCAGGACG AGGAGGAGCT AGCCTTGGTG GGGGACGATG CCAAACTTTT GTTAGACGAT GATCGACCCA CGACGGAACT TTTGCCAAAT TACGCACGCC CGGTACCCGT CGTACGCACC GGTGGTACCC GTATTGGTAA CCTTATCGAG AACAATCCAG TGGTATTCTT GGGCATTGCT GGAGGCGCCA TTACAGTATT GCACTTTGCG TCCGAAGCAA AGATTACTAT GGACGTTGAC ATTGCTCTTC TGGTTATCTT TGCTGCATTT TGCCTTGGTT TACATACGCC CCGTCCAATG GTAGGGGGTT TTGACCGTCC CCCTACCATG AAGGGTACTT TTCAGGCCGA TGGCCATCGC AAGCTCATGC GGCGTTCGAT GATTGTTGCT GTACCTGTGC CTGGTGAAGT TCGAGAGGAA GCGGAAGAGC CAGATATCAT TCTTGGAAGT CCTTTGCCTG TTTTTCCGTT GGGTGCCAAA ATTGGATCGC ACAACAATTG TTGGTCAGAG CCGGATCCAG CGACTTTCCA AGTACGTGGT GACAAGTATC TCCAAGACAA GAAGAAGATG GCATCTGGGG AGTTTATGTT TCCGGTGCGC GGTGTGGATT TATTTCTGAC AGACACGTGT CCCGAAAATG TTGGGAGTAA TTCGAGTGTC TTTGGTGGAC GCTTACGCGA AAAACCTACT TTTCTCATCA ACTTTCGGTT GCCGTGGGGT GTGCTGATTT TCTATTTCGA AATTCCGGAA AAGTTTGTTC CATTCTTGCA GGCTTGTTAT GAAGACGATT TCGACAAGTC GACTCTCGCT GAGCTAGGAC CTATGTCTGC AGCGGACAGA ACTGTGTGTC GCTTTTTAAT GAAAAACATG GCTCACAAGA ACAAAACATT AAAAATTGTT CCAGTTGTTG TAGCAGGGCC TTGGGTGGTC AAGAGTGTTG TTGGTGGGAA ACCGGCAATT GTTGGTAACA AGTTACCCAT TAATTACCTC TACGCGCCTG CCAAAGACGA CAAGGCTTGT TACTTGGAAG CTGACCTGGA TATTGTTGCG TCATCCGCCG CCCGAGGTAT TCTCTCTGTA GCTCGAACGT ATACGCAAGA CTTGACAATT GATCTCGGCT TCGTCATTCA GGGAAACACC GAAGACGAGC TTCCGGAACA GATGCTGGTG GGCTGCCGAT TGCATGGCGT CGATCCTTTG AACGCGGCCT CGATGCCGCC TATGAAAGAC GATCTCATGA TCAACAGTAG CTTGTCGGCG GATGATGACA GTGTAACGCC CACGTTGCAA GCGGAATGAG CTCTGAACCA AAAGCTCATT AGGGGAGAAA AATTTATATT CACAAATTCC ATAAAGACAA GGTTCGCTAA CCTTATAGCA AAAGGAACTA GAGGATTCGC TGTCGCTTAC CTCATGAGTG AATAGGTAGG TATTTTGTGT CCACCAGTTG TAAAAGTGCG ATTACCCAAA G
|
Protein sequence | MAKDTIEPGP SPLLIAFSDH HVSASVTELD HYRIRGGMAV ETDYIVTITP LGEESCFDSF TISKTYSAFR TLSHALKKIV DKHTSANEIL PQNVIKTAKY CELVKHLIGS QRTEYLGKVN YMYVKIQAKQ RSKLLDDVLE ATCNYFPADP SVHPLVSEVA TLLETFFLTD HCEADETADL HRGASTSSVG SKNAEKGKKN NPLGFLPTLP PFLGKTKKKS ARNNMASMVV PITRKVRRPK ATRDQDEEEL ALVGDDAKLL LDDDRPTTEL LPNYARPVPV VRTGGTRIGN LIENNPVVFL GIAGGAITVL HFASEAKITM DVDIALLVIF AAFCLGLHTP RPMVGGFDRP PTMKGTFQAD GHRKLMRRSM IVAVPVPGEV REEAEEPDII LGSPLPVFPL GAKIGSHNNC WSEPDPATFQ VRGDKYLQDK KKMASGEFMF PVRGVDLFLT DTCPENVGSN SSVFGGRLRE KPTFLINFRL PWGVLIFYFE IPEKFVPFLQ ACYEDDFDKS TLAELGPMSA ADRTVCRFLM KNMAHKNKTL KIVPVVVAGP WVVKSVVGGK PAIVGNKLPI NYLYAPAKDD KACYLEADLD IVASSAARGI LSVARTYTQD LTIDLGFVIQ GNTEDELPEQ MLVGCRLHGV DPLNAASMPP MKDDLMINSS LSADDDSVTP TLQAE
|
| |