Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46434 |
Symbol | |
ID | 7201538 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 326808 |
End bp | 329098 |
Gene Length | 2291 bp |
Protein Length | 631 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180803 |
Protein GI | 219120115 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATCACTTC CACTGAAAGC AAGGCGTTGC GCCTGTACGT CAGACCTTTC CGGAACAGCC TATTGGGAGG ATTCTAGCGA TTGTTGCTCG AGTCCTCTGT TGTTCGTATT CCTACATGTT TCCTTGCTTT GCAAGATTCC CGTTGGTGAA TTCTGTGTAT TTGTTGGTGT CCCGAAAAAA GAGACTCGGT CGGTAATCGT TGTGCTGGTG TAGATCCTTT GTTTCCCACC CGTCATGACC CTACTGGGCA CGACTGCCAC TGAGCCTTCC CGCTCGGTGG CACGACCTTT GGCCAACCTA GGGAATACCT GTTATATGAA TTGTGTACTG CAATCGTTAG CCCATTGTCC GGAACTTTGT TTGGCTATGG ATACGCAGCC ACACCGTTTG ACTTGCCCCG TAGCCACCGA AAACGCCATC CTTTCGCGAT CTGCTTCGCC GTCGTCGTCG CCCGACGACA GTCTCACGGG CAACAAAGAA CCGCGCAAGA CGGCAACGCG GAAATCGCGG CGTTCCGGCC GCAAGACGCC CCCCGACGAC GAAGACACGG CTTCGTCCTC GGGTTGGCAG TTTTGTGCGC TCTGTGAAGT CGAAAAGCAC CTACAGAGCG TGCACGATTC CGTGCACAAG GACAAGCCCG TGGCACCGTC CACCTTTGTC GAAGGCTTTA TCGAGCACGT GGCGCCGTGG TTCAAGTTGG GGGTGCAAGA AGACTCGCAC GAATTTCTCC GTCTCTTGAT TGACGCTATG CAAACCTCCT GCCAACAAGC TCGAGCCCTA CCCCATGCGG ATATTTCCCG AGACGGCAGT CCACAAGGAA TCAAGCCGGA CCACAGGAAT CAAGACTATG AGTATCCCTT CTCTCTCTTT CGTGGCACCG TCCAGTCCAA CGTAACTTGC AGCTCTTGCC AGGCTTCCAG CTCGACATTG GATCCCATTG AAGATATAGG TCTCGAAGTT ACGCTCCCTT CTCACGTCTC GCCAGGTGCC AACGATCGAT CTTCGCGCAA CAGCTCTCCC GTCCCGGCTC CAGCGCTGGC GGACGTGCAA GCCGCCTTTC AACGATTTGC GCGGGCCGAA GCCTTGGATT CGGGCTACAA ATGCGAGAAG TGTGGCAAGG TTGGCCGCGC CACCAAGCAA TCTTGCCTGT CGTCCATCCC GCCCATTTTA ACACTCCACC TCAAGCGCTT CCGATACGGC GACTCCCGTA CGACCAGTAC TGCAGCAGCC GGCACCACTG GTCGTCGGAC TGGCCGATCC GAAATTAATC AACTCCTCGG CAGCAACGCC GATTTCCTCG CCGGCAAGTC TGGCTCCGCC AAGATTGAAG GACACGTCAA GTTTGATCTC TTCTTCGACC TCAAGCCGTA CTTGACCGCT GAACTTAAGA AACAGCACTC CAATATGTTT TGTCGTCTGT TTGCCGTGAT TGTGCACGCG GGCAAGAACT CGCACTCGGG TCACTACATT GCCTACGTCC GTAGTCTGGC CCGCAACGAA TGGTGGAAAA TGGACGACGG CCGTGTGACG GCAGCTAGTG AGCAAGAAGT GTTAATGGCG GAAGCCTACA TGCTCTTTTA TCGCGTCGTC CAACACCCGA TTGCGGTGCA ATTGGAAGAA AAATGCAAGG CCAAGTTTCA AAATCGTCAC GAGCGAGAGC ACGTCCAGGC TTCGATCGCC GCGACCTCCA TTGAACCGGA ACAATCCAGC ATACGAGGCT CGAGTCGCAA GCGTGCCGCT CCCGCGTTTG AGGACGGTGA GGAATGGGCG CGGGCCAAGA CGAGCATCCC TCCGCGCCTG CTAGGTTTGA TTCGCAAGGC ACAAGAAATG GTGGCGGACG ACATCCAACT TTCTCCGGAC TATTTCAAGA TGATCTCGGA AGAAGCTGCC AAGGAGAATG CCGCAATCGG CAAAGGCCCG AGCAAGAGCA TCTGCGGTAA GTTCACTTTC CCAGGTTTCT GGGCCGCGGA CGCCGTCTAC TTTATTCTCA ACACCTCGTT TCCCCATGCA GAGGACGATG TACTGGGCGG AGCCGAATCT TTTCGTCGTA AATTGTGCGA CCTCTTTTAC AAGCTAGCCA AGCACTTTGA TGGCGATGGC AGTGGGAACG GTACTTCCTG GCTCTGTCGG GACAAGTCTG ACCGGGCCAA CGATGACGCG AAACGGGTTG CCGTGGTGGA GCCCGAAGAC GTTGATTTGC TTTGAGAATG AAAATTTCGA AAATACCCTC ACCAGAATAA TGAAATCAAA ATAGACACAA AGAAACAAAT AAATATAACT GTGTACAAAA C
|
Protein sequence | MTLLGTTATE PSRSVARPLA NLGNTCYMNC VLQSLAHCPE LCLAMDTQPH RLTCPVATEN AILSRSASPS SSPDDSLTGN KEPRKTATRK SRRSGRKTPP DDEDTASSSG WQFCALCEVE KHLQSVHDSV HKDKPVAPST FVEGFIEHVA PWFKLGVQED SHEFLRLLID AMQTSCQQAR ALPHADISRD GSPQGIKPDH RNQDYEYPFS LFRGTVQSNV TCSSCQASSS TLDPIEDIGL EVTLPSHVSP GANDRSSRNS SPVPAPALAD VQAAFQRFAR AEALDSGYKC EKCGKVGRAT KQSCLSSIPP ILTLHLKRFR YGDSRTTSTA AAGTTGRRTG RSEINQLLGS NADFLAGKSG SAKIEGHVKF DLFFDLKPYL TAELKKQHSN MFCRLFAVIV HAGKNSHSGH YIAYVRSLAR NEWWKMDDGR VTAASEQEVL MAEAYMLFYR VVQHPIAVQL EEKCKAKFQN RHEREHVQAS IAATSIEPEQ SSIRGSSRKR AAPAFEDGEE WARAKTSIPP RLLGLIRKAQ EMVADDIQLS PDYFKMISEE AAKENAAIGK GPSKSICEDD VLGGAESFRR KLCDLFYKLA KHFDGDGSGN GTSWLCRDKS DRANDDAKRV AVVEPEDVDL L
|
| |