Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47117 |
Symbol | |
ID | 7202030 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 506761 |
End bp | 509063 |
Gene Length | 2303 bp |
Protein Length | 743 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181218 |
Protein GI | 219121739 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGCTG ATGCGACCTT TACGATACTC AACAGTTGCA GGAGTTGGAT ATCGAAATTT TTGTTAGTTA AACGTCTGCC CTGGCGTCTC TTGTGTTTAT CTGCATCGCT CTTTTCGCTG TATATTCTCT TGCTCCGAAA ATCTTTATTC GGCGATGGTC CCACCTTGCA GGGAGGTTCG ATAGGTCGGA AATTCGACAA TTCCCTCAAG CGCCTTCTAT TTCGAAGACG AGCTCGAATG TATCGGCAGC GCTTCAAAGC CGAATTTGGT TTCGATTTAC CCCGAGTATC GGAAAAGTCT GCAAATTCTA TCGGAGTTCC TGTTTTGCTC AATACATGGC TTGCTGAATC ACTCTTTCTG GCCAAACATC GCAAAAACTC TCGGATGACT GCAGAATTGT CTTTGAGTGC GGTGGATTCG GAGACGCACG TTTTTCGACT GATGCAATTT GATGTGAGTA GCTGGTACAG GCCTCTCTCT TCCTGGCAAG GAAACAGCGC AGTGATTTGG AATGAGATTG AAATCGAGCA ATGGATTGAC ATCAAGTTTG CAAGCTTTCA AGGTATCGGC GCGATGTGGT CAAAGCACGA GCGAATTCAA TTCTGGGGTA TGTGCGCAGT ATATTTTTAT GGTGGAGTTT TCGTGGATCA CAATATACGC TCGAGGACAG CTCTCCCTAT TTTTAAGGCG GCTTTCGGCC AGGATCGCTT TTGGCACCAG CTTAACGAGG ACGGTTCCCT TCATTATCTT GCCTCAACAC CCAAGCACCC AAAGCTCGAG TGTATTTTGG ACGAAATCTT GACTCGTCGG AACGAGCGTG GGGCAAACTC TATTGCTTGG TCACATGTAA CACAATTACT ACAGCTACAT ATCTGGACAG GTTTTGATAA ATACAAGCCC GCGTGCTGTC CTATAGTTTA CGAACAGAGG AACTGGTCCC TATCTTCGAC ATCGAAGCAA GATTTCCTCC CAGTTAATGA AGAGATGGTT GTCGCACCGT CAGCCCTAGT CAAACGCTTC GATGTATCAG TTCAGGAGCA GCCATCCACG AGGTCGGTAA TTGGGATTCC GAAAGAACCG TGGAGCAAGG TGCTCAACGA CAATCAATGC TCGCCGGGAT GGCTGTGCAA CCGTTGTCTA CGCTTCCCAT GGTTTGGGAG CTTCTCCAAT TGTAAATCTG TATGCCGCTC TTGTTACACA GATCAAATAT GTGCGGCCAA CGACATTGAC CTAACAGATG AGATTGTAGT TGAAGTCATT GTCCGCGAGC GTCCTGGCAA CCATACCAAT CGTATTCCTC GGATCATACA TCAAACGTGG TTTGAAGAAC TACATACAGC GCGGTACCCT CATCTGCAGC GATTGCAAAA TTCATGGAAA GCATCGGGAT GGGATTATCG TTTCTACACC GACGAAGACG CTCGGATGTT CATACAGAAG AATTTTGCTA AAAGGTTCAC TTCTGCGTAT GATGCCATTA TTCCGGGGGC GTTCAAGGCC GACTTTTTTA GACTACTTGT ATTGCTAAAA TACGGTGGTA TTTACAGCGA TTTCGACGTG CAACTCGATA CTAACTTGGA CTACTTTGTC ACTAAAGACC TTTCATTTTT TGTTCCGAGA GACGTTGCAA TCGATCATTG GGCTGGGGGG AATTACTGTG TTTGGAACGG TCTTATTGGT GAGTTTTTAG GCCTTCCTCG ACGACTGTAC TGGGCTTATC CTGCTAACCG TTTGCCTTGG GATTTGTAGG GGCAGCTCCC GGTCATCCAA TCGTTGCACA GGCGGTCGAG GATATTTTGA ATCGCATTTC GAGGAGGGAG GACTATCTTG ACATAGAAAG CAGTCTTTGT CGTGGAAACC TTGACGCTGA AATATGGAAA CTCCGAAGCT TCCCCATCCT CCTTGTGACA GGTCCTTGCG CCTTGGGAAT ATCGCTGAAC AAAGTCTTGG GCCATCACAA CCTGGTCAAT GAGATCCTTC CTGGATGGAT GATTTTCTCG CAACATATGA CGGAGGACAA AGCTGAAATG AGTGATAATT GGGGAGATAT TCTGATCTTG CATACCGATC GACACGATTT AGGGGAGCTA CGCTTCTCCG ATCTTGGGAG AAACTTGCTT GTTGCTTCGT CAAATCAAGA CTATTTCGCT AGATCTGCAG TCCTTTTTGA GGCCGATCCA CAAAAGATGC CTCAGCATTA CAGCAAAAGT GAGAGTGATA TAGTGGGTTC AACAGCGACT TACAAAGATG ATAAGGTTTC CAAGGAACGA GTTGTCGTAA AGGTCACATT CACAGTGAGG TGA
|
Protein sequence | MLADATFTIL NSCRSWISKF LLVKRLPWRL LCLSASLFSL YILLLRKSLF GDGPTLQGGS IGRKFDNSLK RLLFRRRARM YRQRFKAEFG FDLPRVSEKS ANSIGVPVLL NTWLAESLFL AKHRKNSRMT AELSLSAVDS ETHVFRLMQF DVSSWYRPLS SWQGNSAVIW NEIEIEQWID IKFASFQGIG AMWSKHERIQ FWGMCAVYFY GGVFVDHNIR SRTALPIFKA AFGQDRFWHQ LNEDGSLHYL ASTPKHPKLE CILDEILTRR NERGANSIAW SHVTQLLQLH IWTGFDKYKP ACCPIVYEQR NWSLSSTSKQ DFLPVNEEMV VAPSALVKRF DVSVQEQPST RSVIGIPKEP WSKVLNDNQC SPGWLCNRCL RFPWFGSFSN CKSVCRSCYT DQICAANDID LTDEIVVEVI VRERPGNHTN RIPRIIHQTW FEELHTARYP HLQRLQNSWK ASGWDYRFYT DEDARMFIQK NFAKRFTSAY DAIIPGAFKA DFFRLLVLLK YGGIYSDFDV QLDTNLDYFV TKDLSFFVPR DVAIDHWAGG NYCVWNGLIG AAPGHPIVAQ AVEDILNRIS RREDYLDIES SLCRGNLDAE IWKLRSFPIL LVTGPCALGI SLNKVLGHHN LVNEILPGWM IFSQHMTEDK AEMSDNWGDI LILHTDRHDL GELRFSDLGR NLLVASSNQD YFARSAVLFE ADPQKMPQHY SKSESDIVGS TATYKDDKVS KERVVVKVTF TVR
|
| |