Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50123 |
Symbol | |
ID | 7198833 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 114930 |
End bp | 116574 |
Gene Length | 1645 bp |
Protein Length | 507 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185053 |
Protein GI | 219129768 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.386802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGGGCCCGTC GATCCCACAA TGCTGGCAAC GGCAACGAGC AACCAGAATA CGTACGGCTC CCTGGCCGTG GACCTCGACC GGAATGACGA AGAAGACGAC GACCAGGAAC AAGCCACAAC GAGGCCGCTC TTGCCGCCGC CTTTGCCGCG TCACCGACGA CGCGAGATGG CAATGCGTCG ACAGGGTCAG CAGAGTTCGG TGCTCGACAA GCTCTTTTCC GACGGCAAGA AGACTCAGTC CAAAAGGAGC GTACGAAATA GGATGAATCA AGGACCTGCT CTACCACCGG CATCCTCCAC GATCTTTTCC AATTCCAAGG CCTCCCAGGA CGATACCGTG CTACCGGATC GCCGGAAACA TCAACAACAC CGACATTCTT TTGTGTATAC TTTGTTGAAT CCACATTCCA AGCGAATACA AGCGGTTGCT TATAAACGTT TCATCTCCGT CGTCATTCTC GTCGACTTGC TCTTTTTCAT CACGTCCACC GACGAACACA TCATGGCTAA ACACAAAGAC TTTTTCCACT TCAGTGAAGG CGTGTCCAGT ACCATCTTCC TCATTGAATA TTTCTGCCGT TTGGTGATCA TTACAGAAAG CAAGCGCTAT AAGGCTGCTG GCCCCCTTTT CGGTCGCCTG CAGTACCTGC GCAGCTGGCC CGCCCTGATT GACTTGTTCG CCACCCTACC CTTCTTTCTC GAATACCCCA CGGGCTGGAA CTTGCCCACG CTCACTTACT TGCGCTTTTT CCGACTCTTT CGAATTTTGA AAACCGAAGG ATACATCCGA GCCATGGACG CCATCTACCG TGTGGTCTAC TACAATTCCG AAATTCTCTA CGTCGCGGCC TTGCTCTGCA TTTTCCTCAT TCTCGTCACG GCCGTCTTGT TGTACTACCT CCGCCCCGCC AACAAGGAAG ACGCGGAAGA CTTTGGATCC ATCGCGGCAA CCTTTTATAT GAGTACGCTC ATGCTGACTG GTCAAGGCGG ACCAAGCGGA GAATTGCCCT GGTACACGAA GAGTATTGTG CTCCTCACGT CCGTATTCTC TGTCGCCATG TTTGGTGCGT GTGCAACCGT GCCGAGCCCG GCCGTAGACC AAGGGGCGAC ACTATCTTGA AATACAAACA CTCACACTGC TTTTTTCCCC TTCGCTGACA CGACAGCCAT TCCGGCCAGT ATGCTGACGT GGGGATTCGA AGCCGAAGCC GAACGCATGG CCAAACAAGC GCACAAACGA TTACTGCAAC GGCGCTCGAC GCACGGTTCG TCGACGACGT CCACGGACGA TTACGAATCA GAGAGTCCGG ACGACGATTC TACGGACGAG GAGTACTTTC GCATCATTGC GGGGGCCGAC GAAGAAGGTG ATGGCGAAAA CGATACACCG TGGATGAAAG AAATGAGGGA ACGATTTATC AAAGCTGACG GGAACATGGA CGGAACCTTG ACCCTCAAAG AGTTTTTTCA ACTCCAAGCG TCCATGGGGG AGGACCCTCA CGATATCACG TCGTCTCAAA CGGCACTGGT CAATGCGAGT ATGATGTCAC GTATGGAAGC TCTGGAAAGC ACGGTCGCAG CCAACGCGCT CAAGCTTGAT CAGATTTTGG CGGTCTTGTC CGACTCTACG AAAGGACGTC GGTAG
|
Protein sequence | MLATATSNQN TYGSLAVDLD RNDEEDDDQE QATTRPLLPP PLPRHRRREM AMRRQGQQSS VLDKLFSDGK KTQSKRSVRN RMNQGPALPP ASSTIFSNSK ASQDDTVLPD RRKHQQHRHS FVYTLLNPHS KRIQAVAYKR FISVVILVDL LFFITSTDEH IMAKHKDFFH FSEGVSSTIF LIEYFCRLVI ITESKRYKAA GPLFGRLQYL RSWPALIDLF ATLPFFLEYP TGWNLPTLTY LRFFRLFRIL KTEGYIRAMD AIYRVVYYNS EILYVAALLC IFLILVTAVL LYYLRPANKE DAEDFGSIAA TFYMSTLMLT GQGGPSGELP WYTKSIVLLT SVFSVAMFAI PASMLTWGFE AEAERMAKQA HKRLLQRRST HGSSTTSTDD YESESPDDDS TDEEYFRIIA GADEEGDGEN DTPWMKEMRE RFIKADGNMD GTLTLKEFFQ LQASMGEDPH DITSSQTALV NASMMSRMEA LESTVAANAL KLDQILAVLS DSTKGRR
|
| |