Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45934 |
Symbol | |
ID | 7201144 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 711751 |
End bp | 713857 |
Gene Length | 2107 bp |
Protein Length | 627 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180298 |
Protein GI | 219119063 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACGGGAACAC TACCATCAAC ACTATCACTT CCACTACCAA CAACAACCCC ACGGGCGATC CCGACTCCCA GGACAGCCAA GTCGCGCAAC AGGACGAAGG CAACGATCAC GCCGAACCAG ATCCCGTCGT GGAGCCCCCG TCGGACCCCA GTATGCCCCA ACAAGCCGAC GCCGATGGCG CCGGCCCGTT GACTCCGTCC AGGCCCGAGT CTTCCTCGTC ACTGCCCGCA CAGTCGGCGT CGTTGGGTTC GGACAATACT CGCCGCACAC GCATCCGTCG CATACTGGAA CATCGCCGGT TGTTGTTGCA GCGTCTGCGC CAGAGTCGCG CCGCCGCGAA CAAGCGACTG GAACAATCGC GGACGGATCA TCCGGAATCG AAGGATGTCT CGGACGAACA GGAATTGGCA GACTTTCGCG ATATGGTGCG AGAAGCGACG GCGCTAGCCC GCAAAGTCTC CAAAATGGAC GCTGACGGAT CGACGGGAGA AAAACGCACC TCCGTGTCCC TCCGCAAAGG GTCCAGTGTG GGAAAACGCA TGAACGCTGC CCTTTCTTCG CTCGTACCGG GAGCGGCGAC GGCCGCTGCT GCCGCGTCCG CTGCCTTACC GCAGTCCTCC CACGTTACCT CCTCGCCCAA CCGAACATCC TCTTCGGCCG CTCCAGGTCG AACCCTTCCC ACACAGGCGT CCTCTACCGC GACAGCACAA CCCTCGTCCG CGTCCGGTCC CGGGAGTCGT CCGCCTATAT CAAAACCTCC CAAAACGGCT GCCGGCCGAA CCATGCTGCC ATCCGGGACG CAGCCACAGC ACACCGGCAT AAGTGGTCCG ACTCTACCCC CGAATCGACT CGCACAACAA CAACAACAAC AACAACAACA TCCGTCGCCC CCTGTACCGT CGGTTGTTTG TCCCGAAGCG GAACTATTGC GAAAAAGACG CAACGAAATT CGCCACAAAC TCATTCACTT GGCCAACACT CGTCGAGATA AAGTCGCCAA TGTGTCCAGC AGCGTCACCG GCGGTACATC AACTAAAGCA TCATCATTGC CTTCCTTGTC GCACTCGGAA CGCATTGCTG CCGTCCAAGG CCCCGGACGG CCGGTTAAGC TGCCCCGGCG GAGGCAGACA CACTGGGACT ATTTGCTCGA GGAGATGCGA TGGTTGGCCA CAGACTTTCG GGAAGAGCGC AAATGGAAAA CGGCGACCGC CCGTCTAGTA GGCGACGCCG TCGCAACGCG TGACGATCCC GTCCAGGGAA AAATGCCGCT TGTACAGATT TCTCCTTTGA CGGCCGCCGG ACCCGGTCTT GGTACTTCGA CTGGAATTTC GATAGATGCG ATCGATCAGG CGACTTCATC CAGAGGTAAC GGCGCGAGTG AGAAACGCAC ACGCTTTTTG GAAATGATAT CGGCCGAGGA AGAGACGTCG GCCCGACAAG TGGCATCTAT TGTTTCTAAT ATGGTAGCCG AACTCTGCAC CGCAACGGTG GAGTTTGCTG GGAACACGGG TGTCAACGCG CTTGCCAAAG CTCTCCGTCG ACACCAAATC ACCAGGAGCC GTCTGGAAGG CAGAGCCACC GTCGAGACCG TCCAAGCCAT GCAAATGAAC CAAGGTGGGT CTGATAGCGA GACGGCACTT GTCCTTGACG GCACGACGAA CACAGGTGTC GAGACGGATG TCAATCAGCT AGCCGTGGAA TCGAAAGCGG AAGAGTTTCA GCGAATGTCT AAAGCAGTGG ACGAGCTACT GAATAATGTG CGACAACTTC CGGAAACCAA AGCAAAGGCC TCCTCGACAA AACTTAAGGG ATTGGATCTG GAATTGACCG TTGCACAGGG GAAAATGGGG GACGACATTG AGGCGAAATG GAAGCTGGAT GCGGGGTCAG TGTTGCGGGG TCCCTTGGCT TCCGGAAAAA CTATAACTAC TTGCTCGCTG CTTTGGAGGC ATCGGCGAAG TGGACCACAG TTGGTTGTGT GTTCGTCGGC AAAATTGGTA AGGGCGGCGG ATGATTTCTT CTGTGGGTGC AATTATACGA TGGACCTCAC GACTTTCTTT TCTATGAAGA TTCGGTGGCT GCACGAACTC GGAAGCTTTC AAGGTAT
|
Protein sequence | MPQQADADGA GPLTPSRPES SSSLPAQSAS LGSDNTRRTR IRRILEHRRL LLQRLRQSRA AANKRLEQSR TDHPESKDVS DEQELADFRD MVREATALAR KVSKMDADGS TGEKRTSVSL RKGSSVGKRM NAALSSLVPG AATAAAAASA ALPQSSHVTS SPNRTSSSAA PGRTLPTQAS STATAQPSSA SGPGSRPPIS KPPKTAAGRT MLPSGTQPQH TGISGPTLPP NRLAQQQQQQ QQHPSPPVPS VVCPEAELLR KRRNEIRHKL IHLANTRRDK VANVSSSVTG GTSTKASSLP SLSHSERIAA VQGPGRPVKL PRRRQTHWDY LLEEMRWLAT DFREERKWKT ATARLVGDAV ATRDDPVQGK MPLVQISPLT AAGPGLGTST GISIDAIDQA TSSRGNGASE KRTRFLEMIS AEEETSARQV ASIVSNMVAE LCTATVEFAG NTGVNALAKA LRRHQITRSR LEGRATVETV QAMQMNQGGS DSETALVLDG TTNTGVETDV NQLAVESKAE EFQRMSKAVD ELLNNVRQLP ETKAKASSTK LKGLDLELTV AQGKMGDDIE AKWKLDAGSV LRGPLASGKT ITTCSLLWRH RRSGPQLVVC SSAKLIRWLH ELGSFQG
|
| |