Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46492 |
Symbol | |
ID | 7201579 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 501307 |
End bp | 503304 |
Gene Length | 1998 bp |
Protein Length | 563 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181020 |
Protein GI | 219120569 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.481419 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGTTCGCCT GGTTCCCGGC GACTTTCACG GAACATATCC GTTCGACTTT CTCCTTTCGT AGCTGTCTTT ACGATGAACA TGATCAAAGC CATTTTAAGG CTGCTGTTCT TTGCTCTGGC GATTCAGTTT GGATCAGCTA AGATTGTGGA CTTTCGTCTG GACTTTGGAG CGATTCCCTT TGACGAATCA TTCGAAATCT GCCAGTTCAA TTCGGCCTTG TTATCGCGCG TTCTGAAACA CAACGTGTCC GGGAATACTC TGGTGATACC AGAAGGGTAC ACCTTTCACG TACACCACGG TATCATCACA AATGGCCTTC ATGATGCTGT TATTCAGCTC GACGGAGTCC TTCGTTTTGA ACGGGCCGAT TTGGCCATCG ACGAGGGCCC GCCGCCGTTT CCGAGTTGTC TACAAATAGA CAATTCCAGC AACATTACCA TTACCAGCAA GTCCAGTTCC GGGCGTGGCT TGATTGATGG CCGCGGACCG CAGTACTGGG GTGTTCCCAT GATCGGCTAC ATACAGCTTG GCGAAAATCG ACCCCGTTTG CTCTTATTTA ATCGAACAAA GAATCTGTTG ATTGAACGAA TCATTCTTCA GGACTCGCCG TATCATACCC TGTATTTAGA AGGAGCTGAC GGAGTTGTTA TCCGTGATAT CAGTATTGTT GCGCGACGGA CGACAATGGA TGGGCACAAT TGGGTGGACT TGACTGCGTT CAATACAGAC GGGGTGAGTG GAATACGGTT TTCTTACCTT CTTTATGAAC GTCTGTCGAC GTGCCCTTGG GAGTCCACCT TGTGTTGCAT GATGTATTTA CTGCGCTGAT ACGTCGTCAC ATACTGACTA ACACTTTATG CTTGTTCTGC AACCATATAG ATTGACGTGT CTGGCCACAA TGTGCACGTT CACGACGTTG ACATCTGGGT ACAGGACGAT TGTATATCCG TCAAAGACAA CTTCTTTGAC GGTCATCGCT CAACCAATAT GACTTTTGAA CGTTTAAACG CGACTGGACT GGGTTTTGTA ATAGGATCGA TTTTGGGATC AACTGTTAGC AACATTACCT TCAAGGACAG CTACCTGCAC CGTCCGGTTA AGGGCATTTA CATGAAGTTT GCCCGTCCCA ATGCTTGGTG GGTTAAACGT AATCTGACCC GCGGGGTCGT GGAAAACATT GTGTACCAAA ACATAACTAT GGAATCTCCG AGCCAGTGGC CTATTTGGAT CGGCCCAGCG CAACAGGCTG ACGCAAGCAA TCCCTGTCAT CCCAACCCTT GCAGCCTCTG TTGGCCCATG TCACCAACAG CCAAATGCCA CATTGTACAG GAAAGCACGT ATCGGAATAT CACGTTGATA GATATCCAAA TCAACAATCC CAAAATGTCG CCTGGGGTCT TGCTAGGACA CGAAGACAAT AAAATTGATG GGATTGTGTT CGACAACGTG CGTGTTACCA AAGGTCGACC CTTGCCCATG TCCCGTTACA AGCGTGAAAA TACGTTTCCT GGAACTTTGC AACCGATCCA CGATCCGTAC GTACCTGGGA TCGTAACTAC CAATGATCTG GTTGCGACGA AACGTCGCGA GTTGTCCACC GGTTCAGACT TGGTCTGGCC AGGCTCCATG ATGCCTTCCA ATGACGAAGT GAACGAATCA AACGGCTTTT TCTCAAAATG GAATCCGTTT TGGAAACCCA AATGGCAAAA GACCAATCGG TATTACGCTT GTGAAGGGGT CTCTCGCGGA ATTGTAAGAG GCAAATCCTG GCCAGTGCCA TATTGCTTCG AAAAGGAAAG GCCTTCTTGG TCGGAAAGCA TTCTCGTAGT CCTCCGATCT ACTGGCGACC GCAGCATTGT TCTTCTCCTG GCGATTACCT TACTGGCGCT TTTCTATTGC TTCGCAATGT AGAGCACATA TTTTCCTTCA CAGAAGGCAG CTGCTGTTTG CGGTGTTTAC GATTTATATT TTTATAAAGT GTATGATATT CACGTGGT
|
Protein sequence | MNMIKAILRL LFFALAIQFG SAKIVDFRLD FGAIPFDESF EICQFNSALL SRVLKHNVSG NTLVIPEGYT FHVHHGIITN GLHDAVIQLD GVLRFERADL AIDEGPPPFP SCLQIDNSSN ITITSKSSSG RGLIDGRGPQ YWGVPMIGYI QLGENRPRLL LFNRTKNLLI ERIILQDSPY HTLYLEGADG VVIRDISIVA RRTTMDGHNW VDLTAFNTDG IDVSGHNVHV HDVDIWVQDD CISVKDNFFD GHRSTNMTFE RLNATGLGFV IGSILGSTVS NITFKDSYLH RPVKGIYMKF ARPNAWWVKR NLTRGVVENI VYQNITMESP SQWPIWIGPA QQADASNPCH PNPCSLCWPM SPTAKCHIVQ ESTYRNITLI DIQINNPKMS PGVLLGHEDN KIDGIVFDNV RVTKGRPLPM SRYKRENTFP GTLQPIHDPY VPGIVTTNDL VATKRRELST GSDLVWPGSM MPSNDEVNES NGFFSKWNPF WKPKWQKTNR YYACEGVSRG IVRGKSWPVP YCFEKERPSW SESILVVLRS TGDRSIVLLL AITLLALFYC FAM
|
| |