Gene Information |
Locus tag | PHATRDRAFT_37855 |
Symbol | |
ID | 7202655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 235460 |
End bp | 237043 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | |
GC content | 60% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182031 |
Protein GI | 219123437 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTCGG TCAAACGGCG GTCGTCGGTG GTTCCGGTGA GCCACGGAGT GCACGAAAGT AACGGCAACG CCAACGGCAG CCCGCAACAC CACGGAATAT CCCCGCTGTC GGGCATCGCC CTGGCTTCGG GTGGACCGAC GTCCTTCGAC CCTCCGACGG CATCTTTGTT TCCGTCGGCG GACACGACTC ACGGTGGTGA AAAAGGTCGG CTCTGGCGTA GACGGAGGAA ACGTTGGCAA CGAAGACTGC GGATTGGGTA CGGACGTTGG CAACGACTAC TTCTACAAGT TTTTCTATTG CTATTATTGG CAGTTTTCTG TGGTTTCGTG GTACGAGTAT TCGTTTTTCG CAACTCATCT TCAGTGTCGT CAACCGAAAC GGACGCCCTT CCGAGCATTC CCTTTCAAAC CAACTTCCCC GATGCTCCCG TTTGTCACAG TCTGTCCCCG GACGACGTAT CCTATACACT GGTCACGCAG TTGAGTCAGG ATCGCTTGTG GATGATGGAG CACCACTGTC AGCGATGGGG TCCATCCCAT CCCATGTCCA TCGCTGTATT CACCAACCAA ACCGTCGCAG AAGTCCGCTC CCAACTCGTC GCGTTGGGTT GTGCACCGGA GCAGCTCGCC TCCGTCCAAA CGTTGCCGTC CACGGCGGCG GCGGTGTCCG ACTACCCGGT CAACGTCTTG CGCAATCTCG CCTTTCGCGC CGTCACCACT ACCCACATTG TGTACGTGGA CGTGGACTTT TGGCCGTCCG CGGATTTGCA CGCCACGTTG TCCGGGGCCC GCATTCGGCA CGCGCTAGCG CAGAACGAAC GCACCGCCCT GGTCATCCCC GCCTTTCAAC TGCAACGCCA GTGTCGTGCG TGGAAGGAAT GTCCGGATCA AAACGTGCCG GTCATGCCCA CGCACAAGGC CGCCCTCGAA CGACTTTCCC GAAACCGACA GGCCTTCCCG TTCGATCCCA CCAATGTGGG AGGCCACGGG TCCACAAAGT ACCGGGCGTG GATTAAAAGC CAACCCGACG GCGTGCTGTT GGAAATTCCG TGCGTACTGT CGAACCGGTA CGAACCGTAC CTGGTGGTGC GCTACTGCGA CGTCCTCCCG CCCTTTCAGG AAGCGTTTTC CGGCTACGGC AAGAACAAAA TGACGTGGGT CCTGCAACTG TTGCACACGG GATACCGTCT GTTCCAAATT CCGCAATCCT TCGTGACGCA CTATCCGCAT CTGGATTCCC CGTCGCGCAT GGCGTGGAAC GGGGGTCGGG GTGGGGCGCC GTTGCCGAAA CCGCGGGCGG CGGACGGGGC GCCGAACAGA ATGCGTGGCA ATGGTGATAG TGCTGCTGGT ACGGTCGACT GGTTGCGGTA CCGACGGGGC CGTGTCGACC ACGTGTTTGT ACAATTCCGG GAGTGGTTGC GGACGATGGT GACGGACGCC CGGGTGGTCC CGTACTGCGA GTCGGCCGAA GATGATGACG GTCGGTTGTG GATTGATCAC GACACGGATA CACCGCCGGT GCGGAAACGA CTCAACCCTA ACGAGCAAGT GGGCGATGCC GGACTTCCCC GAACCTCTCG ATAG
|
Protein sequence | MPSVKRRSSV VPVSHGVHES NGNANGSPQH HGISPLSGIA LASGGPTSFD PPTASLFPSA DTTHGGEKGR LWRRRRKRWQ RRLRIGYGRW QRLLLQVFLL LLLAVFCGFV VRVFVFRNSS SVSSTETDAL PSIPFQTNFP DAPVCHSLSP DDVSYTLVTQ LSQDRLWMME HHCQRWGPSH PMSIAVFTNQ TVAEVRSQLV ALGCAPEQLA SVQTLPSTAA AVSDYPVNVL RNLAFRAVTT THIVYVDVDF WPSADLHATL SGARIRHALA QNERTALVIP AFQLQRQCRA WKECPDQNVP VMPTHKAALE RLSRNRQAFP FDPTNVGGHG STKYRAWIKS QPDGVLLEIP CVLSNRYEPY LVVRYCDVLP PFQEAFSGYG KNKMTWVLQL LHTGYRLFQI PQSFVTHYPH LDSPSRMAWN GGRGGAPLPK PRAADGAPNR MRGNGDSAAG TVDWLRYRRG RVDHVFVQFR EWLRTMVTDA RVVPYCESAE DDDGRLWIDH DTDTPPVRKR LNPNEQVGDA GLPRTSR
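The coordinates and lengths in the Gene Information table are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive (consistent with the reported 1584 bp) and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical helpers written for this illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes its stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    length = gene_length(235460, 237043)
    print(length)                  # 1584
    print(protein_length(length))  # 527
```

Passing the Gene sequence field to `gc_percent` works without first removing the spaces between the 10-base blocks, since whitespace is stripped before counting.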