Gene Information |
Locus tag | PHATRDRAFT_55097 |
Symbol | |
ID | 7198341 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 167199 |
End bp | 169349 |
Gene Length | 2151 bp |
Protein Length | 454 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184577 |
Protein GI | 219128768 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCTCACCAG AACCTGGAGC TCTCACCAAT AATGCTTTCA GATAGACAAC TTTACTGTTA AGGCTACGTA CTGCCCGGTC CGGTCCGGTT GCGAATTCCT ATACGTGCCG AGAACACAAC TCTAGTAGTA GGGCCATTTT TCCTAGACAG CTTTCACAAG AAACTGGAGC TCTCACCAAT GCTGCTTTCG GATAAATAAC TTCACTGTTA AGGCTATTTA CCGCCCATTC CGATTGCGAA TTCCCTATAC GTGCCTCAAA CCGTGGATCT TTTGAAGGTG CAGATGCCAG TTCACTGTTA GCGTGTATGA TTTGGTTCGC TGTTAACGCA AGCAGTTTCT ATACCAGTGG TCGGTGAAGT TTTCTTGTTT GCAAACCACT TTTCACTTCT TGACACACTT TCCGCATCCA AACTTTTGAG CTGTCGACCA AGATCATGAA ATCTCTCAGC CTTGTCGCTA TCTTGGCCGC AGGAGTATTC GTCCCCGGCT ACGACGCGCT CAAGCCTTCA AAGTGTGGCG GAAAGCTTAC GAGTCCTTGC CTCAGTGCAT CCGATACAAG GTACGATGCA AACTTTCCCA AATCAATCAC TCTGCAAAAT CCGGCATGGA AGCAATTCGA GGGCTTGTGG AAGACGACCT CAATCAATTT CCAAGGAAAC GGAATCGTGG CGCAGCCTCA ACCACATATT CCTGCATTGA AATACGCGAC TCTTCCGTAC ACTCTGAATG AGGTTGTAAC CTTCTACAAT CATACCATTG TCGGATCGAG AATGTCGCTT TATGCATATT TTTTCTATTC CCCGGCCCCA GAATCATTCT GCAACCAAAC ATTCAATCCG CCTTTCGAAA ATGTTATTGG GTCAGGAGTT TGCGGTGTGA ATGGATTTAC CACTGCCGTT GCCCAGTTTG GGACAAGTAC CCACGAAAAT CAAGGTGAGC AAAAACAGTA TGCAAGCTTT GGGTTCCCTG GCGCTACAAG TCCCTCTCAC ATGTCCCATA TTTAACTTGC TGATCAGGCG ACGTTGATTT TTTTCGTCTT CGGACTTCTG CTGCTCTCGG TCCGGTCACA ATTGATTTCG ATTCAGGACT ATTCACTTGG ATTGATTCCA ACTCTTTGCT TGCAACGAAT ACACTTGATG GCCTTTTCAG TCAAAGCAAT CCATACACCT TCCTTGACAA CAGCTCAGCC TTTGTCAACT TCAATGTAAT TGATCTTGTA AGGAGGACTA GAGACACCAA TGCCCTTGCT CAGATGACTC GAATGGAGGA GAGTGAGTGG CTCGCGGCTA TCGAGGAAGC GTACCAAGAT GTCAACATTG CAGCTGCAGA CAAAATCCCT GTTCCCTTCC AGACTTCTTC TTCGGATCCA GAATGGTATC CAACAGAAGA TGAATGGTGC GGTGGAGTTG GTAACGACCC AGAGTGCACT GTATCCCCAT ATCAAGAACC AGATGCCAAG TTAAAATCTA GTGCTTTGGT AGGGTTCGTT ATCCTTGGCC TCGCTGTGTT CTGCATTCCT TTGTATGCAC TATACCGATA CCGAATTGGC CAACAGGAAC GCCGCATCAA GGACAAATTC ATTCGAGGTA TTGCCAAAAA CATGTCCATT GCTCCTAGCG CTGGGGCGAT ATCCCGCGAC AAATTGGTGG AAGAATTCCA GCGCATTGAT AAAGACAAGG GCGGGACCAT TGAGAAAGCC GGTAAGTCAC GGATAATGAA CAACTTTGAC GATATTCCCT GTGACAAAAG TAGGATGCAA CTTACCCAGT ACTCTCTACA GAACTCAAGG ACTGGATAGA 
CGAGGGGAAG TTGGGAACAA TTTCAGACGC TGATTTTAAT GCATTGTGGA GTGCTTTGGA TAGGGACGGT TCCGGTAATA TTGATTTCAT GGAGTTCTGC ACTTTCCTCA GTGGTTGCAG CGAGGCGTTC GACAATGTTT ACGACGAGCA GCAGAAAATG TGAGTTTCCT CTCTTAGGGA GGGACGCCTT CTCTTGTCAA CTTTTATCAA TGGTGAATCA AAGGACCATC AACTCTAGGT CTTCATTAGG AGGGTAAAAC GAAGAAGAGC AATTTATGTA TGAGTGAACA TGGAAGGATG TCAAAAAAAC CTAATTTAAA GTCTGTAATA ACGAAATTGA GAACTAGTAG T
|
Protein sequence | MKSLSLVAIL AAGVFVPGYD ALKPSKCGGK LTSPCLSASD TRYDANFPKS ITLQNPAWKQ FEGLWKTTSI NFQGNGIVAQ PQPHIPALKY ATLPYTLNEV VTFYNHTIVG SRMSLYAYFF YSPAPESFCN QTFNPPFENV IGSGVCGVNG FTTAVAQFGT STHENQGDVD FFRLRTSAAL GPVTIDFDSG LFTWIDSNSL LATNTLDGLF SQSNPYTFLD NSSAFVNFNV IDLVRRTRDT NALAQMTRME ESEWLAAIEE AYQDVNIAAA DKIPVPFQTS SSDPEWYPTE DEWCGGVGND PECTVSPYQE PDAKLKSSAL VGFVILGLAV FCIPLYALYR YRIGQQERRI KDKFIRGIAK NMSIAPSAGA ISRDKLVEEF QRIDKDKGGT IEKAELKDWI DEGKLGTISD ADFNALWSAL DRDGSGNIDF MEFCTFLSGC SEAFDNVYDE QQKM