Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50618 |
Symbol | |
ID | 7199433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011700 |
Strand | + |
Start bp | 137207 |
End bp | 139259 |
Gene Length | 2053 bp |
Protein Length | 598 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185567 |
Protein GI | 219130849 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0981009 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGAACTACGG ATGACCCACA TCTTCACTAG AAGGCCTTTC TCACTCTCCC ATCTATCCAG GGGTCCTCAT TCGTGAATCC CCTTAACCAC GTATACCTGC AATTGAGATC TATATCCGCG GAAAGGTACA TCACATGTTG ACAGTGAACA ACACTTGATA TGAAAGAAAA AAATGACCAC TCACCAAAAA CTTCGAACCA TGAGTTGGAG TCTAACGAGG AAACGGGAGT GAATAGTCCT GCGGAGTCGG ACGCCTCCTC TCCTTCGTGT GGTGAACAGC CAAAGCAGCC ACGATCGTCA CCAGAATCGT CTAAGGTGCT ACGAAATGTG CCCGACAAAG GACCACAGGA GGCCTCCACG GATGGGAAGC CTATTGCCAC TACGGCAAAA AATGAAATGT TTGCAGAACC CGCTCAAGAT ATTCCATTCC CAACGGAACT GCAGCGAACT GCGCAGAGTA CACCAGGGGC TTTTCCGCAA ATTGGAATGG GAAGTACGAG AAGCAGCGAG ACGCCAGTGA CCCCGCACTC CGCTCCGACC CGCGATGGTT CCCAGCGTTC CGGATCGGCG GAAGAGGATT TTTCCAGTGC CGGCGAACCG CCGCTGTTGG CGGCGACTCT GGTGGAAGAT ACCCCCGAGC GAAGTGATAC GACAGGGACT ACCGGGCCCA ACGAAGCAGA ATCGGACAAA CTGATGGTCG TGGCTGTGGC CGAGCCTGTG GAGCAACGAA CCGATACAGA GAAGGCAGCG TTTTCTCGAG GAGCCTCTTC GAAACGTTGG CGCATGTTGT ATGGAGCATT GCTGATAGCC GGGGTAGTCG CGCTGGCCGT GGGGGTGACC CTGGCATTGA CGATACAAAC GTCATCGGGG TCATCAACAA CATCGTCTTC GTCAACAGCA CTCGGGAGCC CCAACGGCAC CAATGCTTCC GTGCCGGATG ATACCAATAC CCCCCTTCTC AACGATACCA ATACCAACGA CTTCAACGAG GACGTTGAGC AACCCATTCC CGGCAATATT CCATCTAGCG CGCCGGGAGG ACCCGTCGTA CAGAGCACCC ACCTGGTAGC TTGGGAAAGT AGTTGCATGG ACGAAAAGGT AACTGCAGAA ATCCAATGTG GAACTAATGA AATTTTGTCG GTCCAAGAAG CAAGAATCGC AGAGTGTACG ATTCAATCGG ATCAAGGCTC GAACAGGGCG ATTTGCCTGC CTTTAACTAT GAATGGTACC ATTGAAAGCC TTGAGAGTGC CAATGGCAAT GATGTACGAG AGATAGCCAT GGTTCTGTTT CAATGTACCG GAGTGGAAGA TCGTCCCAAC GGTCCAGTCC GAGCGCGCTT GGTCCCAACC AGAAATGTTG ATACAACTGC TCCTGCAAAC ATGGACTGGC TCTGTGTCAG ATTTAACGAG AATCTCCAGT CGGCCATTGC CGATACAGTT CCTGCCACCG CTATGGCTGA CTTTGAAGTC GAGCTTTATA TAGTATTAGG TGACTTTTGT CTGGAAAGAC GCAGTCAAGG CGACATCAAC GCTGTCCGCC ACCGGCTTCG TACCAATACG CGTCGACAAC TTGTCGAAAG CAATGTTAGT GACGTGGAAG TCGAGAACGC CACCGGAGAT GACACGAGTG AGGGCTGGAG ACTACAAAGT TTCGTACCGG GTTCGTGCCA GCTCCCCGCC CGCAATCTGT TGATGGACGC GGAAGCGACC GGTGCTGCCG AGAACGAGTG GCTCCAGGTG TGCTACGCAT TCGACAATGA CGATACTGGA GACGACGATG ACGACGGCAA TGATGCGAAC GAAGACGAGC TGGTCGTAGT GCACGCGTCG GTCTCCTGTC GACACGAGAT TGGTCTTGGT CCCAGCAGGG CGGCGGTATG GAACGAATTC GAACGACGCA TTCGTCCCCC ATTGATTTGG CGAGAATTGT GGAGCACAAC CATGACTGGA GTGTAGATTC TTCACGGCGT TTGGCACGTG CGTGGTTGCG GGAATTTTGG TAGATATTGG ATACGTGAGG AACGGGCGTA TACAGACTGT ACCGACAACC ATT
|
Protein sequence | MKEKNDHSPK TSNHELESNE ETGVNSPAES DASSPSCGEQ PKQPRSSPES SKVLRNVPDK GPQEASTDGK PIATTAKNEM FAEPAQDIPF PTELQRTAQS TPGAFPQIGM GSTRSSETPV TPHSAPTRDG SQRSGSAEED FSSAGEPPLL AATLVEDTPE RSDTTGTTGP NEAESDKLMV VAVAEPVEQR TDTEKAAFSR GASSKRWRML YGALLIAGVV ALAVGVTLAL TIQTSSGSST TSSSSTALGS PNGTNASVPD DTNTPLLNDT NTNDFNEDVE QPIPGNIPSS APGGPVVQST HLVAWESSCM DEKVTAEIQC GTNEILSVQE ARIAECTIQS DQGSNRAICL PLTMNGTIES LESANGNDVR EIAMVLFQCT GVEDRPNGPV RARLVPTRNV DTTAPANMDW LCVRFNENLQ SAIADTVPAT AMADFEVELY IVLGDFCLER RSQGDINAVR HRLRTNTRRQ LVESNVSDVE VENATGDDTS EGWRLQSFVP GSCQLPARNL LMDAEATGAA ENEWLQVCYA FDNDDTGDDD DDGNDANEDE LVVVHASVSC RHEIGLGPSR AAVWNEFERR IRPPLIWREL WSTTMTGV
|
| |