Gene Information |
Locus tag | PHATRDRAFT_49639 |
Symbol | |
ID | 7198272 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 276869 |
End bp | 278809 |
Gene Length | 1941 bp |
Protein Length | 435 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184435 |
Protein GI | 219128469 |
COG category | |
COG ID | |
TIGRFAM ID | |
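The coordinate and length fields above are related by simple arithmetic, sketched here in Python. The function names `gene_length` and `gc_content` are hypothetical helpers, not part of any database API, and the sketch assumes 1-based inclusive coordinates, which is consistent with the Start bp, End bp, and Gene Length values in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length = gene_length(276869, 278809)
print(length)  # 1941

# A 435 aa protein needs 435 codons plus one stop codon.
coding_bp = (435 + 1) * 3
print(coding_bp)  # 1308
```

The coding portion (1308 bp) is shorter than the 1941 bp gene, which fits the "Is gene spliced | Yes" field: the remainder would be non-coding sequence such as introns or untranslated regions.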
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCCTTTTTT CAATACCAAT CGGATTGGTC TTCTTGTTAG TCCTCGTCGT GTATACCTTT TTTGGTACCA AACGCGAATC CTCCACAAGA TCTTTCCACA CTACCGCTTC CATCACTTCT TTGCCAGACT TGTGTTCTCG TTGTGGATTC GTAGCTAGCT CACTGTGTAT CTTTTGCGGA CTCTACGCTT CTTGCAATGA AACTTCAGCT CGCTTCTCTT GCTCTGTGGG CGTCATCTGC TCTCGCGTTT GCGCCTAACT CTCTACCTTC TCGGAACAAT CGTGCGGTAG GGTATGTTCC TTCCTCGATC TCCTCGCTGC ACGCTGCGGC GTTGGAACCT CCACCGCTGT CCCCGCTCAC ACAATGGGGC GATCGGATCG AGAACATTCG GGCGCTGCAG GCCGAGCTCA AAGCCCGCGA GTTGCCTCCC TTTGCCCCGG AGCTCTCCGC CGTCAAAGAC TGTGGTCTTG CTCGTGAGGA CACCGAAGGA CAACTCGCCT ACGTCCGCGA CAACGCCATG CGCATCAAAA CTATGATGCA GGAACACGGC GCCGTCGTCT TTCGAGATTT CGATCTCATG AAGACTCAAG AAGGATTCCA AGCCTTTTAC GCAGCCATAG GAATGAAAGC CTGTTTGGAT CCCTTGCATT CCGTATCGGC ACGGCCAACG GTGGATGGAC AAAAGAATTC TCCCGTCTAC GAAGCCGTCA ACAAGGAATC GCGCAAGAAT TTTTTTATTG GTACGTGATG GTGCTAGATA TCGACAGAAT GAGGCTTGGC ATACGACGGA TAATGTCGTT TTCTCATGGC ATTTTTTTGG TACGTCGACA GGCATGCACA ATGAATTCGT CGGAACGCGC GCTCCGCGTG CTGCAGCCTT TGTCTGCTTC AAGGCCGCCG AAACCGGCGG GGAATTCCTT GTTGCCGATG GCCGTCGCAT GTTTCGTGAT CTCGATGCCG ATCTCATCGA AGAGCTCTAC AACCGAGAAA TCCGTTACTC GGTCATGGAA TTGCCATTTT TTGGATGGAT CGATAACTTG CCCTCGTTCG TTCAGCAACC CGCCATGAAT GTCGTTCGCG GGGCAGTTTC GGCGGCGATC AACGCCAAGG TAGATTTTGA CGTGGAATTA CTCTGGGGCG AAGGTGGATA CGACGGTACC CGCATGTTAC AAGCTCGAGC ACCATCGCAG CCGCCGATTG TCAAGCACCC CGTCACTGGA GATCCGACCT GGTTTTGCAA CGTGCATTCT CACAGTTCGA AACTGCGTCA TCAGCGCGAA TCAATGTACG TAACGACTAC TGCGCTGCAA TCGAAAAGAT CCGTCGGCGA GATTCATCTC ACAGTTGTCT TGTCTATCCA TCCCACAGCT ATGGTGCGGA ACGTTTCGAG GACGGTGCTT CCCAAATCAA CAAGTCCGAC ATGTTTTTTG GTGACGATGG CGAGCTGTCG GAGGCACAGT TGAAGCAACT GGATGAAGTC ACGGTGAAAA ATACCCGCTA CGTCAAGATG ACGGAAGGAG ATGTCGTGCT TTTGGACAAT TATAAAACTA TGCACGGGCG CAACGTCTTT GACGGAACCC GCAAACACGG CGTGGCCTGG TTCGAGGGAT GGGAAGGTGA AGCTGATATG AAACAACAAT TTCAAGCCGA AGGAGCTTCT CAAAAAGTGG TTGCGTAAAC GAACAATTAA CCCGCTCCTC GTTCCTGCGC AAGTCTCCAT TCCATAGACA CACTCGCGTG TTATCAAATC CAGTAGCGCC TGCTTCTAAC TAGCTCTTTT CGTCTCGAGC GCCGAGGAAA 
AGCGGCACTT TAGTTTTGTG GAGTCCACCA AATTCTTAGA TTTGAATCTT AACAAAGAAA TTGAGTCTTA ATGTAGGTTT GCGGGGCGGA ATGAGAGCGT TGTATCTATT GAGATCGGAT TAATAACCTT CACCTTTTAG A
|
Protein sequence | MKLQLASLAL WASSALAFAP NSLPSRNNRA VGYVPSSISS LHAAALEPPP LSPLTQWGDR IENIRALQAE LKARELPPFA PELSAVKDCG LAREDTEGQL AYVRDNAMRI KTMMQEHGAV VFRDFDLMKT QEGFQAFYAA IGMKACLDPL HSVSARPTVD GQKNSPVYEA VNKESRKNFF IGMHNEFVGT RAPRAAAFVC FKAAETGGEF LVADGRRMFR DLDADLIEEL YNREIRYSVM ELPFFGWIDN LPSFVQQPAM NVVRGAVSAA INAKVDFDVE LLWGEGGYDG TRMLQARAPS QPPIVKHPVT GDPTWFCNVH SHSSKLRHQR ESIYGAERFE DGASQINKSD MFFGDDGELS EAQLKQLDEV TVKNTRYVKM TEGDVVLLDN YKTMHGRNVF DGTRKHGVAW FEGWEGEADM KQQFQAEGAS QKVVA
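The relationship between the gene sequence and the protein sequence can be checked on a short fragment. The sketch below assumes the standard genetic code (NCBI translation table 1); that is an assumption, since the Translation table field in this record is empty. `translate` is a hypothetical helper written for illustration. The fragment is copied from the gene sequence above, starting at the first `ATG` that begins the reading frame.

```python
# Standard genetic code (assumed), laid out in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0 until a stop codon or the end of the input."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 33 coding bases from the gene sequence in this record.
fragment = "ATGA AACTTCAGCT CGCTTCTCTT GCTCTGTGG"
print(translate(fragment))  # MKLQLASLALW
```

The result matches the first 11 residues of the protein sequence. Because the gene is marked as spliced, translating the whole gene sequence in one frame would not be expected to reproduce the full 435 aa protein; the exon boundaries would be needed for that.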