Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_23826 |
Symbol | |
ID | 7198954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 288029 |
End bp | 289834 |
Gene Length | 1806 bp |
Protein Length | 552 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185005 |
Protein GI | 219129668 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTTTGCCA CGCAATCATC CTTTCGCAAG TTGTATTATC TATCAAGTGA GACTAGCATC AGGGTCTCGG CAGTCAGGTC CATTGTATCG AACAAAAGAG CGGCTACACG CGAGCCCAAA CAACGCGTCG TTGGCCTTTC TCTAATTATG GAAAGAATCC GACAGGTTCT TCTCACACCC ATGCTGCGGC CAGAGCTCTT TGCTAAAGGT TCAATGAAAC GCCCACGCGG GATTTTGTTG CATGGACCCA GCGGAGTGGG CAAATCGTCG TTGGCACGGC AACTCGGCGA AGAGTTGGAG TCAATTTTGC ATGTTCAGTA TGTGAATTGT TCCTCGTTGC AATCTCAAAC AAGTATCGTT GGGGAGGCTG AGCGAGAGCT GTCCCGTTTG TTTCGGTTAT CCGGTACTGC AAAAAAGCAA AACCGTCTTT TGATTTTTGA CGACATCCAC CTGATATGCC CCAGGCGTGG TGGCTACGCC CCAGGTACTG ATCAGTTGGC ATCTACGTTA TTGTCCTTGA TAGACGGGGT CGATGGAAGC GAGGATCAGG AAGTATCCAA CGAGGGACTT GTTCTCCTTG CCATTACCAC GAATGCATCG CTGCTGGACC CGGCACTGCG TCGACCGGGA CGTATAGACG TGGAAGTGGA AGTACCAATA CCAGATGAAG CTTCAACGAG GGCGGAGATT TTGCAGTTCC ATCTGGGCCA AGCTGGAGCA TCCCCACCAG CAATTTCATC AACTGATTGG ATATCGCTCG CAAAGCTTGC CAAAGGATTC AACGGTGCAG ACTGCATGCT CGCTATGAAG GAGGCAATTC GAAGCGCCAT TCTTCGGAAT TTGAAGGCTA CCGTATCACG CGAAATCTCT TCACCGCCTC CGAATCCGAC ATTTAGTGAT CTAAGTACAG CGATTCGAGC AACCAAGCCC TCCATTATCA AATCCGTCAC GGTAGAAATA CCCAAAGTCC TCTGGTCATC GATTGGCGGT ATGGAGAGCG TGAAACGTGA GCTCCGTGAA GCAATCGAAA TGCCCTTAAC TCACAGCGAT TTGTTCATCA AGCTGGGTAT ACCACCACCT CGCGGAATTC TGCTGTACGG ACCACCAGGC TGTTCCAAAA CGCTTATGGC ACGGGCACTG GCTACGGAAG GACATATGAA CTTCCTTGCT GTCAAGGGGC CTGAGTTGCT GAGCAAGTGG CTGGGTGAAA GCGAACGCGC TCTCGCGTCT CTCTTCAAGA GGGCTAGAAT GGCCAGCCCA TCCATTGTGT TCTTTGACGA AATCGACGCA ATAGCGTCCA AGCGAGGTGC TGGAGACAGC TCCAGTAGTG GTCGGCTTTT ATCCCAGCTA TTAACCGAGC TTGATGGTGT AACAAATACT GTTGGAAACA CGAAGCAGCG CGTTGTAGTC GTGGGGGCTA CTAATCGGCC CGATATACTG GACAGCGCGT TGACTCGCCC AGGACGAATC GACCGGATGA TTTACGTCGG GGTGCCAGAT TCGGACACTC GTGTCCGTAT ATTCCAGATC ACACTTGCCG AAAAGTCTTG TAGTCAAGAC GTCGATATAG AACACTTGGC TAGAGATGAC GTTACCCAAG GATTTTCAGG TGCGGAATGC GTCGCTATTT GTCGGGATGC TGCTCTGCTA GCCTTGGAGG AAATAGAAGA TACAGGTGAA GATATAATTC CACAGATCCG GATGCAGCAC TTACTTGAAG CAGCAGGAGG TATGAAACGC CAAATCACTC CCCAAATGAT CGAATTTTAT GCTTCTTTCC GTGAAAAGGG CTTTGCCAAG GTATAG
|
Protein sequence | MERIRQVLLT PMLRPELFAK GSMKRPRGIL LHGPSGVGKS SLARQLGEEL ESILHVQYVN CSSLQSQTSI VGEAERELSR LFRLSGTAKK QNRLLIFDDI HLICPRRGGY APGTDQLAST LLSLIDGVDG SEDQEVSNEG LVLLAITTNA SLLDPALRRP GRIDVEVEVP IPDEASTRAE ILQFHLGQAG ASPPAISSTD WISLAKLAKG FNGADCMLAM KEAIRSAILR NLKATVSREI SSPPPNPTFS DLSTAIRATK PSIIKSVTVE IPKVLWSSIG GMESVKRELR EAIEMPLTHS DLFIKLGIPP PRGILLYGPP GCSKTLMARA LATEGHMNFL AVKGPELLSK WLGESERALA SLFKRARMAS PSIVFFDEID AIASKRGAGD SSSSGRLLSQ LLTELDGVTN TVGNTKQRVV VVGATNRPDI LDSALTRPGR IDRMIYVGVP DSDTRVRIFQ ITLAEKSCSQ DVDIEHLARD DVTQGFSGAE CVAICRDAAL LALEEIEDTG EDIIPQIRMQ HLLEAAGGMK RQITPQMIEF YASFREKGFA KV
|
| |