Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33756 |
Symbol | |
ID | 7197823 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 277875 |
End bp | 280738 |
Gene Length | 2864 bp |
Protein Length | 711 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178198 |
Protein GI | 219114805 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0254551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGCGAA AGAAGAAAAT TGCATCCCAG CAACCGGAAT CATCGAATCA ACCGGCTAAG AAAGCAAAGC CAGAAACTGG TCCGTCGTTT CTCGTGAAAA AGTACCGGCC GGATAAGCAT TATGCATCGA TGGACGCTTT CGTCAAGTAC GCGAAGTCTC ACCGAATGCT GGACGCGGCC TATGTACAAT GGAACAAAGG CGTAACGGCC GAGAAGCCGT TTGTTTTTTC TGTACGGGTG GGTGGGGTAG ACCTCGGTTG GGGGCGGGGG AAGACGCGTG AGGCTGCCAT GGAATGTGCC TGCCGAGCAG CCTTTGCCTT GGTTGGAGCT CACGGGTATA AAAATTGGAC AATTGATGAC AACTGCTTGA TGGAAGAACC AGTAGATGTG CCTCCTCCAC CACCGCCTCC GATGCCCGGC GCGATGGCGC TCGGCTTGCC TCCTTACCCT CCAGGCAGCC TCCCTCCTCC TCCCATGCCC GGCTTTTTAC CACCGCCTCC CCTTCCCGGT ATGCTTGCGC CTCCGCTGCC TCCCGGTGCG CCACCGCTTC CTGATGCACC CCCACCTCCA ATGGCAGCGG ACCTTATCCC TCAGCCCCAA ATGGTATCGA ACCAAGCGCC CGTAGCAACG AGCGTAGCGT CCGGTGTCGC CAATAATGTG AACTCTGCTG TAAGTGATGC GTATACAACT TCTGCAAGCG TATCTCTCAA TTTTGGAAAG CCTGCTGTTG TGAAGAGTCA GCGAAAGCAA CTCAAGGGTG GATTGACTCT TGTGTACGAT CCGCTCTCGG AAGGAATGGA AGAACTGAGT ATGGAAGAAC GACGAGCTAG CCTAGAAAGA TATCAAAAAA TGTTGGTGCG CTCTGTGGCC AAGTAAGGAA CTACTATCTT GTAAGTAGGA ACAAGAATAT ATGAATGGCT GTGAGTCTCA TTTTGCCTTC AGGGTGGCGA ATAGCGGAGA ATTTTTTTTG GGAATATCAA GGTATGTCGC GTTCTCGCAA CTGGTGGGGT AAGATAGAAA CGTAAGTTGT TACATTGGCA ATCTTTTTGG TGTCTTGCCG GTAAAGCCCA TCATTGGAAG CATTGACTAT GAGGTATGTG GTATTTTCAT CTTTGTTCGC GATGAGATTC ATGCGATAGG ACCCGAGTAG GGTCGTCAGT TTGTTCCTTA CCGGGGCTCT TGGCAGCTTC GTCTTATGAA AACTCCGCTG AGACTACGCT TGTACTAGTA AAAGTTCCAT AAACTACAAG GTGGAACTGA TATCTTGCGT GTCTTTACTT TTTGTTGACG CTACGAGGTA GGGTAGAAAT TCACATAGAT TTCTAACTGT AGAACCGGGT CTGCCTTGCC TTTTCGTCCG AGTAGGTAGT ATTGGGTTTT TTCACTGTCT TTGTCAGTTC TGCCACCACA TCACTGTCAC AAGAAATATT TTTCTTTCTC GGTCTGGCAA TTTGATCATT CGATTCAATG TGATCTTCCA GCTTGTTCTG ATTGTGAAAG CAAATCATTT TACCGTTGTA GGTAATAAGA ACTTTTCTCA TGGCTTCACT TCTACTTTCG CTATTGACAG TGAGCAGACG AGCGAGCTCT ATGAGTGTGG CTAAAGGCGC ATTGACTCGT CAAGGGCCTG GCATTTATCT ATCGAAGTCG GCGCCAATTC GATGCACGCT TCAACAACAA CAACAACAAT CTCTATCAAC CAGAAGCAGC GATGATCAAC GAAATCAGAG CGCTGGTCAA GTTATCAAGG ACCACCAATA CTGCGTTGAT CTTGTCCGCG AGCGGGACAG GGAAGGATAC CGTAAGTAGG TTTGGGTTGG ATCAAGGGCA CACTTTGACG ACGCAAGCTT CTCCAAAACA ACTCACACTT CGTCGTTCGC TTTTGTTGGT AGTCTGCGGT TTACTGATGC CGCCAGGAGC TACCAAAGCG TACTTTGCCA TACGGGCCTT TAATGTTGAG CTAGCGTCCG TCAAAGACTC TCACAACTTG CGCCGTCGGG AGCAGCCTGG CCAACAAGAG TCTTCGAGTA GTGTTGCCCT GCAAATGCGC ATGCAATGGT GGAGAGACGC CTTGAAGGAA ATTTACGAAG ACGAAATGAG TGTTGCTGCT GATCCTATTC TAAGGAATCT ATCGGTGTCC TGCTGGCACA ATCCTGTCGT TCGAGCTTTG TCCCAAGCTC ACCAGCAATG TGACTTGACA CGGCGCTTCC TCGAACGCTT GATTGATGCT CGCGATTACG ACCTCAGTGT TGCGCAGTAT TCTTCGATGA ATGAAGCGGC AACCTATGCA GAGGATACCA TTTCGAGTCT TCTGTACCTA GCCCTGGAGT GTACAGGGGT AAGTTCAGCG GTTGTTCTCT GTGTCTATAG TTCTTCCAAC TAGATTCCTT TTCTCACAAG CGTGTGTAAC TGCAAAGACT CGCGACGACA ATGCTGATGA AGTCGCTTCA TACGCTGGTG TCGGGATAGG TCTAACGACA GCCCTACGCG CGACGCCCTT TCGATTAATG CATGGTGAAA TACCCATTCC AAAGGATCTG CTGCGCCCAG CTTTTCCTTA CCAGGAATTG ATGAAACAGA CCGAAGAGGA ATATACGTTG ATAGAATCCG ACGCGATAGC ATTTCGCGAG GCAGTCCGGC ACATGGCAAA TGCTGCTTCC ACCAGTTTAG CTCGTGCGCG CGATATTCAG GGGCATGTAC CGAGGCATGC TAGAGCTTGC TTGCTACCGG TTGTCCCGTC AATTCATTTC CTTTCAAAGC TGGAGGGCGT CGATTATCAC TTGTTTGACC CGAAGCTGAA CGATGACACA CGACTGCGAT TAATGTTACT CATGGGACGA ACATGGCTCA CAGGAATCTT CTAG
|
Protein sequence | MGRKKKIASQ QPESSNQPAK KAKPETGPSF LVKKYRPDKH YASMDAFVKY AKSHRMLDAA YVQWNKGVTA EKPFVFSVRV GGVDLGWGRG KTREAAMECA CRAAFALVGA HGYKNWTIDD NCLMEEPVDV PPPPPPPMPG AMALGLPPYP PGSLPPPPMP GFLPPPPLPG MLAPPLPPGA PPLPDAPPPP MAADLIPQPQ MVSNQAPVAT SVASGVANNV NSAVSDAYTT SASVSLNFGK PAVVKSQRKQ LKGGLTLVYD PLSEGMEELS MEERRASLER YQKMLVRSVA KVANSGEFFL GISRNVSCYI GNLFGVLPVK PIIGSIDYEV IRTFLMASLL LSLLTVSRRA SSMSVAKGAL TRQGPGIYLS KSAPIRCTLQ QQQQQSLSTR SSDDQRNQSA GQVIKDHQYC VDLVRERDRE GYRATKAYFA IRAFNVELAS VKDSHNLRRR EQPGQQESSS SVALQMRMQW WRDALKEIYE DEMSVAADPI LRNLSVSCWH NPVVRALSQA HQQCDLTRRF LERLIDARDY DLSVAQYSSM NEAATYAEDT ISSLLYLALE CTGTRDDNAD EVASYAGVGI GLTTALRATP FRLMHGEIPI PKDLLRPAFP YQELMKQTEE EYTLIESDAI AFREAVRHMA NAASTSLARA RDIQGHVPRH ARACLLPVVP SIHFLSKLEG VDYHLFDPKL NDDTRLRLML LMGRTWLTGI F
|
| |