Gene Information |
Locus tag | PHATRDRAFT_18142 |
Symbol | AP2mu |
ID | 7197526 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 521626 |
End bp | 523413 |
Gene Length | 1788 bp |
Protein Length | 425 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177952 |
Protein GI | 219112401 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.361104 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CACAGTCAAC TCACAATTAC GTTCTGACAG TGAGTGTGAG TCCAAACTTG ATGCTACTGC TCGCTAGGAA ATCTTCCGTT GTACCAAGAA TTTCATCAAG GCAAACATAT TAATAAGTCG AAATGATTTC CATGATCATG GTCCTGAATC AAAAGGGAGA TATTATGATC TCCCGACAAT ACCGGTAGGT CATAAACGTT TTGGATAGGA TTCCCATGAC GTCTTGTATC GTAAAGTCTG ACATCTGACC TGTCTTGTGT TTCAACGATG TTAGCGACGA CGTTGGTAGA GCGGCGGCTG ATTCATTTCG TCTTCAGGTA CGTTCTCGAG GCATGCCATT TTAGGCATCC CCATGGTTAG CATGCCGAAT CCACGCTTCC TCATATCCAC GATACTACAA ATCCGTGTGA CCACGTTAAG TACTTATGGA CTTAACCGTC AGATAACAGT ACAAATGCTT AACCCTGATT TTTTATTATT GTCAGGTTGT GGCCGCCAAA GAAACCGGCA CGGAAGCTCC TGTAAAGCGA ATTGAAAACT GTTCTTTCCT CTATACTCGC CACCTGAATA TGTACTTCGT TGCCTTGACC CGTTCAAATG TGAACCCGGC TTTGGTTTTT GAGTATCTGT TCCAGCTAAT TAAGATTCTG AAGGCTTACT TGGGAGAGGA ATTCGACGAG ACAGCAATGC GCAACAACAT GACCTTGATC TATGAGCTGA TGGATGAAAC GATGGATTTC GGTTACCCTC AAAATTGTGC TGTTGATGTT TTGCGATTGT ATATTAATTT GGGAACGGCC AAGCCCCAAG ATGAACCCGA GCCTAGCAAG CTGACTAGCC AAATAACCGG AGCCATTGAT TGGAGACGAG AAGGAATTCG CCACAAAAAG AATGAAGTTT ACATTGATGT GCTCGAAAGT GTCAATCTTC TACTTTCTAG TACTGGAAAC GTGCTACGAA ACGAGGTCGC AGGGTCGGTA CAAATGAATA CGAAATTGAC TGGTATGCCG GAATGCAAAT TCGGATTGAA CGATAAGCTT GTGATTGAGA AAGACAAAGA AGATCGTAAA CCGAGCGTTG ATATTGACGA CTGCACTTTT CATCGCTGCG TTCGTCTGGG AAAGTTTGAC GCCGACCGCA CAATTACATT CATACCACCA GATGGTGAAT TTGAATTGAT GCGATACCGA GTCACGGACA ACATCAATCT TCCTTTCCGA ATTATTCCCG CCGTACAAGA ATCGCAAAAT AATACCAAAG TGTCTATCGA CTTAAAAGTC ATTGCGAACT TTTCGGACCA GCTCTTTGCC ACCCATGTGG TTATCAAGAT CCCGGTACCA AAGAATACAT CCAAGACGAA GATCAAGCAC TCGTTTGGTC GCGCCAAGTA CGAGCCAGAA CAGCAAGCTA TTGTGTGGCG AGTCAAACGC TTTGCCGGAA AAGCACAGTG CATTATCAAT GCCGAGGTCG ATCTCATGCC AACCGTTCGT TCCCAGCCCT GGAGCCGACC ACCTATCAAC GTGGAGTTTC AGGTGCCCAT GTTCACAGGT AGTGGTGTAC ACGTACGATT TCTTCGTGTG TACGATAAAT CAGGCTACCA CACGAATAGA TGGGTTCGTT ACATTACAAA GGCAGGTTCC TACCAAATTC GAATTTAAAG GAGTTCTGCT TCGTTTCAGT GTTTGTTCGG CATGGGTACG ATGGTTAATA GTCTCTTTGC CTGGATGGTC GTGTTGTGGC CGGCTTTAGG CACCTACAGT AGTTTACAGT ATAACAAAAG CCGATTTT
Protein sequence | MISMIMVLNQ KGDIMISRQY RDDVGRAAAD SFRLQVVAAK ETGTEAPVKR IENCSFLYTR HLNMYFVALT RSNVNPALVF EYLFQLIKIL KAYLGEEFDE TAMRNNMTLI YELMDETMDF GYPQNCAVDV LRLYINLGTA KPQDEPEPSK LTSQITGAID WRREGIRHKK NEVYIDVLES VNLLLSSTGN VLRNEVAGSV QMNTKLTGMP ECKFGLNDKL VIEKDKEDRK PSVDIDDCTF HRCVRLGKFD ADRTITFIPP DGEFELMRYR VTDNINLPFR IIPAVQESQN NTKVSIDLKV IANFSDQLFA THVVIKIPVP KNTSKTKIKH SFGRAKYEPE QQAIVWRVKR FAGKAQCIIN AEVDLMPTVR SQPWSRPPIN VEFQVPMFTG SGVHVRFLRV YDKSGYHTNR WVRYITKAGS YQIRI
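The coordinate and sequence fields in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only: clean_sequence, inclusive_length and gc_fraction are hypothetical helper names, not part of any database API. It assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the listed Gene Length (523413 - 521626 + 1 = 1788), and that the spaces in the sequence fields are display grouping only.

```python
def clean_sequence(field: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(field.split()).upper()


def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return abs(end_bp - start_bp) + 1


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates copied from the Start bp / End bp fields of this record.
print(inclusive_length(521626, 523413))  # 1788, matching Gene Length

# Paste the full Gene sequence field in place of this short prefix
# to compare the result against the listed GC content.
prefix = "CACAGTCAAC TCACAATTAC"
print(len(clean_sequence(prefix)), gc_fraction(prefix))
```

Because the gene is marked as spliced, the 1788 bp genomic span is not expected to equal three times the 425 aa protein length.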