Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49851 |
Symbol | |
ID | 7198673 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 91596 |
End bp | 94601 |
Gene Length | 3006 bp |
Protein Length | 703 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184645 |
Protein GI | 219128912 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.165107 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATTCC GGCAACACAA TCCCAATGTG ATCACTCCTC CCAACCCTTC TGTTTCTAAC AAGAAGGATG TGGGGGTGAT CGGTGAGAAG GAGCCGACTG TGCGATCGGA ATTCATCGAC AAGGAAAACA TCCCTCCCTC TTTTTCTCCT CCTATGCCTC CTAGCGTCCA CGGTTTCCAC GACGTCGAGA TGGAACGGCT CGTTCGAGGT GTTAATCGCT TCGGATTGAA CCATGGTAGC GATAATCCCA GTCGGGTCGA AGGATCCATC CTTGGGTTAG CTCCGGCCTA TCCTCCCGCC GTCGTACGAC TAACCTCCCT CAAAAAAGCA AACTCGGCTG CGGAGTCGTC TGAGCCCTTG GCGCAGCATC GCTTCAACCG TTCTCTCTGG AACCGAGCTG GCTCCAACAC AACTCAGCAG GACAGTATCA AAAAGCTTGA TGTGCTCGGT CGTGGTATCC GTTCACCGAA GAGGATTCCT TTCGGAAACG TACACCACGG CGGCTGTGAC AGCGCACGCA ATGACGTCAA GAGCGCCGAA ATTCCTCCTC CACGAGCCGA CTTTGCTAGC ATCGTCCGAT GCAACGCCCC CCAGAGTCCA AACACTTTGC AAGATCAACA CCATCAACAT TTGCTCCTCA AAGAAGCCGC TCCCAAGGAT CCGTCGATTC CCGCTGCTGT GGCGGATGCC ATCAAAGCGA GTCGGGAATG CCGTGGCACC ACGAAAGTCG CTACGGTGGA AAAAGCGGAG CTTTATCTTC GCCATGCTAT CGCCGAATAC CATGCTGGAC GCATCACGGA GGCTCCGAAT GGATCGTGCT ACAACAACAT TGTACATGGC TACGCCGATC TCAAGGAACC GGCAAAGGCC GAAGCCATTT TGCATCTCAT GTGGTCCGAT TTTCAACAAG GCAACGAGGT AACTTGAACC GCTTGTACTC GAGTGCTCAT GGCTCTTACA ATACCCTCTT CTAACCAATT TCTTTCTTCG CACAAGTTGG CAGAGCCCAA TGTCCGCATC TATACCAGTG TCTTGTATGC CTGGGAAAAA TCGAAAAAGG AAAGTGCTCC AGAACGCTGT GAAGCCATCC TTCAGCAAAT GCATCGCCTC CACGATTCGG GAATTGCCAA ATTATGCAAA CCAGATCTTT ACGCGTATAC TGTGTGTCTT CATACGTGGG CCGACTCAAA ACGCCCTGAC GCTCCGAAGC GAGCCGAGCA ACTCTTTCGC AAGATGAAAG ACCGCTATCA TAACGGTGAC ACAGAACTCC AACCAGACTC AGTGTGCTAC GTCAATCTCT TAAACGCTTA CGCTAACTCC GCAATTGAAT ACGCCCATAC TGAGGACTTA CTCTGGGAAA TGGTGGATGA CTTTATTGCC GGTAATGAAA GTGCCAAACC CATTATTCGC AACTTCAATA CCGTTTTAGC GGTGTGGTCT AAATCTGGTT CAGCAGAGGC TCCGGAACGC TCCGAAGCGA TAATCCGACG CTTGCATGAG TTGAACAAAT GGGGAGCCTT GGACACCAAA TCTGACCAGT ACACATACTC GCTGCTTCTA AAGACTTGGT AAGTTTTCCC ACGGTAGTTT GTCTGAGCAT GACATGGCAC CTCACCAACA CAACCTATTC CCCATTAGGA CGACGTGTAA TCGTCACAAT TCTGCTCAGG AAGCCGAACA GGCTTTGTAT TGGATGGAAA GCCTTCACAG TGAAGGTGAC CAAGGGGCGC GTCTTGACGT CATTAAATAC ACAACAGTTA TCAGTGCTCT CGCACGTTCC GGCAATCCAG GCAGCGCCGA GACTCTTCTA GAGAAGATGC TCGAAGATTA TCAAAAGGGA AATTCGAAGG CCAAGCCCGA TGCGAAGTCA TTTAACATGG TTCTCTCTGG ATGGTCTCGT TACCACAACG CAACGGTCGC CGCTGCTCGT GCCCAAGCTC TCCTGCATCG TATGTGGAAC TACAGAGCGG TCCATATCGC CCCGGACACG TGGTCCTATA ATACCGTTTT GTTCTGCTGG AAGAACGCGA ACGGTCCTAA ACAGGGGGAA TCGCTGCTAC TCGACATGGA CCGCATGGCA GCAAAGGGGT TTGCCAAAGC TCGTCCTAAT AGTACAAGCT TTCAGGCCGT CATCGACTCT TGGAAGAAAT CGAACTTTCC CTTCAAGCAC CAGCACATTC ATCAGCTTCA GGAGGAGTCT CAGAAGCGCT TTGGTGAAAG TGCAAAGTCC AAAAACATGG ACTCCCGCAG CTTCAAATGA GTAGCGCGGA TACAATTGAG AAGAGCCGTC AAGGTTATGT TGCTCAGAAC CACACACCTT CCAACACCTA TCCGCTACGG GCTCGAATAG CATAGTTCTG CTTTCTGTAT CGCCCTAAAT CTCAACGTCC AATCTCACTG GTAGTAATAT CTGCAACCCT TAAAGACCGC GTGGTCAATT TGTCCTCTGT ATGTTTTAAC GGAAATTGCA TTCGGTCTCA TTTGCGAGAG TATATTCGCG AACCCAAAGT TTCAATGGCA ATCCGTACTA TACGCGTCTC TCGGCGACAG GAGGAATAGC GAAGCGAGCT AGAAAAGTGT TTCGACAACA GAAGAAATGT CGGATTGTGC GGCTACAAGT AAGCTTTTGC GTTTCATAGG TTGTCGTAAA CAGCTTTGGG AATAACATAA ACACTTGATG CTTCGCAGAA AATCGTCCCG GTTTCGGGAC ACTCCACAAC CACCTTCCAG ACCAGTTTTC GCCGCACTCG TTCCACGAGG TGACATCGTA TATAAAGATG TGTATGAGCG GGAACGAATT TTCGGTAGTC GATCGATAAA TTGGCAGTAT ACGCCATTGG CACATCCCCC AAGGCTTCGT AACCGGCACC TAAAACGTCG TCTATAAGCA GCGCCAGGAT ACCACCATGC ACAACCCCCG GGTGGCCGTT CACGTGCGAT CCAATGAGGA CCGATGCCAC CACAACGTTC TCACGGACAG TGTTGCCTTT TGGAGCATCC AATGCG
|
Protein sequence | MAFRQHNPNV ITPPNPSVSN KKDVGVIGEK EPTVRSEFID KENIPPSFSP PMPPSVHGFH DVEMERLVRG VNRFGLNHGS DNPSRVEGSI LGLAPAYPPA VVRLTSLKKA NSAAESSEPL AQHRFNRSLW NRAGSNTTQQ DSIKKLDVLG RGIRSPKRIP FGNVHHGGCD SARNDVKSAE IPPPRADFAS IVRCNAPQSP NTLQDQHHQH LLLKEAAPKD PSIPAAVADA IKASRECRGT TKVATVEKAE LYLRHAIAEY HAGRITEAPN GSCYNNIVHG YADLKEPAKA EAILHLMWSD FQQGNELAEP NVRIYTSVLY AWEKSKKESA PERCEAILQQ MHRLHDSGIA KLCKPDLYAY TVCLHTWADS KRPDAPKRAE QLFRKMKDRY HNGDTELQPD SVCYVNLLNA YANSAIEYAH TEDLLWEMVD DFIAGNESAK PIIRNFNTVL AVWSKSGSAE APERSEAIIR RLHELNKWGA LDTKSDQYTY SLLLKTWTTC NRHNSAQEAE QALYWMESLH SEGDQGARLD VIKYTTVISA LARSGNPGSA ETLLEKMLED YQKGNSKAKP DAKSFNMVLS GWSRYHNATV AAARAQALLH RMWNYRAVHI APDTWSYNTV LFCWKNANGP KQGESLLLDM DRMAAKGFAK ARPNSTSFQA VIDSWKKSNF PFKHQHIHQL QEESQKRFGE SAKSKNMDSR SFK
|
| |