Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47203 |
Symbol | |
ID | 7202190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 805466 |
End bp | 808162 |
Gene Length | 2697 bp |
Protein Length | 702 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181445 |
Protein GI | 219122213 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGAGGCAT CCCTGCCTCC TTCCCTACCT GTACACACAT TCGCACACAC ACACACATTC ACGTGAGAGA GACACATTGA CACATCCGTG TGCATCGTGC ACCAAAAGCG AGCCCTCGTG TTTCTTGCTG GAGTCGCCGT TGTCGTAGAC GTTGTTGTCG TTGTTGCCTC CTCCTGGGTT TTTCGTTTGT GGGAACAAAA CCAGATCAAG CACGACCCTT TGCTGCGGGT TGCCTTTCAC TGGTTCCGTA TTGGTGTTGC TGTGCGGGTT GTGTAGATTG TGTGTTGAGT GTACCGATTG ATCGCAACGA AAGGACTGTA CGAGTCGCCT TCCGTGGCGT GCCTCCTGGT CCAGTCGGGT ACATTCGGAA GTGCGGGTCA GTGGTGATAT CGAATACGTG CACACGTTTG TCTGATCGAT TCTGCTCGGC AGCGTCCATT TCCAAGTTCC AAAAGGGTCG ATCTTCCAGA CCGGATTCCT GTATACCTAC CTACAGATTC ATATCCCACA CCTACTTTCC CTATATCGAT GCTTCACTCC ATCGCGGAAC AACGCCAAAA GCGGGAGCTT CCCCCGCTCT TCTGGTACAT GGAGTTGGGA GAATGGGAAA AGGCGTCCGA TCGGGTACGT CGACACCCAC GGGAAGTCAA GACTTGGGCT ACACTCCGTA CCAAAAACAA CACGGACGAA CCCGCCCCGT TGCCTTCGTC CGCCAGCATT GCCAGTTACG CCACCAAGGC ATCTTCGTCT TCCGGAACCA AGCGACTCGC TCTCCACCAC TCGTGCTTCA AACTGAGGAC TGCTGGCTCT TGTCCCGTCG CGTCCAAGGC TTCGGAAGAT CCCTACGTAC AAGTCTGTCG ATTCATTCTC ATGCTCCTCC GACTCTATCC GGAAGCAGCG GGACAGCGAG AGTCGCGACA CGGTTGTTTG CCCCTACATC TAGCCTGCTT TGCCTCCTGT GCGCCTAGGG CCGACGACGA TACCCTCCAC AACAACAGCA ACCACAGTCG CAACGCGACC TCACCCTCCG CCATTGCCAA GGCACCCGTC GCTCGCCCCA ACATGATTGC GCGGCGCATA GCGTCCGACG CCACCTCCGC CACCACGGAC ACCAACCTTT CCGCCGTGCA CGCCGAAGAA ACATACACCG GAAACATGGC CGACAAGCAA ATACGTCGCG ATCACACGGT ACACGTCGAT CCCAACGTAT CCGTATCCAC CAAGAAACAT CTATTGATCA GTTCCAAACG GGAAGAAATG GCCGTACAAG TACTCAACGC GCTGCTCGAC GCCTACCCCA AAGCCATACG TACCGATTCC GAAGGCGGGC GTTTGCCGCT ACATACGGCC TGTGCCGGAC GGGCCACGCC CCGCGTCATT GCCACCCTCG TCACGGCCTA CCCCTCCGCG GCACGACACC GGAACAAGGA TGGATTCCTA CCCCTACACC TTACCGCACA CTGGGGCGTC GCCCATCCCA ACGTCGTCGT TACCCTTCTC AAGGCATACC CCGACGCTAC CGTTGGACGC AATCGCTGGG AACGAACTCC ACTGGAAGAA GCATTGTGCA TGGCGGGAGA AAACGGTCGA CCACATCAAG CCGCCATGGT CAGGGCGCTA CGGAAGCACC CCTCCTACTG GCAACGAGCG ACCGCCGAAA TTATTCAGGG CACGCGACGT CTACGCCAAC CCGGCAGTAA CGTCGTGGAT GTGGACGAAA GCTTGCCCTC CAACGACAGT ACGTCACTAG AAGAGCAACG CCAAGGTCAT TTCGCTCACG GACACAATCC ACTCGTTGAT CAAGTCGAAC AGGAACATTC CAAAAAGCCC GCGGGCAAGC TATCCCCAGA AGCGGCCAAA AAGAAGCCCA TGGATCATAA ACTGGACGAA CTTATTCGAC AGCACGACTG GGACGCGGTC ATTCGTCGGG TCGAAACGAA TCCCCTCGAG GTGGAGACGG AATTGGCGGT CATGACCCGT GGCGGATTCC TCAGTTGCTC GGGTGTCACC CCACTCTACT ACGCCTGCGA ACGCCAACCC CCCGTCGCCG TTGTACAAGC CCTCATCCAT GCCCATCCCC TCGCCGTTCT CACGCGCGCC ATGCCTGGTG GGAGTCTACC ACTCCACGTA GCCTGTACCT GGCACGCCTC ACCCGACATT ATCTGGGCCT TGTTGGCCGC CGATCAGGGG GCGGCCAAAG TCACCGACGA ACTCGGCAAC GTGGCGCTCC ACTCAGCGCT TTTTTCCGGA GCCGATGTCC GGGTGATCCA AGCGCTCGTC CAAGCCGATC CCGAGGCCGT ACTCTCACGG AATCATCAAG GATCCCGACC CGCCGATATC GGCAAACGAC TTCGGCACGA AAATCGCAAA ATGGTGCTGC CAGTACTCCA AACAACCAAG GCACACCTGT TGGCGTCCCA TCGTCGGTCG CGCTCGTCGG GGACATTGGA GGACATTGCT CAACAAGCGG AAGAATTGAA TCAAAGGCAG GGCACGCCCT TGGGCACTCC GCAAAGTCTT CATCGACTTG CGAAGGATTT TCCGAAAGAA GGCAATCCCA ACCTTCACAC CGACGAGGAG CAGGCGATCG AAGTCAGTTA CGGTGCCCAA GAGAAAAAAG AGCTCATGTG GGTGTAATTG GACTGAAAAT AACAGCTCCT TCTTGGGTAA ACGCTATGCT AACAGCATTT TGGCAAC
|
Protein sequence | MLHSIAEQRQ KRELPPLFWY MELGEWEKAS DRVRRHPREV KTWATLRTKN NTDEPAPLPS SASIASYATK ASSSSGTKRL ALHHSCFKLR TAGSCPVASK ASEDPYVQVC RFILMLLRLY PEAAGQRESR HGCLPLHLAC FASCAPRADD DTLHNNSNHS RNATSPSAIA KAPVARPNMI ARRIASDATS ATTDTNLSAV HAEETYTGNM ADKQIRRDHT VHVDPNVSVS TKKHLLISSK REEMAVQVLN ALLDAYPKAI RTDSEGGRLP LHTACAGRAT PRVIATLVTA YPSAARHRNK DGFLPLHLTA HWGVAHPNVV VTLLKAYPDA TVGRNRWERT PLEEALCMAG ENGRPHQAAM VRALRKHPSY WQRATAEIIQ GTRRLRQPGS NVVDVDESLP SNDSTSLEEQ RQGHFAHGHN PLVDQVEQEH SKKPAGKLSP EAAKKKPMDH KLDELIRQHD WDAVIRRVET NPLEVETELA VMTRGGFLSC SGVTPLYYAC ERQPPVAVVQ ALIHAHPLAV LTRAMPGGSL PLHVACTWHA SPDIIWALLA ADQGAAKVTD ELGNVALHSA LFSGADVRVI QALVQADPEA VLSRNHQGSR PADIGKRLRH ENRKMVLPVL QTTKAHLLAS HRRSRSSGTL EDIAQQAEEL NQRQGTPLGT PQSLHRLAKD FPKEGNPNLH TDEEQAIEVS YGAQEKKELM WV
|
| |