Gene Information |
Locus tag | PHATRDRAFT_48080 |
Symbol | |
ID | 7203431 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 131486 |
End bp | 133333 |
Gene Length | 1848 bp |
Protein Length | 502 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182622 |
Protein GI | 219124672 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGAC TCCGACAACT CGTGCACGAC TATTACCGAG CGCGACGGAG CAATAGGCAA CCGCCCTGGA CGACGCTCCC ACTCGCCGAT CGGGACGCCC GGTCAATGGC CGCCACCCAC CTTTGGGATG TTGCCTGTGG ACTCCACCAG GACCGCAACG AACGTCAAGC GTACACGCGA GCCATCAATA TTCTACACGA GCACTTGTCG ACGCTCGTTC AACAACGATT TCCCGATGCG CGCCTGGGCG TCTACGGAAG CTGTCTCAGC GATTTGAGTT TGGGTAAATC GTCTGACGTG GATCTGAGTT TGGATTTTAA GCGTGCACGA AAGGTCAAAG ACCAATTCGA AATAGGCAAA TGTCCTGTAC AACGGTACGA AAGCGAAATG AAGTCTTTGG TCTACGCAGT TTGTCGGACC ATGGAACGGC GAAAACACGA ATTTCGGGCT ATGCAACCCG TCACCCGGGC CCGGGTGCCC GTAATAAAGG GCACGTATCT GGGAGCCAAC AACCCGTACA CGGTCGACGG ATCCATTGAC TTTGACGTAT GCTTTCTCAA CGACATTGCC GTCGTCAACT CGTCGCTACT GCGTGAGTAT TCCATCGTGG ATGATCGCGT CAAGGCCCTC ATGATTGCAG TCAAGCGCTG GGCCAAGGCG TTTGGTATTT GCTCTTCCCA ACACAATACA CTCAGCTCAT ACGCGTGGAT GAATCTGGTC ATTTTCTACC TACAGAACGT AGGCTTTGTA CCCAACTTGC AAAGTCCCGA GTTGGCGCAG GCGGCGGGAA TATCCCGAGA TCCCAGGAAC GAATGGCACG ACGTCAACAA CCTCGACACG TTTTACCTCA AGTGGGAGGA TGTCTCGTCG GTATGGCAAC GAGCGCCGGC GATGGAATCG GTGTCGGTGA CGAGTCTATT GTACGGATTT TTCCGCTTTT ACGTCGTGGA GTACCCTTCC ATCTTGACGG TGTCCATCAA ACTTGGTCGG GACACGTTTC TACCTAAAAC CGTCTTTCGT AAATCATCGT TGGGCTTTTG GTGCATTGAG GATCCTTTCG AGACGTACGA CACGCACTGC CCTCATGATT TGGCCATTAC GGCCGGCGTG GGCGGGGTCC GCGAGATCAC TTACCGATTA GCGCAAGCCG AGCAGTATCT GGGCGCAAAG CTACGGCTGT TGGCGGAGGA TACGAGCATT CCGCCTCCGA AGCGCTTGTG GCCGTCCCCT CCCGTGGCAA AGAAACCACG AAAGAAGAAA GGTGAGGAGA TGAAACCACA AACGTCGTCG CAATCGCCAC GCGGCTCCGA ACACAGCTAT TTTACCTCAA CAACACAACA CCGACAAGAT CAATTACGTT TTTATCCTGG ATACAGAAGT TTTGCGCAGC AATACTTATT CCGTCCTCAA ACGTCAAGAC ACTACCCACA AGGCCGACCA CATACTATAC GAAACTTTTC TGTACAAAAT GATCCTTCAC AGCAATACCA TCTCTATAAT CCGGTATCAG GAGGACGATG CGAACGACGT GGACGAACAA CTGGTTCCTT ACATGCAGAA CTAGGGGTCA TTCCAAAGCA GTGTCATTGC ACATACAACT CCAGGCTCTG CTCATCAATT CGGTCAACTC CGTCAATCTC ATCAAAATCA AGGACAACAC GTTCAATCGC CACCACTTTA GTATGGACAA AGCCAGCAGC CTTTCTTACT GCAGGTATAC CAATCCCTTC CAAGTGGCGT ACAGCCTTCG AACTGCTGTT AAACGTATGC CGGCGCCCGT GAGCAAGGAG ATAACCTGGC 
ATCTGGAATG CCCATTAATT GCAAAGCTCT GATTTTTCTG CTGGAATC
|
Protein sequence | MQRLRQLVHD YYRARRSNRQ PPWTTLPLAD RDARSMAATH LWDVACGLHQ DRNERQAYTR AINILHEHLS TLVQQRFPDA RLGVYGSCLS DLSLGKSSDV DLSLDFKRAR KVKDQFEIGK CPVQRYESEM KSLVYAVCRT MERRKHEFRA MQPVTRARVP VIKGTYLGAN NPYTVDGSID FDVCFLNDIA VVNSSLLREY SIVDDRVKAL MIAVKRWAKA FGICSSQHNT LSSYAWMNLV IFYLQNVGFV PNLQSPELAQ AAGISRDPRN EWHDVNNLDT FYLKWEDVSS VWQRAPAMES VSVTSLLYGF FRFYVVEYPS ILTVSIKLGR DTFLPKTVFR KSSLGFWCIE DPFETYDTHC PHDLAITAGV GGVREITYRL AQAEQYLGAK LRLLAEDTSI PPPKRLWPSP PVAKKPRKKK AILPQQHNTD KINYVFILDT EVLRSNTYSV LKRQDTTHKA DHILYETFLY KMILHSNTIS IIRYQEDDAN DVDEQLVPYM QN
|
| |
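The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the coding sequence includes one stop codon; neither convention is stated in the record. The function names `span_length` and `coding_bases` are hypothetical helpers introduced only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return codons * 3


# Values copied from the record: Start bp, End bp, Protein Length.
gene_length = span_length(131486, 133333)
cds_bases = coding_bases(502)

# Bases in the gene span not accounted for by the coding sequence.
# Under the assumptions above, this would be consistent with the
# "Is gene spliced | Yes" field (intronic sequence), but that reading
# is an inference, not something the record states.
non_coding = gene_length - cds_bases

print(gene_length, cds_bases, non_coding)  # 1848 1509 339
```

The computed span of 1848 bp matches the reported Gene Length, which suggests the Start bp and End bp fields are inclusive at both ends.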