Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50523 |
Symbol | |
ID | 7199297 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 309435 |
End bp | 311802 |
Gene Length | 2368 bp |
Protein Length | 366 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185467 |
Protein GI | 219130636 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGTGA TTGCTGCTGC GGTTGTGGGG GTGGTCTCGC ACCGTGTGGG CGGCGTGTAT GTTCCGTGAG CGGCACGTTG TCATTAGTCG ATTCCGTCGG ACCAAACAAC AAGACGGAAC TGGAGTCCAT CCCCTCAGCG TGGGGTAGCA ACAACGCCGG ACGTGCCGTA AAAATTGTCC CGGACTCTCC CGCTGTACCA ACTTGGTTGC CCTTGTTGGT CGTTCGCAGA CTGCGAATAC GACGATCTCG CATTCAATAC CAATTGAGGG GACTTGACCA CCGGTTCAAG GAACACTCCC GCAAGGAACG GTACTTGTTA TGCTACTCAA ACAGTTATAT CAGGTTAGTC ATGCGAGGGC GAGGTGAAGT CTTCTTTCGA GAGTCTGCCA TGAGGAAAAT AGGTTTTGGG CAGTTTGGGG TCAAAAATGT AACCCGTGAC TGTGAGTCGC GGGATATACC TTGATAATTC AGAGGCATTC ACATTTTTCG TGGGACTGAA GTTACCGTAT CTGTATATCC CCATTATTTC AGCACAATGA AGCAACATTC GCTTGAACGA ACAACAGCGT GGTTGACGAC TTGGAGTCTC ATTCTGCTCG TGGCCGTTGC TCAAGAAGAT GTTCACCCTG ACGTAGTTGC CGTTGTTGAT GTTGCTGGAG GTGTACGACC GACACCACTT TGGGCAAGCA GTTATTCGGA CGGCGAGAAT TGCTACTGCC TTCCTTCGTT GGATAGCGCC ATTGGGAACT TTGTCGTAGA AACGCCATTA GGGTGGCTGA CGACGCAGGA AGTGTGCGAT CTGCTAGGAA CAGGACCAGG AAGACTAGGA CAACCCCTTT ACAATGATAT CCAATGCGGT AACGGCCCCC CAAACGCTGA TGAAAACGAA TTCCTTTGTC CGGGACGAAC CGATGTAAGT GATTGCGAGG AAAAAGCCGA GCGCTGTCCA CTAGTAGAAA CAAATATATC TCAACGGTAA GCTTTGTACT GATTTGCTCT TCCTCAGATT GGGGAAACAG GTTGTGGTCA AATAGGACCC AAATGGAATT TTGATAATGC AAACCTTGCG GACGGCCCCC CACGACTTCC ATCGTTGCCT GAGGACATTC ATCCCGATAT CGTTGCGGTG ATCGACGTTG TGGGTGGTGT GACGCCGAAT GGAAGATCGT GGGCCGACAG CTATTCCTTT GGCAACAAGT GCTATTGTGC GACAACGTTT GATCACGACA TTGCGGACGT GCTAGTCGAA ACACCGCAGG GATGGATGAC GATCCGTCAA GCTTGCGAGT TACTTGGACC GGGTCCCGGT ATTGAAGGAC GACCGGTGTA CAATGACATA CAGTGCGGGA ATGGACCACC TAATAATGCA GGCGATGAGC ACGTGTGTCC TGGACGAACC GATGTACGTC GACTGGAACG CGGTAGACTG GCTAGTATTT TCAGACCGTT GTTTTCTCGG ACTGTTTATA ATGAAATGCC TCACCATGAT GCTTCTCCGT TGTACAATTC ACAGCTTGGA CCAGAAGGTT GTGGTCAGAT TGGTCCCCGT TGGAATTTTG ATGCCATCAA GTCATTACCG CCAGGCAGCG CCCCCGCAGC TCTGCCCTCT TCTTTAGCAG CCGGAGCAGT GTCTGTCCCA ATGCTACCGG GCTTAGGAGT AATCACCGGA TATTTGTTCT GTGTTTTGAA CTGGCAACTT TTTGATCTCG TTCCGTGACA CCAAGACCTC TGGATAACCA CTGCAGGCTG CCCGGAAGAA AGCCAGGACG TGTTACTTTC TTTCCAGGTC GCTACGAAAA TACTCTGGGG CGAGCCCTGA CTCGCAAAGT GTTTACCGCG CTGAATACGG TTCGCAACGT GCCCCACTAC CAAGCTTTGT GCACTTATAA ATCCGCTTGT TGCTGACCAG GGGCAATATT GCCGGTGATG TTTGGTTTGT CGGTTCCTTG GTGGATCATC TGACTTGTGA CTTGACAAGA GAATCCGGAG TGCAACAGCA AGCCAAGCGA GCAACCATGT ACCCACAAGC AATTGGTCTG ATTCCAGTTT CTCTACTGGT GAAGCCCATG TTGTGTGGGC AGTTCTTGTT TTGGCGTATG TCTTTTGAAT GGCGCCCTTG TGTCGTGCCG CCTGGACACC TCTCCACCCG TTGACGGCGT ACGGCCCAAT GCCGTGTGCA TACGCGAGGA GCCGGACCAT GTGCATTATC ATTGCCATCA ACAACAATGG CAATTTGACC ATTTTGGTAC TAGTGGAAGA CAATGGACGC AGTGGTATCT TATTTGAGTG ACGCAGCAAA GTCCATCAAC ATTTTTGTTG ATCAGTGCCG CTATCTTTGC CAACAATGCC GCATTTTTAA CTTCATAC
|
Protein sequence | MIVIAAAVVG VVSHRVGGVL VMRGRGEVFF RESAMRKIGF GQFGVKNVTR DSWLTTWSLI LLVAVAQEDV HPDVVAVVDV AGGVRPTPLW ASSYSDGENC YCLPSLDSAI GNFVVETPLG WLTTQEVCDL LGTGPGRLGQ PLYNDIQCGN GPPNADENEF LCPGRTDIGE TGCGQIGPKW NFDNANLADG PPRLPSLPED IHPDIVAVID VVGGVTPNGR SWADSYSFGN KCYCATTFDH DIADVLVETP QGWMTIRQAC ELLGPGPGIE GRPVYNDIQC GNGPPNNAGD EHVCPGRTDL GPEGCGQIGP RWNFDAIKSL PPGSAPAALP SSLAAGAVSV PMLPGLGVIT GYLFCVLNWQ LFDLVP
|
| |