Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39421 |
Symbol | |
ID | 7195144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 480655 |
End bp | 482604 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183492 |
Protein GI | 219126496 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCTCT CGGTACACGA CAGATTCACA CGGCGGGCCG GTCCGACGAC CATTCCTTCC GGAACAACAA CCAGTCGGCG ACACCGCTCT TCCTTGTTCG GTCGATTCCG GACGGGCAGC ATCAGTGACG ACAACGACAA TACGGTGCCG GACGAGTACA CGGATGTGGA AAGTCGCAGT GACGACGACG AGAACGACCC CGAAAATCCC TCGGCACGCG CCCAACCGCG TCGTCTCCCC TTATTCGTCG TCGAAACTAT CGCACTCGAC GACCAAACCG TCACCGTACG CCAACTGCAG CAGTCGGCGT CGGCCGCGAA GGAAATGAAC GATTCCTACG CCACATCCAC ACCCACCTTG CGCTTCCGGA GTTCCGCCTT TTATCTAGCC GAAGCCGCAC TCGTCGGAGT TGCGACCGGC ATATCCGTTG CCGTCTTTAA ATTGTCCATC GAGTTCATCC GCGAAGTGTG CTACAACCAG AGCTTTCTCT ACTTGCCCGC GGTGCGGTCG GTGGTACCCG CTGTGGGTGG CGTCGCCGTC GGTATCCTCT ACTGGGCCGG ACGAGGAGCC TTTCCACCCG GACTCCGCGG TACCGTACAA CTCGTCGATC AACAGGATCG TCGCGGCCTC ACGGTGGACA CGACGCAAAA AGTCAAAACA CAGATTGACT TTTTGCGCAA ATCCACCGCC GCCGTCTTTA CCCTCGGTAC CGGATGCAGT CTCGGTCCGG AAGGACCCTG CGTCGAAATC GGGATGAACG TCGCCCGCGG CTGTATGGAC GTCAAACCGG AATTCCTCTC GCGTCCACGG CCGCACTGGA ACGTTTGTTT GCTCAATTGT GGGGCCGCTG CCGGAGTCGC CGCGGGTTTC AACGCGCCCT TGGCCGGCGT CTTCTTCACC CTCGAAGTCA TGCAGAGCGC ACTGAACGGG GTCCGCCAAG AAGAACAGGA AAAGCAAACA CTCCAGGGGA ACAATAGTGA CCTCAACGCG GCTAGCAACG CCGTATCCGC GACGGAAAAC ATTACTCCCA TTCTGTTGGC GTCCGTCTTG TCCGCGCTCG TAGCCCGGAC CATACTAGGC GACACGCTCG TCTTGAGCTT GTCGGAATAT TCCCTGGAAA CACCACTCAT CGAATTGCCC CTCTACCTGT TGCTCGGCGT GATTTCGGGC TTTGTGGCCT TCTCTTTCAG CAAGGCCGCC AACTGGAGCC AGGCTTTCTT CTCGGGAGAG GTTGGTGGCG AATCGATCCA GAGCTTCATG AGTTGCCTAC CGGAACCGGT CAAACCAGTC ATTGGGGGTT TCGCGTGCGG CTTGGTTGGT CTGGTATTTC CCCAGATTCT GTTCTTTGGC TACGAAACGC TCAACTCCCT CTTGGCCAAC GCTTCTTTAC CAACGTCACT ACTTTTTTCC TTGCTTATCG TAAAGACCAT CATGACCGCC GTATCGGCCG GATCCGGGTT GGTTGGTGGC ACCTTTGCAC CTTCACTGTT TCTCGGTGCC ATGGTCGGTG CGGCCTTTCA CAACGTCGCA ACCATCGTCT TCCAAACGCT CATGACGTCG TTTCCTTGGG AAAGTGTGGG AGTGCTGTCC AGTACCGCGG CTCCGGTACT AGTCCTCGCG GACGTTCCCG CCTACGCCAT GGTCGGTGCC GCCTCAGTTT TGGCCGCCCT TTTTCGGGCT CCGCTCACGG CCAGCCTTTT ACTCTTTGAA TTGACCCGGG ACTATGACGT CATCTTGCCG CTCATGGCTA GTGCCGGTGT GGGTAGTCTG GTGGGGGACA TTCTGGAAGA CAAGGTACAA AACGCGCGGA GAACACCGCC ACCGAACGTT TCGGAGGCTC CTTCGCCAGC GACTCTTCCG GTGCCCCCAC GAAGGCGGGA CAAAGATTCC GTCTCCTGGG GAGATTTGGC CGACAAGAAG AAAAGCTCGA CCACTAGATC GGTCAAATAG
|
Protein sequence | MPLSVHDRFT RRAGPTTIPS GTTTSRRHRS SLFGRFRTGS ISDDNDNTVP DEYTDVESRS DDDENDPENP SARAQPRRLP LFVVETIALD DQTVTVRQLQ QSASAAKEMN DSYATSTPTL RFRSSAFYLA EAALVGVATG ISVAVFKLSI EFIREVCYNQ SFLYLPAVRS VVPAVGGVAV GILYWAGRGA FPPGLRGTVQ LVDQQDRRGL TVDTTQKVKT QIDFLRKSTA AVFTLGTGCS LGPEGPCVEI GMNVARGCMD VKPEFLSRPR PHWNVCLLNC GAAAGVAAGF NAPLAGVFFT LEVMQSALNG VRQEEQEKQT LQGNNSDLNA ASNAVSATEN ITPILLASVL SALVARTILG DTLVLSLSEY SLETPLIELP LYLLLGVISG FVAFSFSKAA NWSQAFFSGE VGGESIQSFM SCLPEPVKPV IGGFACGLVG LVFPQILFFG YETLNSLLAN ASLPTSLLFS LLIVKTIMTA VSAGSGLVGG TFAPSLFLGA MVGAAFHNVA TIVFQTLMTS FPWESVGVLS STAAPVLVLA DVPAYAMVGA ASVLAALFRA PLTASLLLFE LTRDYDVILP LMASAGVGSL VGDILEDKVQ NARRTPPPNV SEAPSPATLP VPPRRRDKDS VSWGDLADKK KSSTTRSVK
|
| |