Gene Information |
Locus tag | PHATRDRAFT_47250 |
Symbol | |
ID | 7202303 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 81550 |
End bp | 83585 |
Gene Length | 2036 bp |
Protein Length | 574 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181477 |
Protein GI | 219122283 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
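The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene span runs from the start codon through the stop codon. Both are assumptions about this record, although the gene sequence does begin with ATG and end with TGA. The function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


# Values taken from the record: Start bp, End bp, Protein Length.
gene_len = span_length(81550, 83585)
cds_len = coding_length(574)

# Bases inside the gene span that are not translated (assumed to be introns).
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)  # 2036 1725 311
```

Under these assumptions the span matches the reported gene length of 2036 bp, and the 311 untranslated bases are consistent with the record flagging the gene as spliced.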
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.110596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATT GGAGGGACGA TCGACGAACG ACACCCAACA AACGCAAATT ACGGTACGGT ACGGAAAAAC CAATCTCTAG CTAGCTAGCT AGCTCTAGTA GTAGTAGGGG AATATGCGAG GTGAGTGTCA AAGTGTCAAG CCTGGTTTTT GGCAAATAGG TACATGTGCG AGTGCCACCC TCGGCCCCCA CTGAATAGTA CGTTCGTCCA GGGTTCCGCT TTCGCAGTAA ATGGCCACCT GCTGCTCACC TTCCAATGGA CAACTTACTT ACATACTTAC AGATACTCTT ATTATCCGGA GCAAAGAATA GCAGCGACGC ACTCCGCACA CGACCTTTCC ATCAACAACG TGGCGATGAC TCGTACCCTC CGGAGAGGAA ATCAGCTTTT CTCCGTAGCC GCGGTGGTCT GGGCCTACGC GTACGTCAAC TGGAATAGTA ATCCCTACAA GTTGTTGAGG ACGGCCGAGG CGTTGGCGCC TCCCATCAGC TACAGACGGA CAGTGTCACC GCTGGGCTAC ACGTATCGGA CGGATGCGTC TCGACGAATA CAGTTCTTTC CGTTGACTGC GACTAGAAGC ACCGGTTCCG CCGTCGACAC CGTCAAAAAT GGCGACCAAG ACTCTTCGAT TAGCGTACCG ACCGGCAAAC GTTTCGTGGC GCGTCAACTA GCCTCGCTCC CCCAACGCGC CGTGCGCATC TATTCCGAAT ACGTCAGTCG ACTATGGAGA GAAACCAATC CCGAAGCTCG TCAAATTATA TCCCAAGACA AGGCCGCTAC CGCTATTCGT CGGGTGCAGC ATCTGATCCG AGGCGAGCAG CTGGCGGGGG TCGTCTCGTG GGAAGAAAAG AACGGATTGG CACTTGCGTG TGACCACGTT TTGGAAGCCA TTGAGCAGGC CCATCGAGAG ACGGCGCGCA ACATTTCCGC CCCCGTCACG AATGGAATGG ACATGTCCAT CGAGTCAAAG CCCACAAGCA AAACAGCACC ACCTCCTACG TTGCAGAAAA AAAGCCGCTC GGTTTTGTTC GGTGCCATCA TGGGTGCCGT AGTAGCGTGC TGGGTCTTTT CGGGAAACTA CATTTTCACC GGCTTATTCA CGCTCATGAC CTTGCTTGGT CAATTGGAGT ATTACCGCAT GGTCATGGGG ACCGGAGTCA ATCCGGCACG GCGCATATCG GTCTTGGGGT CCTGCTCCAT GTTTTTGACG GCCCTATTTA CACCGAGTCT CCACGAAATT TGTTTGCCCA TATTCGGTCT CTACGCCATG ATTTGGTTCT TGACCATGAA ACGGACCGTC ACTACAATTC CGGAAATTGC CACGACTTTT ACCGGTATGT TTTATTTGGG TTACGTTCCT TCTTTCTGGG TCCGTATACG GATACTAGGT ACCCAAGAAC CAACGCGATT GGCTTCGGTC GCCGAGCCCT TCTTGCGCTT TCTCGCGGAC AAGTCGCAGG CTAAACTCGT GCCCAGCTTT ATCCCACAAG CTGTGGTGCT TCCCATCACC ACCGGGTCTA TCTTTATCTT TTGGACTTGG CTGTGCCTCG CCTTTAGTGA CGTCGGAGCT TATTTTGTCG GGCGACGGTA TGGGAATACC AAACTGGGTG CAGTCGCGCC CGCCGCGGGA GCAACCAGCC CCAACAAGAC TGTCGAAGGG GTACTGGGAG GGTGTGCTGT CAGTGGTCTA TTGGGTGTAT TCGGTATGTC GATGCTTTAG GAACTTTCGG CTCGGACAGA GAATACGCTG CTAACCCATC GTTTTCGTTT CTTTCTGTTA CGTAGGAGCT TGGGCACAAA 
AGTGGCCGTA TTGGGGCGTC ACCGGGGCCG TACACGGAAT ATTATTGGGT CTCATTGGTC TCATTGGAGA TCTGACGGCT TCGATGATCA AACGCGATGC CGGCGTCAAA GATTTCGGCG ACTTGATTCC GGATCACGGT GGCATTCTGG ACCGCGTGGA TAGTTTCATT TGGTCGGCAC CCTACTCGTG GCTCGTCATC AACTCTGTCA TACCGTTTTT GAAAAGCGTC GCTTGA
|
Protein sequence | MKYWRDDRRT TPNKRKLRYS YYPEQRIAAT HSAHDLSINN VAMTRTLRRG NQLFSVAAVV WAYAYVNWNS NPYKLLRTAE ALAPPISYRR TVSPLGYTYR TDASRRIQFF PLTATRSTGS AVDTVKNGDQ DSSISVPTGK RFVARQLASL PQRAVRIYSE YVSRLWRETN PEARQIISQD KAATAIRRVQ HLIRGEQLAG VVSWEEKNGL ALACDHVLEA IEQAHRETAR NISAPVTNGM DMSIESKPTS KTAPPPTLQK KSRSVLFGAI MGAVVACWVF SGNYIFTGLF TLMTLLGQLE YYRMVMGTGV NPARRISVLG SCSMFLTALF TPSLHEICLP IFGLYAMIWF LTMKRTVTTI PEIATTFTGM FYLGYVPSFW VRIRILGTQE PTRLASVAEP FLRFLADKSQ AKLVPSFIPQ AVVLPITTGS IFIFWTWLCL AFSDVGAYFV GRRYGNTKLG AVAPAAGATS PNKTVEGVLG GCAVSGLLGV FGAWAQKWPY WGVTGAVHGI LLGLIGLIGD LTASMIKRDA GVKDFGDLIP DHGGILDRVD SFIWSAPYSW LVINSVIPFL KSVA
|
| |
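The sequences in this record are displayed in space-separated blocks of ten characters, so the grouping has to be stripped before a length or GC content is computed. The following is a minimal sketch of that clean-up; the helper names are hypothetical, and it is not the method the database itself uses to derive the GC content field.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence in this record.
sample = clean_sequence("ATGAAATATT GGAGGGACGA")
print(len(sample), gc_fraction(sample))  # 20 0.4
```

Applied to the full gene sequence, the same two helpers give a length and GC fraction that can be compared against the Gene Length and GC content fields of the record.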