Gene Information |
Locus tag | PHATRDRAFT_50096 |
Symbol | |
ID | 7198886 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | - |
Start bp | 4334 |
End bp | 6473 |
Gene Length | 2140 bp |
Protein Length | 640 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185027 |
Protein GI | 219129714 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAACACTGAA CATCCGTAAT CGTCAAGCCC ACCCCTCAAT CTTCAAGAGT AGTGTTTGGG CAAACCATGC CTACCTGTAC TGAAGATCCT TCGAAAGAGC CCGAGCCGAT AAAGCTGTCT GCATTGTCAA ACGTGTCTGC AGGCCGCCAG GTTCCGCTCG TCGAGCTGTC CCTCCAAAAT GTTACGTACG CCCCCCTCAC CACCAAAGCG TCTAGTAATG GTAAAACGTC CGGTAGGAAG CGGACCACCG TCCTCCAAAA CATTACTACA ACTATTTCTC CGTACAAACT TACGGCTTGG ATGGGACCGT CGGGATCCGG GAAAACGAGT CTCATTTCGA TCGCCGCTGA TCTGACTAAA TCCGGCGATC TCGTAGAAGG AAGCATCATA ACAGTGAACG GAGAAGAAGG AAAAATACCC AAGCGGATTG TTGGTGTCGT ATGGCAGGAT GACCTTCTAC TGACAAATCT AACAGTGGAG GAGAATATTT ATTACTCTGC GCGACTAAAG ACACCTGAAT CTTGTTCTGA CGAACAAGTG AGTGCCGTCG TATTGGAGAC CATGATAGAG CTCGGTTTGA CGGACATCCG AAATAGCGTA GTGGGAAGTC CACTAGGCGT AGTGCGTGGC GTTAGTGGAG GTGAACGCAA ACGTGTTTCC GTAGCGTCAG AACTTGTGGT AAGGCCATCC TTGTTATTGT TGGACGAACC CACATCCGGA TTGGATGCAA CAACGGCCCA AGCATTGATA GGAACCCTCA AGGTTCTGTC CAATATGGGC CACTCAATTG CAGTCGTGAT TCATCAACCC AGAACTACTA TTTACAACAT GTTCGACCAC TTGTTGTTGC TTAGCCAAGG GTCAACCGTT TTCAACGGCA ATCCTTCGAA AGCACGAGAA TACTTGGAAT CTTGCCCCAT TGTTGGTGAA TTACCACCAG AGACAGGCTT GGCCGATTGG ATAATGGACG TGATAAAAGA GGACGAGCAA CGGCGAGAAG CTGCAATGCT TGCGAGTCAC TGGGCAGAAT ATATAAACAA GGAGAGCGTT CATAGTATGA TCTCGGACAG CCTTCGTAAG ACCCTGAGTC GCAGTATGAG CAGTCTGCAC GAGCTTCATG CCGTTCCAAA GTTCAACAAC AGCTTCCGCA CACAGCTTAA GCTCTTGACA TCTCGCACCA TGAAACAGCA ACGCGGCGAA CGATTGACAA TGACTGCTGT GATTTTACAA CTTCTGTATT TGTTCTTTTC AGCTGTATTG TGGTATGTGG CTGGTATGGG AGTTGGATTG CTATCACGGC AAATGAACGT CTCATTCTCT GTGAAATTTT CAGGTGGCGC TTGCCTAATA ATACAGCACG GACGTTTGAG CGCAACTCCT TACTATTCTT CATGATTATT GCGCAAGCAA ATGGAATTGT TATTTCAGCA GTGACAGTCT TCCAGCGAGA ACGAGCTCTG CTGAAACGAG AACGCGCGAA GAAAATGTAT GGCGTGTCAA GCTATTTTTT GGGAAAGACA GCTTCCGATA TGACCAACAA TGTTTTGCTT CCTGTTCTTT ATGGACTTGT GGTATATTGG ACCGCCGGAT TTCGACCTAC GTTTGAGGCA TATCTCAAAT TCTTTGTTGC TTTCTATTTG ACCTTGTCAA CTGCTCAAAG TATGGGGTTA TGGATGAGCA TAGCAATTCC CAATATGCAA GTTGCGCTTG TTTTGGCACC TCCAATTACG CTCTTTTTCA TGATAATGGT AATTACTGCA TCTGTCAGCG CATGTCCCCC GCACTATATT GAAGTTTTAT 
CTAACTGGGA TCTGCTCTTA CTTTTTAGGG CGGTTTCTAT ATCCCACTGC AAAACATGAA CCGTGGAATA GCATGGGCTA GCTGGATATC TTTTGCACGG TATGGCTATA GTGCCTTGAT CATTAATGAA TACGCGGGTC GAGACATCCC ATGCTTGGAT GATGGTGAAG CTTCTATTGC AATTGGTACA GGGGTGTGTC CGCTACCTGG TGAAGAAGTC ATAGCGAGTT TAGGAATCAC TGGTGTTGCT GAAAGCTATT GGTTCAACAT TGGTATGACC GTTGGTTTGC AGGTGATGTT CCGCGTTGCT GCATATATTT TTCTCAGGCG CGCAGAATAG
|
Protein sequence | MPTCTEDPSK EPEPIKLSAL SNVSAGRQVP LVELSLQNVT YAPLTTKASS NGKTSGRKRT TVLQNITTTI SPYKLTAWMG PSGSGKTSLI SIAADLTKSG DLVEGSIITV NGEEGKIPKR IVGVVWQDDL LLTNLTVEEN IYYSARLKTP ESCSDEQVSA VVLETMIELG LTDIRNSVVG SPLGVVRGVS GGERKRVSVA SELVVRPSLL LLDEPTSGLD ATTAQALIGT LKVLSNMGHS IAVVIHQPRT TIYNMFDHLL LLSQGSTVFN GNPSKAREYL ESCPIVGELP PETGLADWIM DVIKEDEQRR EAAMLASHWA EYINKESVHS MISDSLRKTL SRSMSSLHEL HAVPKFNNSF RTQLKLLTSR TMKQQRGERL TMTAVILQLL YLFFSAVLWW RLPNNTARTF ERNSLLFFMI IAQANGIVIS AVTVFQRERA LLKRERAKKM YGVSSYFLGK TASDMTNNVL LPVLYGLVVY WTAGFRPTFE AYLKFFVAFY LTLSTAQSMG LWMSIAIPNM QVALVLAPPI TLFFMIMGGF YIPLQNMNRG IAWASWISFA RYGYSALIIN EYAGRDIPCL DDGEASIAIG TGVCPLPGEE VIASLGITGV AESYWFNIGM TVGLQVMFRV AAYIFLRRAE
|
| |
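The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the coding region ends in a single stop codon; the record does not state either convention. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table in this record.
START_BP = 4334
END_BP = 6473
PROTEIN_LENGTH_AA = 640


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (assumes one stop codon)."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


length = gene_length(START_BP, END_BP)
coding = coding_length(PROTEIN_LENGTH_AA)
noncoding = length - coding

print(f"gene length:       {length} bp")
print(f"coding length:     {coding} bp")
print(f"non-coding length: {noncoding} bp")
```

Under these assumptions the computed gene length matches the reported 2140 bp. The roughly 217 bp not accounted for by the 640 aa protein is consistent with the record marking the gene as spliced, although the record does not say how that remainder divides between introns and flanking untranslated sequence.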