Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20899 |
Symbol | |
ID | 7201858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 802767 |
End bp | 804647 |
Gene Length | 1881 bp |
Protein Length | 496 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180899 |
Protein GI | 219120316 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGTCT TTCATTTGAC TCTGAATTTG ATCAGCGCGT TCCTCTATTG CATGAATTAC TACATTGTTG AGCCATCTTC AACAATGTAC GTCAATCGAC TCGGTGCGCA CGACGCCATG TCAGGGACAC TTATTGGTAT GATGCCCCTG GCTGCCTTTG CTTCGAGTTT ACCCTATAGT ATTTGGACCA ATCGATCTTT TCGACAACCC TTCATCGCGA GTGGCGTCCT ACTAATTTGC GGCAACCTGC TGTACAGCAT GGCCGACCGG TTTCAACGTA TCGATATTGC GCTGGCTGGA CGATTTATTG CCGGTCTTGG TGCTCCAAAG TGCATCATTC GCAGGTATAT GGCTGATACG ACTCCACTCG CTCTGCGCAC ATCTGTCAAT GCAGGATTTG GTATGGTGGT TGCCGCGGGG TCCGCCATGG GACCGGCAAT GGCCGTTATG TTGAATCGCA TCGAATATAC GGTGGCCTTT CCCTATATTG GCGTCATTTC CTTGAACGGC TTAACTCTGC CTGGATACTT TATGGCATCA CTCTGGCTGA CCTTTACAGT CATTGTACTG TTGACTTTCG AGGAACCCGA TCGAGAAGGA TTGGAGGAAC AAAAGTTGCT GGAGAGTCAA GGGGATATTC TCGTTAGTCC GACAAATCGC TCGACCACCG ATAATTCCAC TTCGTATCGA AACCAGTACA ACGGTAGTAT CATAGGTCAC TATGGCAAGT ATTCTGAAAT GCACGACAGC GACGATCGAT CGTTCGAGAT TAAATCCCAG CAGCTGTCAC AAGATATTCC GCTTGGGTAC GAGTTACCCG AAGACTCGAC TTTTTGGCAC AGAATCCATT ACTTTTTTGC TCTCATCACA TGGCCGGTTC GCCTATGTCT GGGTCTGCTC TTTTGCAAAG TTTTTACAAT TGAAACCCTT GTTAGTGCAA CATCGGCGTT GTCCAAGAAT CGGTACGGGT GGCAAGTAAA CCAGGTTGGA ACACTAGGGT TCATCATTGG TTGTTTGGTC ATTCCATTTT CCATCTTGGT GGGAAGATTG TCTATGTCGC ATCAGGATCA CGTTTTGATG CTTTGGTTGG TTGGCACGGG GTGTTTGGGC ATGTTTCTCT TAATCGATCT TTCCGATTTG GTCGAAACGC AAGATCGGCA CTACAACGAA GGCCATCCAT TGGCTGTTGG TCCCAACCGA TACATTTGTG GCTACTTTTT GTCATATCTG TCAATACAAT CCTTCGAAGG AGTGATCGGC TCGACGTTAA GCAAAGTGAT TCCGACTGCA CTGGCCTCCG GAACAATAAA CTCGGGACTA TTGGCCACCA TGGTTGATAC ATTTGGTCGT GCCTGCGGAG ATCTCTTTAT CTCTGCTGTT GGCTTTGTTA ACCTACGCCA GCTCATGAAT TTATTGTTCA TACCTGGTTT TGCGATTATG CTGATCTGTT TCGTCGTCAT CGAACGATTC CGAGACTTAC TGTCAGTGTA AAGCTGCAAC CACGACGAAA ACACAATGAA CACCTGGTAA ATCACCAACT TTAAGGTCTT TTACCCAGAA AATGCATATC TGTTCAAAAA TTCAGATGGT GGTCAGACTT TCTGCTTCCG AACGAAACAT GCCCCACAGG ATAGATCCGA TGGCCTGCAG AGCCATGCCT GTTTCTTCCA CATCTTGGAT TCGGGAACGA CTCAAACCAT TTCGACTCAC CATGACGGTC GAAGTTGCCG TGACTAACGA CTGAGCTACA TGAATGCTGC GTTGTTATCA TTTGCAATCC ACTTACAATG AATGCAACAC CCTGACAGTA CATGATTATT GAGGAAATGA GATGTCTGCT TAGAAAATAC CGAAAATAAA TCTAGAGAAC AGTTAGAAAA T
|
Protein sequence | MQVFHLTLNL ISAFLYCMNY YIVEPSSTMY VNRLGAHDAM SGTLIGMMPL AAFASSLPYS IWTNRSFRQP FIASGVLLIC GNLLYSMADR FQRIDIALAG RFIAGLGAPK CIIRRYMADT TPLALRTSVN AGFGMVVAAG SAMGPAMAVM LNRIEYTVAF PYIGVISLNG LTLPGYFMAS LWLTFTVIVL LTFEEPDREG LEEQKLLESQ GDILVSPTNR STTDNSTSYR NQYNGSIIGH YGKYSEMHDS DDRSFEIKSQ QLSQDIPLGY ELPEDSTFWH RIHYFFALIT WPVRLCLGLL FCKVFTIETL VSATSALSKN RYGWQVNQVG TLGFIIGCLV IPFSILVGRL SMSHQDHVLM LWLVGTGCLG MFLLIDLSDL VETQDRHYNE GHPLAVGPNR YICGYFLSYL SIQSFEGVIG STLSKVIPTA LASGTINSGL LATMVDTFGR ACGDLFISAV GFVNLRQLMN LLFIPGFAIM LICFVVIERF RDLLSV
|
| |