Gene Information |
Locus tag | PHATRDRAFT_45200 |
Symbol | |
ID | 7200089 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 465049 |
End bp | 466776 |
Gene Length | 1728 bp |
Protein Length | 547 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179436 |
Protein GI | 219117283 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGTCTA AGTCAGGTCG CCGAGGTACT ATAGAAGGTT TGGTCCAGGA TCTCATATCC ACTGCCTTGT TTGCCAGCTC TTTCTTGATC GCGAGAAGAA ACGTGGTTTC TGAAAAGCGT GACAATGATG GGGCAGCTCG AGAACTTTAC GATATCTTCT TAAATGAAGA GGCCATCGCT GAGCATGCTC GGCAGCTATA TAGTCGACGT CGTCGGCAAG AGCGTCTCTC ATTTCATGAC GACGTACAAA ATTTAGTTCT TAATCAAGAG GACGAGCTCC ATAGTGAGTT GTCTCCGTAC AATGATGCAG TCACAGGTGA CTTCTACCCA AAAAATAGTA ACTGGAACCA CTTCGAGTTC TACAACGAGA TTCATCGCGA TGATGGGCAC AGCTCAGGCC AGCGCAATCC AGTTCCTAAA ACTGTTGACT CACTAGCTGG TGATTACCAG CCTGGTTTTT TTGATGATGA CTCTGATGCT GCATCTTTGA CATCTCAAGA TCATTTTGTC TGGACTGAAG CTCGATATTC ATTCCGACCG CGCAACGTGA CCAACCTCGC TACGATACCA TCGGAACAAC AGATGGCAAT CGATCCTGAC GGTGAGCAAT TTTCTCGGCA AGGAAATCCT TCATTGCTTC CTGGAATACC TCCCAATCGC GATGTGCCCT GTCGGGCGGT TAGTCTGGAC GACCGAACTA TATATAGTTT ACCTCTACCT AACTCTGGTT GTGAAAACCC GCTGTCTCTG CGGCGTAGTC TGTCTATTCC GGAGCTTACA GCTACAAAGC CCACTTCAAA AAGCTACCAG AACAACCTGT TGAATCAAGT CCGGACGCAA AACCGGAATG CTCGGGCAAG CTACAATGCG CGGATCATGC CGGAGAAGCT GGTTATGGTT CGCCATGGAC AAAGCATGGG AAATGTCAAT GAGGCTCTGT ACAGTTCAAC ACCAGACAAC GCCATGCCTT TAACAAAATT GGGATGGGAA CAAGCAAGAA AAGCTGGTAA ACTATTGAAG GATGAGGTGC TTCGGTCTTC AACAAGTGTA CACTTTATCG TCTCACCGTA TGTTCGAACG GTAGAAACGT TTCATGGAAT TGTTGCTGCA TGGTGTGACC CTTCAAACTT CAACCACATA ACTGATCGAG ACAAACGATT AAACGCCTGG TATGGTAGAT TAATCGAAAT GGGGCTCACA TGGAACGAGG ATCCAAGAAT TCGAGAACAG GATTTTGGTA ACTTTCAAGA CCCTGAAAGG ATAAAACAAG CCAAAAAGGA CCGACACTTC TTTGGAGCTT TCTACTACCG CTTTCCCCAC GGAGAATCAG CTTCGGATGT CTTCGATCGA ACCAGCACCT TCCTCGATTC GCTGTGGCGA TCTTTCGACA TGAATAAGAA CCGGAACTAC GTGATTGTAA CCCATGGTAT ATCGATAAGA GTTTTGTTAG CGAGGTACTT TCGGTACACA ATTGAACAGT TCCACTTGTT GTCTAATCCT CGGAACTGTG AAATGGTGAC ACTTGAACAT GATGGAGGTG GCCGTCTGCA AATGGCTGGC CGCTACGAAA TGGATTGCCG GTCAGATGAC GATACAGGCG ATACCCACGT AGTTGGATAC AAGTTTTACC AAAGATTAAG AGTACTGCCA CCGGATTGCA TAAGAAAAGT CCAAATACGA ATTCAATATG AGGATTCCCC CGGTGAAGAA GCCATGCGTG ATTGTTGA
|
Protein sequence | MESKSGRRGT IEGLVQDLIS TALFASSFLI ARRNVVSEKR DNDGAARELY DIFLNEEAIA EHARQLYSRR RRQERLSFHD DVQNLVLNQE DELHSELSPY NDAVTGDFYP KNSNWNHFEF YNEIHRDDGH SSGQRNPVPK TVDSLAGDYQ PGFFDDDSDA ASLTSQDHFV WTEARYSFRP RNVTNLATIP SEQQMAIDPD GEQFSRQGNP SLLPGIPPNR DVPCRAVSLD DRTIYSLPLP NSGCENPLSL RRSLSIPELT ATKPTSKSYQ NNLLNQVRTQ NRNARASYNA RIMPEKLVMV RHGQSMGNVN EALYSSTPDN AMPLTKLGWE QARKAGKLLK DEVLRSSTSV HFIVSPYVRT VETFHGIVAA WCDPSNFNHI TDRDKRLNAW YGRLIEMGLT WNEDPRIREQ DFGNFQDPER IKQAKKDRHF FGAFYYRFPH GESASDVFDR TSTFLDSLWR SFDMNKNRNY VIVTHGISIR VLLARYFRYT IEQFHLLSNP RNCEMVTLEH DGGGRLQMAG RYEMDCRSDD DTGDTHDSPG EEAMRDC
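
The gene length (1728 bp) is more than three times the protein length (547 aa), which is expected because the record marks the gene as spliced. The sketch below checks how the reported fields relate to each other. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene sequence shown is the genomic span from the start codon through the stop codon, introns included. Both are assumptions made for illustration and are not stated in the record. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 465049
end_bp = 466776
gene_length_bp = 1728
protein_length_aa = 547

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
span_bp = end_bp - start_bp + 1

# Coding bases needed for the protein plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Assumption: the span runs from start codon to stop codon, so any
# remaining bases are non-coding (intron) bases of the spliced gene.
noncoding_bp = span_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)  # prints: 1728 1644 84
```

Under these assumptions the span matches the reported gene length, and 84 bp of the span are left over as non-coding sequence.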