Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46234 |
Symbol | |
ID | 7201194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 680490 |
End bp | 682199 |
Gene Length | 1710 bp |
Protein Length | 535 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180688 |
Protein GI | 219119874 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGATGGCAAT GTACGATTGC GCCTACACCT TACTGTAAAG ATCTTGGTTT GTTTTTGTTC TGTCCCAGGA TTTACTATTA GAGCCGCATA TAGGTTATTG CTATGCGAAA GCGAAGCCGT TTTTCGACCC AAGTCCAGTC TCATGAGTTT GCTGCACTCG AGTCTCGCCA TCCCTATGGA GCTTTGCCAG GCGGAAATCG CTTTCTCCAA CATGTTGTCA GTGACAAATC GACCAAAAAC ATTGGTCCTT CTGATTTGTT GACGGACACG TGCTGGGATA ATGTTTTGGG ATTTTGCGAT GGTGCCGAGC TCGGGAAGGT AGTTCAAACT TGTCGCTATT TGTATGTCGC GGGGTACCAG CCTGAGTTGT GGAGAGATCT TGTCCTTCGA AAATTGGGTA CAAACCGGTT ACAAGAATTT CGTTCAAGTT GGAGGGACAC TTTTGTTGCT CTGTATTGCC CTTCCGCAAA AGATTCTAGT CACGTACCGA TGAGAATGCC GGGCATCTAT TCAGACGTGT TCTACAGGCT GCATTCGTGT AGGGCGTTTG CGTTACCCCT TGCATGGATG GATGCTAATT ACGGAACCGT TCCTCGTATC TCAATTGAAG ACATGACCTC AAAGGTATTC ACCAACAATT ATGAGGAACC CAACCAGCCA GTTTTGATTA CTAAGGCGGC CAAGAGTTGG AAGGCCTTCG ATAAATGGCA GGATTTGGGC TATCTTCTGA ACGAAACGAA AGGGAGCTCG TTTCGCGCCA CTAGTGGATT GGCCCCGCTT CCTGTGGACT TTAGCTTAAA GGCTTATTTG GATTACGCGA CGTTGGAGAA CCTCGAAGAG GCGCCACTCT ATCTCTTTGA TCGGACTGCC CTTCAGCCGG GCTCGCACCT TTGGAACGAC TACATGGCGG ACCTCCGAGT GACATGCCCT TGGTGGGACC CGAAGTCGAA CGAAAATGAA CACGACCTCT TCAAGGTTCT CGGTGAAGGT CAACGGCCGG ATCACACTTG GTTGATCATT GGTCCGCGAC GTAGCGGGTC TGTCTTTCAC ATTGATCCAA ACGGGACGCA TGCTTGGAAT GCAGCCATTG TCGGCCGGAA GCGATGGATT TTTTATCCGC CTGGGGCGAC TCCACCAGGA GTGTATCCTT CAGAAGATGG GGACGAAGTG GCATTACCAC TGTCTCTCGG AGAATGGCTT TTTCAATTCT GGGATGAACA TGTTGAACGA ATGCAATCGG CGCCGCCGCA CGAACGTCCA CTGGAATGTA CTGCAATGCC GGGAGACGTA ATGTTTGTTC CCCACGGATG GTGGCATGCA GTCATCAATC TTGACAAAAT TAATGTGGCA ATTACACACA ACTACGTGTC CGGGAGTAAT CTTGGGAACG TGCTCAGATT TTTGAGTAAA AAGGAAAATC AGATTAGTGG TTGCCGTGAC AGGTTGGAAA GTATTAAACC GGATCGGCTG TACCACGAAT TTGTCACTAG CCTCGATAAG TATCGGCATG ACCTTCTGCA GCGAGGTTTA TCCCAAAAAG ATTGGACTTG TCAAGCTTGG CGGAATACAA CCACCAATGA GATCAAACGA AGCGGTCGAA AGCGTCGTAA AGCAGAAGTC GCCAAATCGG AACGAAATAT AACGGAGAAG GCGAGCCTTA TGGCCAGAGC CAAAACAAAG GAGCCTTCTT TCAGCTTTTC CTTTCTTTAG
|
Protein sequence | MRKRSRFSTQ VQSHEFAALE SRHPYGALPG GNRFLQHVVS DKSTKNIGPS DLLTDTCWDN VLGFCDGAEL GKVVQTCRYL YVAGYQPELW RDLVLRKLGT NRLQEFRSSW RDTFVALYCP SAKDSSHVPM RMPGIYSDVF YRLHSCRAFA LPLAWMDANY GTVPRISIED MTSKVFTNNY EEPNQPVLIT KAAKSWKAFD KWQDLGYLLN ETKGSSFRAT SGLAPLPVDF SLKAYLDYAT LENLEEAPLY LFDRTALQPG SHLWNDYMAD LRVTCPWWDP KSNENEHDLF KVLGEGQRPD HTWLIIGPRR SGSVFHIDPN GTHAWNAAIV GRKRWIFYPP GATPPGVYPS EDGDEVALPL SLGEWLFQFW DEHVERMQSA PPHERPLECT AMPGDVMFVP HGWWHAVINL DKINVAITHN YVSGSNLGNV LRFLSKKENQ ISGCRDRLES IKPDRLYHEF VTSLDKYRHD LLQRGLSQKD WTCQAWRNTT TNEIKRSGRK RRKAEVAKSE RNITEKASLM ARAKTKEPSF SFSFL
|
| |