Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47678 |
Symbol | |
ID | 7202872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 497170 |
End bp | 498883 |
Gene Length | 1714 bp |
Protein Length | 467 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181920 |
Protein GI | 219123206 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACGGTGAAA TGATAAAATA ATTTGCATTT CATGGTCAGC GACGACCGTT TGGACAGACC ATTGCGGCGT TGTGGAAGTG GATCGCCTTC AAAGTTGTTG CAAGAAACTG TGTACTGACT TTAAACAAGT TAGCTGACGA GTAGTACGGA ATCACTGGTG CGAGTGACAG GACACAAACT CAGTGTTTGC GTATTCCGAG CGTGGATTGA ATTTTTGCCA CGCCGACCGG TTTCTTGGGG CGAACTAACT TGGAACAACC GTTCAACGGG AGCGATCCAG CGCAACAGAG AAACAAAAGA TTAAGAGGTC ATGGTGGCTA TGAGACCTGC TAGTAGAAGC AAAAGAAGAC GTACACTTGA GCTGCTCGTT CTGACGACAT TTTTGGTGAT CTATTTCGTC CTATCAATCA AATTACTGTT GTCAGGCATC TATCCCAATC AATCCGCCCT CAGTATGAAG AATCTTAATC TTGTACAGAG CGCAGGAAGC GGGCTTGTGT CTCTCAGAGG AAAAATTGCT GCATCGGCGT CAGCGCGTGG CGGCGATGTT CTTTACGGTG TCCATATGGC TTCTAACTTA TCGGGCGTCC TAGATATCCA GAACTTTGTT CGATCACACT GCTCGACTCA AACCAAACAG TTCTATGGAA TCGGCAAAGC AGCAGTGGAG CTGTGTATAA AGGGCTCCTT TCCACCGTTC AAGTATACCC TTCCCAACTT TGTCCGCCCC GACGACGAAC GGATTTTTCT CTTACAGAAA AAGAATCGCG AATGCTTTGA CACAAACTGG ATCGACTGGC AATACGAAAC ATGCGTCACA ATTCACCCGG CGGCAAAAAA TGTGTTACAC GCCCGTATTC AAGGCTACGA TGCCTGGTCG TTTCAGCATT TTCACGACAA TGCATTGCCC TGGATATATC AGGTTCGTCA AATCATGGAT GTTTTACAAA ATGCCAGCAC TTGCCGCTTT GAAGAGCATT TGCAGGTGGC AGAACCTCCC AATGACGTCA CGCTGGGGGA ATGGCAACGG CTGGGCTTTG CCAGAAACTC GTTAGATTTC ATGCCCCGAA TGCGACAGCT ATTGTTAAAC GGCAGTGGCA GAGATTTCAG TCTGTCGATT CCCAATTTTC CCAAAGACTA CGACCAAGTC CGCGTCCACC CGCAACATGT TACATGGCTA CGCAACTCAC TTGGACTTGG CCAGCGAGAA CGTTCTGCAA AAACAGTTTA TTGGATAAGC CGGAAAGAAG CACGACAAGG TCGTCGGTGT AGCAATGAGG AGGAGGTTGT TGAAGCTCTA AAACGCAATG GAGTGAATAC AATTTTTTTT GATCTGGCGA AAGCCAGCAC AAGCTCAGGC TCTGATGTGA TAGAGGACTT GATGGCCCTC TTGGACAATG CTTGTGCTAT TGTTGGTGTT CACGGTGGTG GGCTGTATAA TCAGTACTTT GCCCCCGCTA CGACAGCTTT GGTGGAGTTG ATTCCAATAC AAATCAAGAA CGATCTATTC CATGATCAAT TTGATGGCAA CGTCACCGCT CCACGAATAG CATCGAGGGC CTTTTGGCAC AATTCACAAC TCATTGGACA ACCATACTGG CGGATCCATG CTCGGACTGA ATCAGATCGG ACGTTTGCAC TTGACTCTCA AGCAGTAAAA GATACGGTAG CAGCACTGCA AGCGGCAGGC TGTGCATTTA AAGGATCATC CTGA
|
Protein sequence | MVAMRPASRS KRRRTLELLV LTTFLVIYFV LSIKLLLSGI YPNQSALSMK NLNLVQSAGS GLVSLRGKIA ASASARGGDV LYGVHMASNL SGVLDIQNFV RSHCSTQTKQ FYGIGKAAVE LCIKGSFPPF KYTLPNFVRP DDERIFLLQK KNRECFDTNW IDWQYETCVT IHPAAKNVLH ARIQGYDAWS FQHFHDNALP WIYQVRQIMD VLQNASTCRF EEHLQVAEPP NDVTLGEWQR LGFARNSLDF MPRMRQLLLN GSGRDFSLSI PNFPKDYDQV RVHPQHVTWL RNSLGLGQRE RSAKTVYWIS RKEARQGRRC SNEEEVVEAL KRNGVNTIFF DLAKASTSSG SDVIEDLMAL LDNACAIVGV HGGGLYNQYF APATTALVEL IPIQIKNDLF HDQFDGNVTA PRIASRAFWH NSQLIGQPYW RIHARTESDR TFALDSQAVK DTVAALQAAG CAFKGSS
|
| |