Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_29223 |
Symbol | |
ID | 7203003 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 679358 |
End bp | 681445 |
Gene Length | 2088 bp |
Protein Length | 439 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182441 |
Protein GI | 219124292 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTCCGGGGA GTTTAGTGGG CAAAATTGAC GATCTTGCCA GACCACTCGA CGTGCAAATC TTCCTTCATT TGGTTGGTAC GACTGAGTTG GGTTTATTAA AGGCTTTACC GACACAACGA GAGCTACTCG TGCGGGGGAA GCCTGCCTTT CTTGACGGAC GGATCCAGTA TACGTGACAA AATCTTGGAG GATCTCGTCG TAGAAGTCCT ACGAGAAGCT ACACGACAAA ACGTGGACCT CGAGTGCCGT GCCGAAGAAA GCCTGCCGAT TTTCGTCATA TCTAACCAGC CCCTTCCTCA GGAACAACGG TTCATTTACT GGTTGGAGCA AGCTTTGGAG GTTCATTCTA GTTTCTCACT CCACGAAACG CTACCGTTTC GAGAAGCCGT TTCGAGAAGA ATTTATCCGC ACGAAGAACG AACGCTGTCG TGAATTGTCG CATTTGGATC GTTGCTATTC CCAAACTTTG GCTTGCTTTT CATGTCCACA TCGTTCTCGG CGGCTCTCAC ATCTTCAACG ACGACAGCGA TGGAGGCTTC GGCAACTAGT CCCCGCAAAG GACACGCGAA TAGTATGGTG GAAGCATCCG GTTCCTCGGC GACATCCGCT CCGGCAGCAA ACGCGCAAAG TGACAACACC CGTAGCGGTC CCTTGTCGGA ACAGCAGCAA ACAGCATCCA CCGCCGCGAG TAACAATATA TCCTATTCAG CGGAGCGTAT TATCGGAAAC GGGTCGTTTG GTGTAGTTTT CGAAGCAAAA GTAGTGGGGA CCGGGGAAGT CGTTGCGATC AAAAAGGTTT TACAGGATAA GAGGTTCAAG AATCGTGAAC TCCAGATTAT GAAACAGCTA GTCAGAGATC CTCATACCAA TATCGTAGGG CTTAAGCATT GCTTTTACTC ACAGGTACGT TCGAACTGAG GAGAAATAAC TCTTCATCAT CTCTCCATTA TGTGCTCAAC CAATTTTCCT TCCACAGGGC GAAAAACCGG ACGAGCTGTA CTTGAATTTG GTGCTTGAAT TTGTACCGGA AACCGTATAC TCTATCAGTC GAAGACATCA GAAGCATTCG ATGCAACTGC CACTGATGAG CGTCAAACTC TATCTTTACC AGCTCAGCAG GGCCCTAGCT CATATCCATT GTTTGGGAAT TTGTCACCGA GATATCAAAC CGCAGAACTT ATTGGTGCAC CCGCAAACTC AGCAACTAAA ATTATGTGAT TTTGGTTCTG CCAAGGCGCT CATTCAGGGC GAACCTAACG TATCCTATAT TTGCTCACGA TACTACCGTG CACCGGAACT GATTTTTGGA TCGACGGATT ACACCACCGC GATTGACATT TGGTCGCAGG GTTGCGTGGG CGCAGAATTA CTGCTTGGAC AACCCCTATT TCCGGGAGAT TCGGGTGTCG ACCAGCTCGT AGAAATCATC AAGGTACTGG GGACACCAAC TAAGGAGGAG ATACGATCCA TGAATTCGAA CTATATGGAA TTCAAATTTC CACAAATCAA AGGTTGTCAG TGGAAAAAGA TTTTTCGTAA CAAGACACCG CAGGACGCCA TGGACTTTAT CGCGGCGACC TTGGCTTACA CGCCGTCGGA ACGGATCTTG CCGCTCGAAG GATGCGCGCA CGAATTTTTT GACGAACTGC GACAGGAGTC GACTGTACTG TCAAACGGAG GCGGCAAGCT CCCGCCTCTA TTTGATTTTA CAACTCACGA GTTAGCAAAA TCGCCCCAAC TTTTGACAAA GTTAATACCG CCGCATTTGA AAGGATCGTT CGAGATTCCA TCGGTAGAAA CTGATGACGT CGCTTCCGCA ACAACTCCTA TACCATCGTC ACTGGATCGG AAGCAGGAAG CTACCCTTCG ATGAGCTTAA TGCACAACAA AAAGACATAA TCAAGCCGTA CCGTACGCAC TATCTATGTC CATGTCCACT GGCGCTCGTC TGGGCATTGG TCAGCGTGCG GATACGAAGA GGGACGGTAG AGTCGTCCTT TTCCGTGTCG GCATGGTTTT CCACGAGCAA TCCCACCCAC AAATTTCCCA AACTTAACGG AAAAACTAAT TCCTTCTGTT TTCCTATA
|
Protein sequence | MSTSFSAALT SSTTTAMEAS ATSPRKGHAN SMVEASGSSA TSAPAANAQS DNTRSGPLSE QQQTASTAAS NNISYSAERI IGNGSFGVVF EAKVVGTGEV VAIKKVLQDK RFKNRELQIM KQLVRDPHTN IVGLKHCFYS QGEKPDELYL NLVLEFVPET VYSISRRHQK HSMQLPLMSV KLYLYQLSRA LAHIHCLGIC HRDIKPQNLL VHPQTQQLKL CDFGSAKALI QGEPNVSYIC SRYYRAPELI FGSTDYTTAI DIWSQGCVGA ELLLGQPLFP GDSGVDQLVE IIKVLGTPTK EEIRSMNSNY MEFKFPQIKG CQWKKIFRNK TPQDAMDFIA ATLAYTPSER ILPLEGCAHE FFDELRQEST VLSNGGGKLP PLFDFTTHEL AKSPQLLTKL IPPHLKGSFE IPSVETDDVA SATTPIPSSL DRKQEATLR
|
| |