Gene Information |
Locus tag | PHATRDRAFT_21350 |
Symbol | |
ID | 7202164 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 363706 |
End bp | 366096 |
Gene Length | 2391 bp |
Protein Length | 557 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181363 |
Protein GI | 219122041 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
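The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon. Under those assumptions the genomic span matches the listed gene length, and the part of the span not accounted for by coding sequence is consistent with the gene being marked as spliced (that remainder may include introns and/or untranslated regions).

```python
# Values copied from the Gene Information fields above.
start_bp = 363706
end_bp = 366096
protein_length_aa = 557

# Genomic span, assuming 1-based inclusive coordinates.
gene_span_bp = end_bp - start_bp + 1

# Coding length, assuming the protein length excludes the stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Portion of the span that is not coding sequence
# (introns and/or untranslated regions; the record does not say which).
noncoding_bp = gene_span_bp - coding_bp

print(gene_span_bp, coding_bp, noncoding_bp)
```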
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00403963 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGCGGTAGGT ACCACTTTTT GGGCTGTTCA CTGGCGGGGT CAGTTTCTAG CTTATAGGCC TTTTCATCGA ACCGTCCCTC CGTCACAGTC AGTGTTTTGC CCTCGATTAA CCTGTAAGCA ATAACGCATA ATGGCTTCGG TAGTATGGAG GAGGCCTTGC ACAAGGCTGC TGTCAACAGT ACAATACACA CTATCCGCCA GCAGGGTTTA TAGAAGAAGC GCAGTTTACC TTTCGGTCTC GACAATTCAG CTGTCTGTGA GATCTTTTTC GGACAAACAC GATTTGGAAA GTCCTAAAAA GCGTTTACAC TACGAGATTC CTCAGCGAGT CCGACAGAAA GTGGTCAGTG CCTCGGATGC GGTGGCCCTT GTTCGAGATG GCGATACAGT TTCCTGCTCC GGATTCGTAG CGCAAGGTAC GTCGGACCAA GGCAATCCAT TGCGTTTTTG TTCGGTTTGT TTGTACCAAT ATCCGAGCTC GCATGTCTCA CAATATTGTT CGTATTTCGA CTCTAGGCGT TGCGGAAGCT GTCTTGAAAG CTTTGGGTGA ACGGTACGAC CGTACTGGCC ATCCCAACAA CTTGACGCTG CTCTTTGGCG GAGGTCCCGG AGACTGGGAC AGCCGCGGGC TGAACCATCT CGGAAAGTAC TCCGAAGACA GCAGCAAGCC ACACATGCTG CGACGCACCA TCGGATCCCA CTATGGGCAA ACGCCCAAGG TAGCCCACAT GGCTCTCCAA AACATGGTGG AAGCCTGGAC TCTCCCATTG GGCTCGGTGT CTCGTATGAT ACGGGCTCAG GCGACGCACT CACCAGGCCA TATCACTGAA GTGGGCATTG GTACTTACAT TGACCCAGAT ATTTCAGGTG GGGCTACCAA CGAGACTGCT TTGGAGAGTC CGTTGCACAG CAAACTGGTG CAAAAGCTGG ACATAGATGG CCGTCCGAAT CTGATGTACA AGGCCCTACC GATCCACGTA GCCATCATTC GTGGAACGAC GGGAGACTGC CGCGGCAATA TATCGATAGA GGAGGAGTCT GTGATCTGCG ACCAGAAGAT CCTGGCGGCC GCCGCGAAGA ACTCGGGAGG CATTGTGATT GCTCAAGTGA AGCGGTTGGC CGCGGACGGC ACCATTCCTT GCCGAAGTGT CGCAATCCCT GGGCCGTTAG TTGACGCTGT AGTGGTCGTG GACGAGAAGG ACCACGACAG TCTGCATCCA ATGAGCTATG TGGAGCGTAA CAATCCATCG CTGACTGGTC AAATCAAGAC GCCCCAGGAC GAAGTCCAGA AGATGCCGTT GGATGTTCGA AAGTTGATTG CTCGTCGGGC GTTCTTCAAG TTGAGTCCCG ACCAGATTGT GAACCTTGGA ATAGGTTTGC CAGAGGGCAT TGCAAGTGTG GCTGCCGAGG AGGACATGCT CCAGTATATA ACACTGTCAA CGGAGGTGGG CGTCTTTGGC GGTCTGCCGG CTTCTGGGCA TAACTTCGGT CCAGCGTACA ATGCTACGGC GATGGTGGAA ATGAACCAGA TGTTCGACTT TTACGATGGA GGCGGGTTGG TAAGTAGGAT ATGGATGGGT ATGCGGATTA ATCTGTGTTC GTTGTTGCTT GTTGAATGTT GAATGTATTT GTGTCGGCCC TGAACTAACT CTCGGGAACT GCTGGTTGCG GTATACTGCG TGTGACTTTT CGCCGTCCGG TGTTGCAACA GGACATTTGC TTTCTGGGTG CTGCTCAGAT CGGAAAGAAC GGAGATGTGA ATGTGTCTCG TATGTCGAAA GACCGACTGA CGGGTCCCGG 
GGGCTTTATC GACATTACGC AGTCTACGCG GCGTATTGGT TTTGTGATGG CATTTACTGC GAAAGGGCTG GAGGTGGACA TTCCTGGAGA CGGGAAGCTG GGAATCAAGC AAGAGGGTAG AGTGAAGAAG CTTGTTTCTA GTGTGTTTGA GAAGACGTTT AGTGGAGACG AAGCGGTACG ACGGGGTCAA GAAGTCACCT ATATTACCGA GCGAGCTGTT TTCCGTCGAA CTGGGAAGTT TGATGTGCAA AATTGAGTTG ATGGAGTTGC CCCTGGTATT GATTTGCAGA AGGATATATT GGACCAGATG GAATTCATAC CAGCAATCAG TCCCAAACTA GCTGAAATGG ATCCACGTAT CTTCAAAGAA GGCAAGATGA ATGTAGCGAC GGATTTGTTT GGCTCATTCG ACAACAGATT TCGATACCAA GAGGATGATC ACATCATGTA TCTTGATCTG TTTGGAATCA CTTTAAATAC CGAAGGAGAT ATTCAGTGGT TTTTCCAGGT TGTCAATGAC ATGCTAAAAG CCAAGGTTAC CGAGAAAGGA AAAGTTGCAC TGGTTGTAAA CTACAACGGA TTTGATGTTC G
|
Protein sequence | MASVVWRRPC TRLLSTVQYT LSASRVYRRS AVYLSVSTIQ LSVRSFSDKH DLESPKKRLH YEIPQRVRQK VVSASDAVAL VRDGDTVSCS GFVAQGVAEA VLKALGERYD RTGHPNNLTL LFGGGPGDWD SRGLNHLGKY SEDSSKPHML RRTIGSHYGQ TPKVAHMALQ NMVEAWTLPL GSVSRMIRAQ ATHSPGHITE VGIGTYIDPD ISGGATNETA LESPLHSKLV QKLDIDGRPN LMYKALPIHV AIIRGTTGDC RGNISIEEES VICDQKILAA AAKNSGGIVI AQVKRLAADG TIPCRSVAIP GPLVDAVVVV DEKDHDSLHP MSYVERNNPS LTGQIKTPQD EVQKMPLDVR KLIARRAFFK LSPDQIVNLG IGLPEGIASV AAEEDMLQYI TLSTEVGVFG GLPASGHNFG PAYNATAMVE MNQMFDFYDG GGLDICFLGA AQIGKNGDVN VSRMSKDRLT GPGGFIDITQ STRRIGFVMA FTAKGLEVDI PGDGKLGIKQ EGRVKKLVSS VFEKTFSGDE AVRRGQEVTY ITERAVFRRT GKFDVQN
|