Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47426 |
Symbol | |
ID | 7202558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 598860 |
End bp | 600410 |
Gene Length | 1551 bp |
Protein Length | 490 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181592 |
Protein GI | 219122522 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.308736 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TATTTTTGTT CTTACACTTA TCTTGGAAAG TGAAGTACCC AAGGACATCG CCCTATCTTT TCATTTTACA TTGCAATAAT GGCACTGTTA GCGAGAGCAA CGTCGCTGAT AACGGGATTG ACCTTGATAA TGTTATTTCA GGAACTATAC TCGTACACAG TCAGCCCGCA ACCATTTCTT CACTCCAAAA CGGAAACAGC ATTATCATAC ACACATCCCA ATCGATATCG ATCTCGGCGA CACCTTATAG AGAATAAAAT GTTCTCTACT TCCAATGAAA ACGAAGAAAC TCCCAACGAC GCCGATGAAG ATGGTGACCA CGAAAACGAT TCGTACATGG TGAACGCTTC CTCAGAGTTT CTTGACGACA CCTCTGCGAG TAGCGGGAAT ATGGTCAAAT CGGAAAGTTC ACCAGCAAAA ACATTGGACT GGGGTGGAGC GTTAGGAAAA CTACGGCAGC GTGTGGAAGA TATGGAATCG GGAAAAGCAG GAGATGCTTC ACAAGTTCTT TTTCGACTAA TGTCTTCGGA ATCACCCAAC CAAGCGATTG GCTCCTTCGT TTCTTCGGCC AATCCTCAGG TCGTTCAAGC CATGTCTGGC GCCGTCAGTA GCTTGCTAGG TGGTCTTTCC AATCCTCAAA TGGGAGCGGA TGTACTCGTG AAAGCGTCTG GCGATAAAAT AGGCTCACTC TGCTTCCATC TTCAGATGAC TGGGTATATG TTTCGAAACG CCGAGTACGT GATGGCCTTG AAGGAAGTGA TGCACCTGAG AGGATCGACA TCTCTACAGG ATTACAAAGA TGCTTTTGAT CGCATAGATG CAGATAATTC CGGATTTATC GAATATTCCG AGATCAAGGA ACTATTGGAC GACGTCTATG AAGGTTCGAC ACCAGCGTAT GAAGTGGAAT CATTCATCAA ATTTTTTGAT GAAAATAACG ACGGCAAAGT TTCTTGGGAA GAGTTTGAAC GCGGATTGGG CAGTGCCATT GCACAACAAC AGCAACTGGT AAAGAAGCTT AACTTGCTTG CGCCCGCTGC TCTTGACGAC GATGACGACG CACCCAACCT GGATCCTGAT GTGACGGGAA TAGTCGAACT TGAGCTTGAA AACGGAAAAA TTGTGGAGGT GGAAGCAAAA GAATACATTC AAAGCCTCAA AAAGGAAGCG CAGGCTTTAA AGGACGCTCT GAGAAGAGAA AAGTTTGGCG GGGCACCTCC CAACCCGAGA GCTGGCAATG GCAGTGACCT CTCTGCAAAT GAACCTGGGG AAGGCTTCGG TGGAATCACC GGCTATATTG CGAGTCGTCA GGGCGATCTG AAAGCGTTGA CGCAAGGAAT TAGTCCCGAG ATTGTGAGCA CGATGAAAAA GCTTGTCGAT TTCGTTCTAG AAGGCGGAGA AAGTGGGCAG ACGAAAAACG TCCCGAAGGA AGAAATCCAC ATGGAAATTC CAGGATCTGC TCTGCAACAA CTAGCGCTCT GGCAATTGAT TCTAGGATAC CGACTTCGCG AAGCTGAAGC AAAAGGCGAC TATTTGAAGC TTTTGGAGTA A
|
Protein sequence | MALLARATSL ITGLTLIMLF QELYSYTVSP QPFLHSKTET ALSYTHPNRY RSRRHLIENK MFSTSNENEE TPNDADEDGD HENDSYMVNA SSEFLDDTSA SSGNMVKSES SPAKTLDWGG ALGKLRQRVE DMESGKAGDA SQVLFRLMSS ESPNQAIGSF VSSANPQVVQ AMSGAVSSLL GGLSNPQMGA DVLVKASGDK IGSLCFHLQM TGYMFRNAEY VMALKEVMHL RGSTSLQDYK DAFDRIDADN SGFIEYSEIK ELLDDVYEGS TPAYEVESFI KFFDENNDGK VSWEEFERGL GSAIAQQQQL VKKLNLLAPA ALDDDDDAPN LDPDVTGIVE LELENGKIVE VEAKEYIQSL KKEAQALKDA LRREKFGGAP PNPRAGNGSD LSANEPGEGF GGITGYIASR QGDLKALTQG ISPEIVSTMK KLVDFVLEGG ESGQTKNVPK EEIHMEIPGS ALQQLALWQL ILGYRLREAE AKGDYLKLLE
|
| |