Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44564 |
Symbol | |
ID | 7197803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 909420 |
End bp | 911415 |
Gene Length | 1996 bp |
Protein Length | 582 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178601 |
Protein GI | 219115611 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAACGTACAA GACTTGGTGT CTTTTCAGAG AAAATTGGGA AAGTGATATT GGGTGGTCTT CGACGCAGTT TTTCTTGACC GAGACTCTTG GCTGTCTACT GCAAATTCTT CTTTTCATTA CTTCAGCTGC GGTCGTCCTA TTTCGTCGGT TACGAGCCTA CAAAACATTG AAGCGACGCT TTGGATCCAT TCCGCATACT ATGGCTAGTA ATAGTGATGT TGAGTTTTCA ACGAATCATT CACGCTGCGG TTTTCCTATG GCTGTTGTAG CTCACGCTTT TTCCGGTTAC GGCGAGGGAG AAATGCCGCG ATCGAAAGCC GAGCTGGAAA TCATGCAGCT AGCAGGCTCA ATTCGGGATA AACCACTTTG GTGGAATAAA GTTCGCGATC CTTCCATTAC CAATCGCTGG CAAGCCGAGT CGCTGAATGG TGTCAATGAT TCGGAAGCTG ACTGTGAATA TCGTACACGA CAATTTCTTT TTGCGCTTAG GGAATGCCAG TGGCAGGCGA TGCAATATCC GGGACCAGGT CGTCCAGCGG CGGTCGATCG AGCCTTTTCG AGTGATGGCC ATGATGAATC CGAGCTCTGG TCCAAGCTGT TGTCTGAAAT CAATAAGTTA CGATCGCTAC CAGCGGTTGG GTCGAACAAG GAAGATCGTC ATCCGGGAAC TCCTCAGATG GTGGACCTAG TCCATCCTTC TCTTTATGCA TATGAGCGAG GGAAAACAGA AGTCCTGCCT CACGTGCCCG CTAGTATGAG CCAGATGCCA CATTGGGATA ACTTCTTGGG TGTGGCCGGT GTCACTGAAA TACCGCCAAG GAAGATCGAT GGCCAAGGAT TTTTTTCACC GGGCGGCCTA CAGTGGCTTC CTGCGGAATT CAGAGTCAAC GCTGCTGGAG ATTCATGTAA AATTAGTTCG TACATCAATT CACTCCATCC GACAGAGTTC GCGGAGTTGT ACGATTCCAT CGGAGAGTTG TTTTGCAGGG CGCTTCCCCT TTTGGAAAAT ACCTTGCGGG AGGCTGGTCA TGGACCTTTG GAAGACGGCG AACACTGGCA AAAGAACTAT CGTGAACGTG ACCACAGAGT GCCTATCGTG TCTGATTGGT GGGAAGCGCC TCGAAGTCAG CTGGAGGGCG AAGATGACGA TTTATATTAT GATTACCTGG ATGACTTTCA TGAGATTCGA GAGTTTATTC CGCCTGAAAT ACCAGACTTC ACGCCCCCAT CGTATAATGT GCCAGAGGCT GATGTTTCCT TGCGAAACTG TCCCCTCCAA GTGATTGTCA AGGTTGCGTC CTTGGAGATT GAACCGGGAG AATCCTACGA GGGGGGAGTT TGGCATGTAG AAGGTACTCT AGATGAACGG ATAGTGGCAA CGGCCTGTTG CTATCTAAAT AACACGAATG TGCAAGGTGG CGATCTGGCT TTCCGCGTGG CAGTAGCCGA GCCAGATTAC GAGCAAGGCG ACGACACGGG TGTAAGGAAT GTCTACGGTT TGGAAGATGA CGAACCGTTG GTTCAATTTA TTGGCAGCTG TAGCACTCCG ACTGGACGAA TACTGGCCTG GCCCAACACG CTGCAACATC GGGTAGGACC CGTTCGTCTC ATTGACGAAT CAAAATATGG AAAGCGCCTC ATAGTTTGCT TTTTCCTTGT GGATCCAACG CTACGGATCC GTTCGACGGC AACCGTACCA CCGCAACAAC TTGCTTGGGT ATCCGATATG TGCAAACCAA TCCTGAATGT GGTCGGCTCA GGTGCTGCTG AACCAGGAAT TCAAAATATG ATTGTTGCAA GATTGGCTTC CCCTTCTTTG CTTACCTATG ATCAGTCGTG CGAACGTCGT AATCGTCTTA TGGAGGAAAG GCGTGCAATA TATGAAAAAA CTGGTGGATA CGATCATCAA AAATTTTACG AGCGACCGTT CTCTCTGTGC GAACACTAAC TAACTGTAAA TCATATGAAA ATGAGAAAAG AACTTTATAA TGCGTT
|
Protein sequence | MASNSDVEFS TNHSRCGFPM AVVAHAFSGY GEGEMPRSKA ELEIMQLAGS IRDKPLWWNK VRDPSITNRW QAESLNGVND SEADCEYRTR QFLFALRECQ WQAMQYPGPG RPAAVDRAFS SDGHDESELW SKLLSEINKL RSLPAVGSNK EDRHPGTPQM VDLVHPSLYA YERGKTEVLP HVPASMSQMP HWDNFLGVAG VTEIPPRKID GQGFFSPGGL QWLPAEFRVN AAGDSCKISS YINSLHPTEF AELYDSIGEL FCRALPLLEN TLREAGHGPL EDGEHWQKNY RERDHRVPIV SDWWEAPRSQ LEGEDDDLYY DYLDDFHEIR EFIPPEIPDF TPPSYNVPEA DVSLRNCPLQ VIVKVASLEI EPGESYEGGV WHVEGTLDER IVATACCYLN NTNVQGGDLA FRVAVAEPDY EQGDDTGVRN VYGLEDDEPL VQFIGSCSTP TGRILAWPNT LQHRVGPVRL IDESKYGKRL IVCFFLVDPT LRIRSTATVP PQQLAWVSDM CKPILNVVGS GAAEPGIQNM IVARLASPSL LTYDQSCERR NRLMEERRAI YEKTGGYDHQ KFYERPFSLC EH
|
| |