Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_32803 |
Symbol | |
ID | 7197303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 833009 |
End bp | 835411 |
Gene Length | 2403 bp |
Protein Length | 800 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178015 |
Protein GI | 219112527 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.028074 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAATAG CGCTCAGAAG AATGATAAAG CTGCGGGCAT CTATACTTGC GGTGTTGTTA ATATCGTGTT ATGTTTCGGC TTTCCAACCG AATCCGATTC ATTCCAATAG GAGGGCGACG AGCAGGGGTC CAAGCACGTC CTGTGAAAGG AACATTCCGA CTGGCTTCGG TCGTCCGAGT CCATCAAGAT TTGACCTTGA CTTTCAGAAA CATCAATCTC AGTATGGAAA GCTGTTTTCT TTGAATAAAC TCGTTGAAGA TATCAGCAGC AGATCTCCGG GCCAACTACC CTCCACCGTC TTTGTCGGTG GAAAAGGAGG AGTCGGGAAA ACCACCGTGT CATCGGCGCT GGCCGTTAGT TTGGCTTCAG CCATCGAAAA GGATTTGAAG GTTCTGATCG TATCTACCGA CCCTGCTCAC TCTCTAGGTG ATGCCTTGGA TGAGGATTTG CGCAAGAACA ACGGTCGTCC TGTTGCTATG ACGGACTCTC TGACAGGTGG TAGATTGGAC GCATGCGAAG TCGACGCTTC GGCTGCGCTC GAGGACTTTC GCGAAAACAT TGCTGCCTTT GATATCGATC GACTAGCTGA TGCTCTCGGT GTTTCTGTGG ACTTACTGGA AAGCTTCGGT TTGAGAGAAT TCAGTGGTCT TTTGAACAAC CCTCCGCCAG GTTTGGACGA ACTCGTGGCT TTGTCGAATG TATTGGATTC GGAATCTGTG GCCAAAGGTT ACGACGTGGT AATTGTGGAC ACCGCACCCA CCGGACATAC TTTGCGACTG TTAGCTTTGC CGAAATTCTT GGACGGCCTA TTGGGGAAAC TTATTAAAAT TCGCTTGCAA CTATCGGGGC TGGCGTCCAC TTTACAAACC TTCTTTGGAA ATGACGAAGC ACAGAAACGT GCAAAAAGCA TCGACGATGC CGTCAACCGA TTGGAGCAGT TTCGTCGAAA GATGAGTAAT CTTCGCGAGC GGCTTCAAGA TTCCCAGTCG ACGCGTTTTG TTGTCGTGAC AGTCCCTACC AAGCTCGGAG TTGCCGAATC GAAACGCCTT GCCGCCGAAC TCAATTATCA AGGAGTAAGT ATCACGGATA TAGTCGTGAA CCAATGTGTC GGTGGGATAG ATGACGATGT GGACTCTGAA GCTCTACAAC AATATTACGA TCGACGAAAG GATGGACAGA AAAAATGGAT CGCCAAGCTT GAAGAAGCTG TTCAGGACGT GAGCTGTAGT GAAGAGTACA AAGCAAATGG TAGTTCCGCT CCTATTGGCA TTACCAGGGT TCCATTTTTC GATGTTGAAT TGGTCGGAGT GCCCGCATTG GGATACCTTG CTGCACAATG CTTTACAGAA AACCTCAGCT TTGCGCATTT GATGAATGTC GATAGCTCGA ATGAGCCACG AGTTGTAATT TGTGGGGGGA AAGGAGGAGT CGGAAAGACA ACGACTTCGT CGGCACTAGC GGTTTCGATG GCTTCGAAAG GCCACAAAGT AGCGCTGATA AGCACGGATC CGGCTCACAG TATTGGTGAT GCTATCGAAA TAGACCTCTC TGGTGGAAAG CTTGTGGATG TTCCGCTAAT AGGAATCCCG ACGACGGATG GCTCACTGTC TGTTTTAGAA ATCGATCCGT CGACAGCAAT CAATCAATTT AAAGGTGTTG TGGATCAACT CATTGGTGGA GACGATAATC CTTCAGATGC TGGTCTTCGA AATACGCTGC GTGACCTACA AGAGGTGTTT GATACTCTTC CGGCAGGCAC GGACGAGGTG GTGGCTTTGG CGAAGATTGT CAATCTGGTG AAGAAGGGCG GATTCGACCG GATTGTATTG GACACGGCCC CAACAGGGCA TACACTTCGA ATGCTGAGCA CACCAGGCTT TCTTGCCGAG CTTATAGATC GCCTGCTTAT TATAGCCGAA AAAGTGAATT CGAATACGGC AATAAAAATG TTAATCGGAA GTTCCGCACG GTCAGAGGAC ATCTCAAATG CTGCAGCAAC AGCAAAGTCC ACTCTTCTGT CCTTCCAGCT CCAAATGTAC GATCTCGAAA ATTTGTTTGC TGATGCTGCA CAAACGGAAT TTCTCATCGT AACAGTGCCC ACGGAGCTTG CCGTAAGGGA AAGCATGCGA CTTCTAAATG ATCTGACGTT TGAGTCCCCA GACATGCCTA TTAAATGCCG AAACATTGTG GCAAACCAAG TTCTTGGGGA CGATGGAAAC GATGCAAAGA CTTTTCTGGA TCATGTGGGG CAGACTCAAG CAATATCCGT AAAAGACCTT GAAGATGCTG TTTCGAGTTA CCCTGCACCT CCTCTAATTA CCAAAATTAA GTACCTGGAC ACGGAACCCC GCGGTGTGTT TGGACTTAAG GTATTGGCCG ACGAACTACT GAGAGAGATA TAG
|
Protein sequence | MVIALRRMIK LRASILAVLL ISCYVSAFQP NPIHSNRRAT SRGPSTSCER NIPTGFGRPS PSRFDLDFQK HQSQYGKLFS LNKLVEDISS RSPGQLPSTV FVGGKGGVGK TTVSSALAVS LASAIEKDLK VLIVSTDPAH SLGDALDEDL RKNNGRPVAM TDSLTGGRLD ACEVDASAAL EDFRENIAAF DIDRLADALG VSVDLLESFG LREFSGLLNN PPPGLDELVA LSNVLDSESV AKGYDVVIVD TAPTGHTLRL LALPKFLDGL LGKLIKIRLQ LSGLASTLQT FFGNDEAQKR AKSIDDAVNR LEQFRRKMSN LRERLQDSQS TRFVVVTVPT KLGVAESKRL AAELNYQGVS ITDIVVNQCV GGIDDDVDSE ALQQYYDRRK DGQKKWIAKL EEAVQDVSCS EEYKANGSSA PIGITRVPFF DVELVGVPAL GYLAAQCFTE NLSFAHLMNV DSSNEPRVVI CGGKGGVGKT TTSSALAVSM ASKGHKVALI STDPAHSIGD AIEIDLSGGK LVDVPLIGIP TTDGSLSVLE IDPSTAINQF KGVVDQLIGG DDNPSDAGLR NTLRDLQEVF DTLPAGTDEV VALAKIVNLV KKGGFDRIVL DTAPTGHTLR MLSTPGFLAE LIDRLLIIAE KVNSNTAIKM LIGSSARSED ISNAAATAKS TLLSFQLQMY DLENLFADAA QTEFLIVTVP TELAVRESMR LLNDLTFESP DMPIKCRNIV ANQVLGDDGN DAKTFLDHVG QTQAISVKDL EDAVSSYPAP PLITKIKYLD TEPRGVFGLK VLADELLREI
|
| |