Gene Information |
Locus tag | PHATR_44112 |
Symbol | |
ID | 7203872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1014924 |
End bp | 1017235 |
Gene Length | 2312 bp |
Protein Length | 757 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186449 |
Protein GI | 219113731 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATTC TGTGTATCAC GGTTCTTTCT TTGGTCGCGG TTACTGTTGC GTTTCGTCCT TCCCAACGAT CGCTGGTGCG GAGTAGCGCC GCTTCTTTGA CTTTACGCCA TGCCCCCTTT AGCCGCTTAA CGAAGATCTA CGAGCAGACG GATCAAACCC AAACGGAGGT TCTCAAGCTT GAGCCTATCC TACTAGTCTC AGATACGGAT AAAGTGGTTG ACCCTGGCGA GGAATTGTTT TTGGCTACGG TAGAAGAGGC GGTAGACGAG GCGTTGCAAG CGGAGGTGGA AACACTTGGT TCCGAAAATG AAGTCGAGAC TTTCGGCGTC GTATTAGAGA GCTTCACAGA AGAGAGTGCC AACGTAATAT CCTCATCAGA GATGTCTGCA GAATTTGCAT CCGTATCAGA GACATCTGCC GAAGTAGCAT CTGATGTTGT GAGTGCCATT CTCGCTGCTT CGCAAGAAGC CGCTGATGCT GCGGAAGCAA CCCTCTCGGA TGAAGACATT TTTAATTATT CCACACCCGG CTTTAAAAAT GCTACGGTGG AAGTTAACAG AATCCCTGAG ATCCTTCCGG CGTCGCAGAT CGTTGGGGAT CCCTCAGTGG CACCAAAAAT AGCCGCACCG TCGGTTGGAA AAATTCTTAA ATTCGCGTTA CCTGCCACCG GGGTGTGGCT CTGTGGGCCT CTCTTGTCGT TGATTGACAC GAGCTCTGTA GGCATTCTGT CCGGAACGGT CCAGCAGGCT GCTCTGAATC CCGCGGTTGC TGTCACTGAC TACGCTGCCT TGCTTATTGC ATTTTTGTTT ACGGGGACGA CGAATCTCAT GGCGTCAGCC TTGGAGTCTG ATCGTGGAGT AGAAGGATCA CCCCGGAGCA CAAGTACCCT GAAAGGAGCC ATACAACTTT CGACTTATGT CGGCGCTGGC TTGGGCGCCG TTTTATTTGT CTTCGCCCGA CCCTTGCTGC AAGCTTTAAT TGGAAATGAC GCCATGAGTC CTGCCGTATT TGCCGCCGCA ATGAAGTACG TTCGCATCCG GGCGCTTGGA ATGCCGGCAG CTGCCGTAAT TGGGAGTACT CAAGCTGCTT GCCTTGGCAT GCAAGATATC CGCAGTCCTC TCTATGTTCT ATTGGCGGCG GCTGTTGTCA ATTTTATCGG AGACATGCTT TTCGTCGGGA GTACCAACCC TTGGCTTGGT GGAGCGGCCG GAGCCGCTTG GGCTACCGTA TTCAGTCAAT TTGCGGCCGT TGGTTTATTT GTGCACTGGC TTTGTCACAA ACCGCAAACG AAAGAGCGTA AACAGGTGGT CAACGTGTCT CGAGCTATTT TGGAACTGAC TGGAAAGTCG GATAGCGCCG GTGAAAACCG AAGGCGGCGC TTCATAGACA CTTTGCAGTC GTTCCGAGCG AATTTATCAG AAGAGAAGTC GATAGCAGTT CCAAGCAGAA CAGGACACGC TACGACAAAG ACACGTCGAT CCAAATGGAC CAAAAAAAAC AAGCCATCAT CGAAGGAGAA ATCTTTTTCA GTGCGCGGAT TCCTCGAGAG CAAAATCCAG AGGCGGGAGC TTGTCAGGCT TCCGTCCAAA AGCATTATCA AAGAATTTTA TCCATATATG TTGCCGGTTA CCAGCACACA GGTTGGTCGG GTTTCAGGTT ATGTTGCTAT GGCGCACGTT GTTGCCAGTT CACTCGGTAC CGTCAGCATG GCGGCTCAGC AAGTAATTGT CAGCCTTTTC TACTGCCTCT GCCCCATTGC GGATTCACTT AGTTTAACAG CGCAGTCCTT TGTGCCAGCG 
ATTGCCGAAA AGAAGGTTTC GAAAGAACGA ACCAATGCAT TACGAAAGAC GACGAGAAAC TTTTTTAAGG CCGGCTCAAT TTTCGGCTCT GTGATGGTCA GCGCTGTTCT CTGCATTCCA TTCTTGTCGC AATTTTTTAC CGCTGATCCT GTTGTCAGTT CCATGGTAGC GTCCATTGCC CCGTTGCTTG TGGGCGTGTT TGCCGTGCAT GGTATTGTTT GCGCATCTGA AGGTCTCTTG TTGGGGCAAA AGGATCTGGG GTTCTTGGGC AAAATGTACG CCGGCTTTTT TGCAGTTGTT CCTTTTTTTA TGCTGCGGGT GAAACGTGCG GCTGCGCGCG GCGTACCAGG AACTAATTTG AGTTCTGTCT GGAAGGTGTT CTTAGGCTAC CAACTTTTCC GATGGATGAT GTGGATGTCT CGAGTGGTCA CAATTCAGCG AAGAACTGAG CGAGAATCAG CCGGCTTTAT GTAGCTGAAA GCATTTCCTA AACTTTGTAC ATACCACTTC TA
|
Protein sequence | MQILCITVLS LVAVTVAFRP SQRSLVRSSA ASLTLRHAPF SRLTKIYEQT DQTQTEVLKL EPILLVSDTD KVVDPGEELF LATVEEAVDE ALQAEVETLG SENEVETFGV VLESFTEESA NVISSSEMSA EFASVSETSA EVASDVVSAI LAASQEAADA AEATLSDEDI FNYSTPGFKN ATVEVNRIPE ILPASQIVGD PSVAPKIAAP SVGKILKFAL PATGVWLCGP LLSLIDTSSV GILSGTVQQA ALNPAVAVTD YAALLIAFLF TGTTNLMASA LESDRGVEGS PRSTSTLKGA IQLSTYVGAG LGAVLFVFAR PLLQALIGND AMSPAVFAAA MKYVRIRALG MPAAAVIGST QAACLGMQDI RSPLYVLLAA AVVNFIGDML FVGSTNPWLG GAAGAAWATV FSQFAAVGLF VHWLCHKPQT KERKQVVNVS RAILELTGKS DSAGENRRRR FIDTLQSFRA NLSEEKSIAV PSRTGHATTK TRRSKWTKKN KPSSKEKSFS VRGFLESKIQ RRELVRLPSK SIIKEFYPYM LPVTSTQVGR VSGYVAMAHV VASSLGTVSM AAQQVIVSLF YCLCPIADSL SLTAQSFVPA IAEKKVSKER TNALRKTTRN FFKAGSIFGS VMVSAVLCIP FLSQFFTADP VVSSMVASIA PLLVGVFAVH GIVCASEGLL LGQKDLGFLG KMYAGFFAVV PFFMLRVKRA AARGVPGTNL SSVWKVFLGY QLFRWMMWMS RVVTIQRRTE RESAGFM