Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47154 |
Symbol | |
ID | 7202052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 634128 |
End bp | 636204 |
Gene Length | 2077 bp |
Protein Length | 639 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | iron ion binding protein |
Protein accession | XP_002181240 |
Protein GI | 219121785 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCCC AAATTCGTCG GCGGCAAACG AAAAGCCGAC GGAATCGCGA TGGCCTGATT CGGTTCGCCC GTGACAGAAC GAGCCAAAAT GGCGAAGATG GGATTATCGA ACGATTATTT CAACTACTAC CAACCGAAAG CGAGCGTTGG TGCGTCGACT TGGGCGCCTG GGACGGAGTT CATTTGAGCA ACACTAATTC GCTGCTGGTT GCCACAGCTG ATGAGCATGT GTCTGCCGCA AACAGCACTC TATGGCATGG TGTGCTAGTG GAAGCCGACA CAGACCGCTT TCAACACTTG CAACAGCTTT ACGTCAATAG AGGCAATGTT TGTTTGAACG TATCAGTTTC CGGCATGTCT GATTCGCCGC ACACACTGGA AAATATTTTG AAGACACACG GTGACCGAAT TAGCTTACCC AGTGACTTTG ATTTTCTATG CATCGATATT GACGGTGCCG ACTACTGGGT GTGGCACGAT CTATTGAAGT CGGAATCCTA CCGCCCTAGG GTTGTCTGCG TGGAGTTCAA TCCGACCATA CCGGATGATT TAATTTACAT TCCAGAAAGA AGTGACGTGA TACGACAAGG GTGCAGTCTA GCGGCGTTGG TAGAGCTTGC CAACGAATAC GACTACGTCC TGGTCGAAAC GACCTTGTAC AATGCTTTTT TCGTCCCGAT ATCACTCTAC GACTCCTATC TGGCTGACGA AATTCCGGAT ACTTCGATTG AAGTCCTACA CGAGACTACA ATGGGTACAG CCCTTTATCA GCTTTATGAC GGTTCCATCA AGTTGTGGGG TTGCAAAAAA CTCCTCTGGC ATCGACTACC GATGGACGAG TCCAAGGTGC AGATGCTACC GAGAGGGCAG CGTCAGTTTC CATTTGCCCC CCGCGAGACC AAAAATTCAT TGGCAATGAG CCACGCTGTT GACTTGCGCG TTTGCTACAG CGAAACATCT TCCGCCGACC AACGTGCAAT ATGCTCCGCC AATCTCGTTC GTCAGTTACA GAAGGACGGC TTTTGCTACG TGCGTGGGAC AGGAATTGCT CGACAAACGT GCCAAAGAGC ACTGGAGGCG ACGCACTCGC TGCTTCAAGA TGCGGACGAA TGTGTGCGTC GGTCGTGTTT GACCACAGAC CGTGCTCGGC GCGGGTACAG TCCCATGTGT ACCGAGAATT TCAGTTCGCT GTTGGGAGAA ACGGGACCGA ACGATTTGGT GCGCAAGTTT CGGGTAGGAC CCGTCGATGG ACACGAGGGG GGCGGTGCGC TGCTGCAGCC GAATGTCTGG CCAGTCGAAG GGACGTGGGA CGCGCCGACG GCGGCCGCTT TTCGCGCACA CGTCGAAGCG TACTACGGTT CCATTTGTGC GGCAGCCACG ACGATGGTGA CGACCATTTG CCAAGGTATC TTGGCGATGT ATCCGGATCT CGAGGCCGCA TTGGCTCCAC TGATGAAGGA ATCACTGGCA CACTCATCGA TTCTAACGCT TCTGGGATAC CGCGTTGGCT CTCGTCACAA GGGTCGATCG AAAGGTCCGT TGGTGGCGGC GCATACGGAC GTGGGCGTGA TTACCGTGCT CGTCTTCGAT GACGGTGATT GCGCAACCTT GCAACGCCGT ACCGGACAGG GAGACTGGGA GGACGTTGTT TTGCCCGCGT CGGTGCCGGA CGATCCCATT TTTGTCGTGA ACGTGGCGGA CTGTTTTTCC GAATTGAGTG GTGGACGTTT GCCTTCGACG ATTCATCGAG TAGTCGCGCG ACCGGGAAAG ACGCAACCGC GCAACGGCTG TGCCTTATTT GTGGGACTGG ATCCTCACGA AATGCTATGT ATCCAGGACG AAGCGATGAC GTACGAATGC TGGCGCAAAC GACGGATCGC ACGAGCGCAA ACGGTACACC GAGAGTCGTC GTCGTCATAG TGGTCCACAT CGGCCGAGTC TTGCTCTAGC GCAAGAACGC CTGTGTAAAT CGGCGTAAAC GACAGTCGAC TATAGTGTTG ACCGGAGGTG TACGACAATT GCGTCGCGTG TCTTTGAAAG GGTATATAGT TTATACTATA ATACATACTT TGTAATT
|
Protein sequence | MEPQIRRRQT KSRRNRDGLI RFARDRTSQN GEDGIIERLF QLLPTESERW CVDLGAWDGV HLSNTNSLLV ATADEHVSAA NSTLWHGVLV EADTDRFQHL QQLYVNRGNV CLNVSVSGMS DSPHTLENIL KTHGDRISLP SDFDFLCIDI DGADYWVWHD LLKSESYRPR VVCVEFNPTI PDDLIYIPER SDVIRQGCSL AALVELANEY DYVLVETTLY NAFFVPISLY DSYLADEIPD TSIEVLHETT MGTALYQLYD GSIKLWGCKK LLWHRLPMDE SKVQMLPRGQ RQFPFAPRET KNSLAMSHAV DLRVCYSETS SADQRAICSA NLVRQLQKDG FCYVRGTGIA RQTCQRALEA THSLLQDADE CVRRSCLTTD RARRGYSPMC TENFSSLLGE TGPNDLVRKF RVGPVDGHEG GGALLQPNVW PVEGTWDAPT AAAFRAHVEA YYGSICAAAT TMVTTICQGI LAMYPDLEAA LAPLMKESLA HSSILTLLGY RVGSRHKGRS KGPLVAAHTD VGVITVLVFD DGDCATLQRR TGQGDWEDVV LPASVPDDPI FVVNVADCFS ELSGGRLPST IHRVVARPGK TQPRNGCALF VGLDPHEMLC IQDEAMTYEC WRKRRIARAQ TVHRESSSS
|
| |