Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36312 |
Symbol | |
ID | 7201908 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 135764 |
End bp | 138180 |
Gene Length | 2417 bp |
Protein Length | 740 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180950 |
Protein GI | 219120423 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAG ATGCCGGAGC CGGCACGATT GCGTTCCTGC AGCGTTCGCT ACAAGCGAAC AACTCGACGG ATTCCACCGA GGAGGCAGCC TCGTACAACG ACGGCAGCGT GTTGCGCGAT ACCTTTACGG TGTACGGGTC GATTTTGCTC GTAATCTTCA TCGCATTTTG CTGGTTGCGG CGCAAGTATC CACGAGCGTA CAACGTCCGA AATTGGGTCG AAGATATTAA GACCCCGCTC GCCAAAGACC AGTTCGGTTT CTTTTCGTAA GTCCACGGAA GATCCATAGA TGTGGCGCTG TCTCTTGTTG GGGGTAGCCA CGACAACAAC ATCTGCATTA ACAGTATCCC GCTCACGCGT CTTTTCCTTC CACAGGTGGA TTTGGGAGAT TTCCACCATT ACCGAAGACG AGATCATGGA CGAATGCGGC TTGGACGCCT TGTGTTTCGT CCGGATCCTT AGCATGGGCT ACCGCATCAG CTTAATGGGT GTCTTCAACG CCATCTGGCT CATGCCCGTT TACGCCACGG CTGACGTATC GGACGACACC CGCGGTATTG TGGACCGCAT TGTCGAAGTC TCCATTGCTC ATGTTCCTGC GTCGTCGCCC CGACTCGTAG CAACGGCTTT GGCTGCCTGG ATCGTCTTTG GCTACACCAT GTACCTCATT TTACAGGAAT TCGAATGGTT CATCGACAAG CGTCACAAAT TCCTCGCCAA ACCTCGACCC CAGAACTACA CTGTCTACGT CCGAAACATT CCCATCGAAT ACCGCACGGA CTCGGGCTTG GAAGACTTCT TTCGGCAGTG CTTTCAGTAC GAGTCGGTCC TCGAAGCCAA CGTGCGCCTC CGGACACCCA ATCTTGCCAA GCTCGTGGCG CAACGAAGCG TGCTCATCGC CAACCTCGAG CACGCCATTG CGATTGAAGA CATTACCGGA GAGGCGCCGC AGCGATCAGC TTCACTCAAA TCCTCCCTCA TGATTATGGG CGGGGAAAAG GTCAACGCCA TTGAGGCTTT CGCCGAAGAA CTCAAAGCAC TCAATGCGGA TATCAAAGCA CGTATTGAAG AGCTCGAAAC CAAAAAATTG TCCCAACTGT TCATGCAAGA TGTGGAACAA CAGAGTTTGG CGACCTTGGG ACACTCGGTG GCCGGGCGCG GCGATTCAAT GTATGGAGCA GGCGACAATG TGAACGCGGA GGAGTGCGCG TCCTTGACAC CTAGTGCCCT GGCGGTTGTA CCGCGTCCCA ACGGATATGG TACCGAAATC TCCAACGTAG AAACTGCCAT TTACAACGAC GTTATTGTAG AGGAGGAAGA CGACGATGGA GACTTGTCGA CTCTGGCCAG TCGTCAGAAC GTGACCAACC ACAGCAGCTC ATCGAAGTCG ATTCTGGATG CCAAAAAGTC GATCAAGCAG TCAGTCCACC TTTTTAAAAA GGCCGCCAAT GCGGTCAAAG ACTCGGCTGT GGCGGTAGGA GAAAACGCCG CCCACATGTT GCAAACGAAC GCCGACGGTG AGAGTTACGA AGCAGGTTTT TTAACTTTTA CCAATCTACG AACAGCACAA GCGGCTTTAC AGATGCTACA TCACAGCAAG CCGTTTTCGA TTGAAGTGCA AGAGGCTCCG GATCCGCAGG ACGTCTTTTG GTTCAATGTA GGACGCACGC ACAAAGAATT GCAGATGGGA AATTTGTTGT CGTTGGCAGC CACGACTGCC TTGTGTCTTC TTTGGACGAT TCCCATGAGT TTCATTGCTT CACTGTCCAC GATTGATGCT CTCCGCTCGG AATTTGATTT TATTGACAGC TTGCTGGATG ATGCCCCCTT TTTGGTTCCC GTGTTTGAGA TTGGAGCGCC CTTGTTGGTC GTGGTGGTCA ACGCGTTGTT ACCCGTGATC CTACAAGTCT TCTCCATGAT GGAGGGCCCT GTGTCTGGGG CAGTTGTGGA AGCTTCGCTC TTTTCTAAGC TTGCAGCCTT TATGATTATC CAAACCTTTT TCGTCAGTGC AATTTCTGGT GGACTCATGC AGGTACGTGT CAGCAAAAAC GGAAGGCTAC AACAAGACTG TCGTGTAGTG GAATGGTGAG CTCACGCGCA TTTGCTTTTG ATCACAGCAA CTCTCCGAGA TGATAAATGA TTATACCCTA ATTATTGATT TGCTGGCAAC CTCCTTGCCC GCTCAAGCAA CCTATTTCAT TCAAATTATC TTTGTGACTA CGGTTTTTTC TTGCGGTATG GAAATCTTGC GAGTCATCCC GGTAATTAAA GCAGCATTGC GAAAGTGCAT CGGACCTCGC TTGACCAAAA GAGAGCGTCA AAAAGCATTT ATGGGTTTGC AACCTCTGGG CGACCCGCTA GATTTTGAAT TTGCGGATTT TTCTTCGAAC ATGGTAAGCT CCTCACGTCC AAAGTAG
|
Protein sequence | MTEDAGAGTI AFLQRSLQAN NSTDSTEEAA SYNDGSVLRD TFTVYGSILL VIFIAFCWLR RKYPRAYNVR NWVEDIKTPL AKDQFGFFSW IWEISTITED EIMDECGLDA LCFVRILSMG YRISLMGVFN AIWLMPVYAT ADVSDDTRGI VDRIVEVSIA HVPASSPRLV ATALAAWIVF GYTMYLILQE FEWFIDKRHK FLAKPRPQNY TVYVRNIPIE YRTDSGLEDF FRQCFQYESV LEANVRLRTP NLAKLVAQRS VLIANLEHAI AIEDITGEAP QRSASLKSSL MIMGGEKVNA IEAFAEELKA LNADIKARIE ELETKKLSQL FMQDVEQQSL ATLGHSVAGR GDSMYGAGDN VNAEECASLT PSALAVVPRP NGYGTEISNV ETAIYNDVIV EEEDDDGDLS TLASRQNVTN HSSSSKSILD AKKSIKQSVH LFKKAANAVK DSAVAVGENA AHMLQTNADG ESYEAGFLTF TNLRTAQAAL QMLHHSKPFS IEVQEAPDPQ DVFWFNVGRT HKELQMGNLL SLAATTALCL LWTIPMSFIA SLSTIDALRS EFDFIDSLLD DAPFLVPVFE IGAPLLVVVV NALLPVILQV FSMMEGPVSG AVVEASLFSK LAAFMIIQTF FVSAISGGLM QQLSEMINDY TLIIDLLATS LPAQATYFIQ IIFVTTVFSC GMEILRVIPV IKAALRKCIG PRLTKRERQK AFMGLQPLGD PLDFEFADFS SNMVSSSRPK
|
| |