Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46516 |
Symbol | |
ID | 7201672 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 576336 |
End bp | 579674 |
Gene Length | 3339 bp |
Protein Length | 1051 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180860 |
Protein GI | 219120234 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATCGCCCCA CCACTCCGTG AAGACACTGC AAGTTGCACG TCCGACAGTT TTTGGCTAAC CATACGTCAA TCACAGCATG GTTCGCAAAC GCTTGGACGA CCGGGTGCGT GCACTGTTGG AGCGCTCCGT CGTGACGGGC CAGCGATCCA TGCTGGTCCT GGTCGGCGAT CACGGCAAGG ATCAAGTGCC CAATCTGCAC CAAATATTGA CGAAATGTTC CGTACAGGCC CGTCCCAAGG TGCTCTGGTG CTACAAAAAG GAACTCGGCT TTTCGACCCA CCGCAAAAAG CGCATGAAGA AACTCAAACG CGACAAATCC CGGGGATTGG TGGGTGGGGA AGCCGACCAG GCCGATAACT TTGAACTCTT TGTCAGTCAA ACCGACATTA CCTGGTGTTA TTACAAGGAC AGTCACCGCG TGCTGGGAAC CACCGTTGGT GTGCTCGTCT TGCAAGACTT TGAAGCCTTG ACACCCAATC TCATGGCCCG TACCATCGAA ACCGTGGCCG GCGGTGGCTT GGTCATTTTC TTGCTCCGGA CGGTCAAGTC GCTCAAACAG TTGTACGCCA TGAGTATGGA CGTACACGCC CGGTACCGCA CAGAGTCCGC TGGTGACCTG GTCCCACGCT TCAACGAGCG ATTCATTTTG AGTTTGGGAA AATGTCCCAA CTGCTTGGTG TGTGACGACG AACTCAACGT ACTACCGGTG AGTCGCAAAG CACTGAACGA CTTGTCGCCA AACGCCGGTT GGTCCAAGGG TGATGCCGGC GAGGTAATTG TGCAGGATAC ACCGGAGCAA CGGGATTTGA AGGAAATTCA AGAAGCACTC CTGGATACAC CCCACGTTGG TGTGCTAGTG GAGCTAACGA AAACACTGGA TCAGGCCAAG GCCTTGTTGG TGTTTTTGGA AGCCTGTTCG GAAAAGACAC TCAAATCGAC GGTAGCCATG ACGGCCGCTC GTGGTCGGGG AAAGTCGGCA GCCATGGGAT TGTGTCTCGC TGGCGCCATT TCGCTGGGAT ACTCAACTAT CTGTGTGACC GCCCCGGAAC CGGAAAACTT GGTCAGCGTC TTTGACTTTC TATGCCGCGG TCTCAAGGCA CTCAAATATC AAGAACACAT GGACTATAGT GTAACGTACA ATTCAGCCAG TGGTCGCGAA CAGACCAAGT GTATCACGGC CATCAATGTA CATCGTAGTC ACCGGCAGGT GATCCAATAC GTTGATCCAG CGGAAACGGA CAAGTTTACG AGCGCCGAAA TTGTGGCCAT TGACGAAGCT GCAGCAATTC CGTTGCCCGT TGTGCGAGCT CTCATGAGCC ACCCAGATCG CTTGACTTTT TTGAGTTCGA CTATTAACGG ATACGAAGGT ACCGGTCGTG CTCTTAGTCT CAAACTCATT AAGGAACTGC GGGATGCCAA AGGAGGACGG CACGCCGAGA TGCAGGCCGC CTCTTCCGCA GCAAACTCTA TCGTTGGAGC AAAATCGAAA AAGGGCGAGG CCAAAGTTCA TGAACAACGG TGGGCCGCAG CAGCGGCTGC AATTCTGGAA GCCAGCGAAG GTTCTGATAA GCTGTTTGGG CCATTGAGAG AAATCGAACT GCTGACGCCC ATTCGATATG CACACGGCGA TTCAGTCGAA GCATGGCTCA ATAAGCTCCT TTGTTTGGAC TGTGGATCAG CGTCTAACTT GAAACTGAAC GGAGGGGCTC CTGCTCCAGG CGATTGTGAA CTTTACAGCG TAGATCGTGA CGCTCTTTTT TCTTTCCACA AGTTATCCGA AGCTTTCTTG CAGAAGGTCA TGGGACTTTA CACGAGTGCT CATTACAAGA ATTCACCCAA CGATTTGCAG ATGCTCTCTG ATGCTCCAGC TCATTCACTT TTTGTACTGC TTTCGCCGTC GGCTGAACAA GATGCAAATT CTCTCCCGGA TGTTCTTACC GTTGTTCAAG TGGCCCTAGA AGGACGTATC TCTAGAAAGG CTGTCGAAGC GCAGCTTGCC CGAGGACATC GCTCGGCCGG CGATTTGATT CCTTGGACAA TTTCTCAGCA ATTTGGTGAC TCTAAATTTG CTCAGTTGAG TGGAGCGCGA ATTGTGCGTG TTGCTGTCCA CCCTTCGGTG CAAGGAATGG GATACGGGTC CAGGGCGATC GAACTTCTTT ACCGATTCTA CAACGAAGAA ATGGTTTCGC TCGTCAATGA CGAAGGTAAC GATGATGCTG ATTCTGACGC AGAGCGCAAT GGAGAAGAAG AAAGCGACAA TGATGAGCCG ACGACATCTG GAATTGGAAT TTTGGGTGAA AACTTGAGGC CCCGCAAGGA ACTTCCACCT CTTTTGCTTC CTTTGACCGA AGTGGATATG CCGAGACTTG ATTGGGTTGG GACATCGTTT GGGCTAACTC TTCAACTTCA CAAATTTTGG AGCCGTAGCG GAATGCGGAT GCTATATTTG AGACAAACCA AGAATGAGCT TACGGGTGAA CATTCATCCA TTATGGTCCG TGCTCTACCA AGGCGAAGTG GTGTCGATGA CTCTTGGCTT TACGCGTATC TGAGTGATGC TAGGCGACGA TTCACCACTC TCTTTAGCGG GCCTTTTCGC CACTTGGACG TTAGGCTTGC TCTTTCCGTG TTCGATAATA TGGATGTGCC AAGCAACACC ACCGAAGCTA AGCAACGCGC AGGAGCTTTG GCAGGCACTC TTACCTTCAA GGAACTCGAC TACTTCTTGA CACCATATGA CTTGAAGCGC CTTGAATTAT ACGGACGAAA TTTATGTGAT CATCACCTTG TAATGGATCT ACTACCAATA ATTGGGCGAT TGTACTTCAC TGGGCGTTTT GGATCTGACT TCAACTTATC TAGCGTCCAA GCCGCGCTCT TCTGTGGGAT TGGACTACAG AACAAGAGCG TCGACATTTT GACGAGAGAG CTTGGTCTGC CAACCAATCA AGTCCTTGCA ATGTTCAATA AAGCAGTGCG AAAAATGTCC ATTGCCTTGA ACTCTGTCGT TGAGGAGAAA GAGAAAGAGA GTCTCTTAAC TGGTGAGAAA CGAAGCAGAA TTGAAGAAAG TGCCGAACAG ATGCGTCATG TTTCTCGGCA AACTTTGGAT GAAGATGCCG AGCAAGCTGG CCAGGAAGCG ATCGCAACGC TCAGGGCGAA CGAGATGGCG AACCATCTAC CGGAGCTTGC ACACGATACT GAAATGCTGA AGTACGTCGT CAAGGGATCT GATAAACAGT GGGAAAAGGT CCTCCAAGAC AAAGATGTAA GTGGAACAGG CACTGTGCAA ATTTCGGAGG TACGAGAAAA GAGAAAAATT GTTGACGATG ACGACATAG
|
Protein sequence | MVRKRLDDRV RALLERSVVT GQRSMLVLVG DHGKDQVPNL HQILTKCSVQ ARPKVLWCYK KELGFSTHRK KRMKKLKRDK SRGLVGGEAD QADNFELFVS QTDITWCYYK DSHRVLGTTV GVLVLQDFEA LTPNLMARTI ETVAGGGLVI FLLRTVKSLK QLYAMSMDVH ARYRTESAGD LVPRFNERFI LSLGKCPNCL VCDDELNVLP VSRKALNDLS PNAGWSKGDA GEVIVQDTPE QRDLKEIQEA LLDTPHVGVL VELTKTLDQA KALLVFLEAC SEKTLKSTVA MTAARGRGKS AAMGLCLAGA ISLGYSTICV TAPEPENLVS VFDFLCRGLK ALKYQEHMDY SVTYNSASGR EQTKCITAIN VHRSHRQVIQ YVDPAETDKF TSAEIVAIDE AAAIPLPVVR ALMSHPDRLT FLSSTINGYE GTGRALRGRH AEMQAASSAA NSIVGAKSKK GEAKVHEQRW AAAAAAILEA SEEIELLTPI RYAHGDSVEA WLNKLLCLDC GSASNLKLNG GAPAPGDCEL YSVDRDALFS FHKLSEAFLQ KVMGLYTSAH YKNSPNDLQM LSDAPAHSLF VLLSPSAEQD ANSLPDVLTV VQVALEGRIS RKAVEAQLAR GHRSAGDLIP WTISQQFGDS KFAQLSGARI VRVAVHPSVQ GMGYGSRAIE LLYRFYNEEM VSLVNDEGND DADSDAERNG EEESDNDEPT TSGIGILGEN LRPRKELPPL LLPLTEVDMP RLDWVGTSFG LTLQLHKFWS RSGMRMLYLR QTKNELTGEH SSIMVRALPR RSGVDDSWLY AYLSDARRRF TTLFSGPFRH LDVRLALSVF DNMDVPSNTT EAKQRAGALA GTLTFKELDY FLTPYDLKRL ELYGRNLCDH HLVMDLLPII GRLYFTGRFG SDFNLSSVQA ALFCGIGLQN KSVDILTREL GLPTNQVLAM FNKAVRKMSI ALNSVVEEKE KESLLTGEKR SRIEESAEQM RHVSRQTLDE DAEQAGQEAI ATLRANEMAN HLPELAHDTE MLKYVVKGSD KQWEKVLQDK DKREKLLTMT T
|
| |