Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43268 |
Symbol | |
ID | 7196977 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2485975 |
End bp | 2489447 |
Gene Length | 3473 bp |
Protein Length | 1050 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177530 |
Protein GI | 219111557 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGCAGCGTC ACCCGAAAAA GCTCAAAAGC AACCCAACAG TAACCATGAT CGAGAGTACT ACTAGACACA GCGCCAACGG TAGCGAAGGG GACTCGAACG AGGTGGTAAG CCACAGCACG AGAACCCCAC TTTCCTCACA GCTGTCTTTG GGTCCCGAAC GCATTCCCCC AACTTCAGCA AGAAAGAGTG GTGACGAACT AGTGAAGTAT TTTGCTCTTC CTTGGCGGAC ATTGCTTCGG AAGCTTGCGC GTGTCGGCAC GAAGCAACTG TCCGCAGCCG TAGCAGCCAC GGGAAGGGCG TACTACAACG TGGGAACAGT GAAGAAATCG GGGATTGGTC ATCGCGATGT AGCACCAATG GCAGCGGGGC TGCGGTAGTT ACCGAAAAGA CCTCCTCGGG AGATGACGTC GAGTCACGTG CCAACACTTC GAATGTCGCG TCTTTATTGG GAGATAGACG ATCCCTACCA ATGTCTGTAG GCCATGACAA TGGTGAAGTG TTTTGCGAAC CCTCGGGAGC AGCTCAGCAA TTTCATCACT ACCCGCATCG CCAGCGAAGA CACTCTGAGA GAGAGGGAGC TGGAACTACA GGAGAATTAC CCCTGGGCGG GGATCCACTT CAGGATTGCA ATTCACACTT ACACGCAGCC AATCGCCGTC GAGTCACCGC ATCATCACCG CTTTCCAAAC ATTGTCCTTC TCCAAAGGCA TCGTTCGAGG CAATGCTGGC AACCACCACC AGCTCGATTC GCTCAGGAAA AAATGGGGCG GCAAAGAAGA ACACAGGATT ATCGAAGGGA CGAAAGAACA CGAAAGGTGT ATTGGCACAT CATATGGCTG GGACGTATCC GATGCAGAGA ACAACTCATC GATTTGCCCA ACTACAACAA AAACGCACCG ACTCGGATGG GATGACCTCG AATGAACCTC GAGATGCTTT TAAAAAAGCA TGGATGAAAA TGGATAGTGC TTCCTTCTCA AATGGAAGCC CTTCATCTCC CAGTAGCCGG ACTTCCGACG ACGACATAGA AGGCTGGAAC GGGAGCTACT CTGGAAGTGG CACAGAAGGG GGTTATGCGG CTTCGGCGTC GTCTAACGAA ACTCTTTCAG GAGCTGAAAG ATCATGCTCG TCTGATTTAT CTGCAGAAGC TTTAAAGTTA AGACACATCA ACACTTCCTG CAATCCGGAC GCTTCTGAGC ATGGCAGGAA GCAAGAGTCG ACGTTTTCAA CATCTTCCGA CATTGCCGAT TTCAGCTCAG GTGGCTCCGA GTTTGAATCC GTTGACGAAA GCATACCTGA CGCGGTATCG GATTCCGCTT CGAACGCATT GTCTTCGAAC GGTAGCCGCT TCTTCGACAA AAGAGAGGAT GAATTCAAGG CCACCAAACG AAACCATCCA AGACGTCAAG CTTCCATACC TCGCAATCAC TTGAGCTTGT CGCAACTTGC AATGCAGAAT AGTAATAAAG TTTCGAATCA GACCAACCTG ACTGTGAAAC CAAGCGCTTG TCACGAAATT AATGGAAAGG CCCCTATTTT AGCCTTGGGA GGAGATGTTA TGGCGCACGT CTTGGCATTT TTGGAGCCAC CAAAGATCCT GGAGGTTATT ACGGCTCCAC TCTCGAAAGA TTGGCTGAAC GCCTTCTCCC ATCAGTCAGA GCTTTGGCGT GTGCTCTGTC TTCAGGAGCC TTTGAAGGCT AGAATCGAGA ACGAACCGGA CAGCGACGAC GAATCTTTTA CAGGTTCTTT CTCTTCTATC ACCGGATCCC AACACCGCCT CACATTTGGC AAGTTTAGAC TCTTATACAC GGCATTTGTC CGTTGTATGA AATATTTGGC TCGCATTAAG GACGACGCAT CAAATGGAAG ACCTCTATCG GTGGTAGATT ACGGTGTTGC CGACGGCATG GGAAGCCACG ACATTGGTTC CAATCAGAAT CTTCGACAGT TTCTGGCAAG AAGTAGAGGT GTAAGTTTCC TCACTGGACA GAATGAACAG GCCGATACGC CCCACAGCGG CAGCCGTCAT CACGCATTTG CTACTATAAC TCAGTCGATT GGAGTCTCGG ATAACGGCAG TGCGACTAAA TCCAAACGGA AGCATAAGGA GGAGGAGGAA TGTATGGCTG TCAAGAAGAT GCGACGCTTC GCCAGTGGGC CCTCGGCGCT TACGCAGAGA TTGCTCGGAC CACCCAGCAC CGGAACTCCA GGTAACACCG AGTTGCCATG GTCCTGTGCG ATCTTTTCGA TCGTTAATTG GATGCTTGCT TTCTCGGATG TGGAGGGTAT TCAGACGATG TGCCTGAGAG ATTTGCCGTC TTTGCTCGAA GATGAACAAC AGCGAATCAC TGCCCAGCGA GCAGGACTGA CGGACGTGGT ACTCCGCGCT ATGGTTACGT TTCCCGACAG TAGCCCACTG CACACGGCAG CGTTTCACAC CATTGTCCTT CTGGCCCGTC CATTGGGTGG TCGAGAGGGC ATGTTGTTCC ATACGTCAAT GGTGAATTCA TCGGGCATCT TCAGTGCTAG CAGTGTGGCC TCTCGCAATG GTAAGAGCGG TATAGCTGTT ATGCTTGATT CAATGAAAAG GTTTCAGCAA GACGAAGTAC TTCAAGCCAT GAGTTGTTGG TCCCTTGTAA ATATTGCTCT AGCCCCGGCA CAGAAGGAAG TTCTTGTGAA TCTTGGTGGT ATCGAAGTGA CATCCAGGGC TATGTGTGCT CATCCGCATA GCGCTGAAGT TCAATTCCGT GCGCTTTTTG CTCTGATCAA TCTTGTAATC CCCTCGCGAG ACCAAGGAGA GCCTTTAAGA GGAGAAGTGA TTACCGAAAA GGAAATGCTC GATGAAAGTG TTGATCAGAT CATCCACCTC GTTCTCCTGG CGATGAAGAA CTTTTGCGCA TCTGAAGCAA TCGTGAATAG AGCGTGCCTT GTTCTTCACA ATGTGTCACT TACTCGAGAG TACCACGAAA CACTCCTTTG TTGTCCAAAC TGTTACCAGA TGCTGGAATG GTGCTTGGCC AACTACCCAA CTGATCAGGT CCTGCAGCAA AGCGCTTCGG GAACCTTACA CCGCCTCCAG CTCACTTTGA ACAGTGACGA AATCCTTCGA ACTCGATTTG CTACTACTTT ACAAGCGCAG CAGCAAATGT CCCTCGAGAA TGTGCATAGA GAGGCGATTG TCGCTCATGA GCAGCACGCT CAAAGTCGGA CGATATAACT GTAAAATACA CCGCCGGAGC ACATGTACTG AAATGTTGTT GACAGTGAGT GACTTTTGAT TATATTAAAT TTGACCATAA GGACGTGATC AACTATTTGA CTGAATCGAC ATGATATTGC TACAACTGTT TGTTAATATA CGCACAAACA ACGTAGCATC AATGGACAAT TGGTCTTCGC GCTTTCTGGT ATTAATAGTG AGGAAAATAG GCGCTATACA TTAGACTAAA CGGTGTAATT TCACGCGATG CCG
|
Protein sequence | MIESTTRHSA NGSEGDSNEV VSHSTRTPLS SQLSLGPERI PPTSARKSGD ELVKYFALPW RTLLRKLARV GTKQLSAAVA ATGRAEEIGD WSSRCSTNGS GAAVVTEKTS SGDDVESRAN TSNVASLLGD RRSLPMSVGH DNGEVFCEPS GAAQQFHHYP HRQRRHSERE GAGTTGELPL GGDPLQDCNS HLHAANRRRV TASSPLSKHC PSPKASFEAM LATTTSSIRS GKNGAAKKNT GLSKGRKNTK GVLAHHMAGT YPMQRTTHRF AQLQQKRTDS DGMTSNEPRD AFKKAWMKMD SASFSNGSPS SPSSRTSDDD IEGWNGSYSG SGTEGGYAAS ASSNETLSGA ERSCSSDLSA EALKLRHINT SCNPDASEHG RKQESTFSTS SDIADFSSGG SEFESVDESI PDAVSDSASN ALSSNGSRFF DKREDEFKAT KRNHPRRQAS IPRNHLSLSQ LAMQNSNKVS NQTNLTVKPS ACHEINGKAP ILALGGDVMA HVLAFLEPPK ILEVITAPLS KDWLNAFSHQ SELWRVLCLQ EPLKARIENE PDSDDESFTG SFSSITGSQH RLTFGKFRLL YTAFVRCMKY LARIKDDASN GRPLSVVDYG VADGMGSHDI GSNQNLRQFL ARSRGVSFLT GQNEQADTPH SGSRHHAFAT ITQSIGVSDN GSATKSKRKH KEEEECMAVK KMRRFASGPS ALTQRLLGPP STGTPGNTEL PWSCAIFSIV NWMLAFSDVE GIQTMCLRDL PSLLEDEQQR ITAQRAGLTD VVLRAMVTFP DSSPLHTAAF HTIVLLARPL GGREGMLFHT SMVNSSGIFS ASSVASRNGK SGIAVMLDSM KRFQQDEVLQ AMSCWSLVNI ALAPAQKEVL VNLGGIEVTS RAMCAHPHSA EVQFRALFAL INLVIPSRDQ GEPLRGEVIT EKEMLDESVD QIIHLVLLAM KNFCASEAIV NRACLVLHNV SLTREYHETL LCCPNCYQML EWCLANYPTD QVLQQSASGT LHRLQLTLNS DEILRTRFAT TLQAQQQMSL ENVHREAIVA HEQHAQSRTI
|
| |