Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45954 |
Symbol | |
ID | 7200831 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 765913 |
End bp | 769235 |
Gene Length | 3323 bp |
Protein Length | 1041 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180113 |
Protein GI | 219118691 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.340544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACATTG ATTGTGGCCG TGGATACAGG ACAGTGGTCA ACGAACCAAT ACAAAATAGT CCAGAATTCG TGCTGACCAC CGCTAATGTA AGTGGATCTG AAATCAATTC TGGGTACGCG ACATCTCTAG GATATCTGTT CTGAGTCCAA AATTTACTTA CATCAGGGCC ACTCCCAGTA TAATCGCTAT GTTTTTCGGG TTCCCAGAAT TTCAAGATGA TACGGCTCGA AAAGGCGTGT ACCAGTTACT GTATACGGTG TGTAAAACTC TGCGCTGACA ACAGTCAACA GAGCGACGAT CCCGTCGTAT GTTCATGTTC CTCTTTGGAG ACGAGGGATG TATTACTTCG ATCATGACGA GCAATAGCAA TAGCAATACC AATACCACTA CTTCTACTCC TGCCACTTCC GCCTCTATCC CGCTAGACCT TACGAGTAGC AGTAGTGCTC CCCGCAGTCA CAATCGTCGC TACTTGTGTG CGGCCCCACG TCCGGATCAG TACGACGAGC ACGCGTCGCC AGTGTTGCAA CGTCTGGGTC ATACCCGTGC ATCGCGGAAG ACTTTGCTGG AAGCCACGTT GCGGGCCTTG CAAACGCGAC ACAATCTGGA ACGGAAAACT GGGCCGGGTA GAAAGTCGAA CGCTGAGCAC GCCATTCATT TTTACGAATA TTTATTGGAG TTGGTAGAAA ATGGCGAGGA GGAGGGTGAG GACATCGGGG ACGCCGACTC TATTTCGAGC GCTAGTAATA AGGATGAAGC AGAGCACGAG AGAAATGTAG AGTCTGTAAA CAAAAATACG CTCAAGGACG AGAATAACGC GACCGAAGCA CAAGAGGATT CTGTCAAATC GGAATTAGAC GAAGCCTCTT CAATGGACAC GGCAACCTTT GTGAAAGCTC ACAACGACCT TTGCGAAGTG TGTGACGAAG CCGGCGATTT GCTCATGTGT GAAACCTGTA ATCTTGTCTT TCACGTTGCT TGTGTCCGTC CTGCCTTGGA AACGTTGCCG GAGCAAGATT ACAAGTGCGC GTATTGTGTC CTGTCGACAG AACCGAAGAA TAGTAGACCA CGTAAGCAGG CCGCCGCGGC AGTTCGGCTT ATGGCGCGTT TGCGTAATCA GTTTCAACGA AACAAAAGAC GTGGACGAAA CGATGCAGGA GACAGTAAAC GCGACAACGA GGACGAGGAC CAAATCGTCA GCAGTGATCT GGGCGACAAG CCTTCCGAAG CATCAGGCAA CGAGGAAATA TCGAAGTTAG AGCCATCGGC CGGAACGGAT GAGGAGAAAA GCAGTGAAGA AGAGAAAGAC AAAACGAACG ATACTAAAGA CACAACGAAA GAAATAGCGG ACCCACGGAA GGTGACAGAA GACAGAACGA ATATAGAGGA AGCCACACCG CTTGCTTCCA ACAAAAGGCG TAAGCTGGAG CTCTACAGGA TTACTGATGC CTTTTCGACT CCTGAAAATA TGGAAGACGG ACGATCTAAA CGCAATCGTA AGCAACCAAT GCTTTACGAC CCTCAGGCGG GACCGGCTCG CAAGTGGCAA TCGGACGAAC CTAAGTACTG GAATTCTGAC TCACAGTCAG ACAATTCGAT TTCTGGTGAA AGTGAAACCC GCGATGCTGA CGAAAAAGAT GCATTTGGTA CTGTCGTGAA AAAGGTAGGA TCGGCGCGCA AAGAAAACGG TGAGATACAC TGCAGCTTTT GTGAAGACGA CCCTTTTATC GAGCTATGCT GCTTCTGTGG CTGTCGTGTA TGTTTTGGGA AGCATCATCA ATCAAAACTG TTGTTATGCG ACGAATGTGA CGACGAATAC CACACTTTCT GCCTTAGCCC ACCGCTTAAA TCACTGCCAG CGTCCAATGC AGAATGGTTC TGCCCTTCAT GCTCCGTTAG CCAACAAAGA CGACAGATCA CTACGAAATC GCTGTCGTCA CGTATCGGGA CCCGGGGTGC CATGAGCAAG TCCCCAAGTC CAACACGTAT ATCCTCTCGC CAAGCCGCGA CCAAAGCGTC GCACTCGACG TCTTTGACCG AACACGTTAA GCGCGGTCCA GGGCGTCCGC GAAGCAAGGA TCGAATTCTT ACAGTGGCCG TAGGCAAGAA GCGTGGACGA CCGCCCAAGT CAGCATCGCA AGGCAGCCCA GACAAGAAAC GTGCAAAGAC AGAGCCGACA ACGAAACTTT CTTCACAGAC ACCAAAATCG GATGACGGGA AAGCACGGAG TAAGTCTTTG GTTGGCGCCG TCTCCGATCA TGACCCGGTC ACAGACTATA TCAGCGCCAC GGCTCCAATA CAAGAAGTGA AAATTAGCCG AAGTGGACGT CCAGTCAAGC GGGGCAGTTT TCACGACGAA ATTGAACAGC GCGAGCAGCA TCTGAGATCT GACCGGTCTC ATCCAATATC ACCGTCGAAA GCTATCACAA ACCAGACATC ATCTACTCCT GCCACCATCC CAGCTGAAAT TATTGAGGCC TCAACCATAG CTGATTCTGA ATCTGATTCT TTGAAGGCCA CCAAAACGAC TTCTGTGTTA CAGGGTAGTC AACAGGCTTC CGCAAACGCA ACTGCACTGG AAAATGCCTC TCCTTCTCCT GTATTTTCGG AACCAGCGTC CCCGCCAGCG CCAATGAACA ATAAAGCCAG CAAGGTGTCA GTGAAAGTTT CTGTCTCGAA TCCCACCCCA AATGTGCAAC CATCGGTTCG AATTAACACT GTTCCTTCTA TTGCCACTAT TCCTCCCATG GTAGAGCCCA CACCCGTGTC CGTGCCCCCT ACAGCGATGA AGCCAGTGTC AGCTCCGATG TCTTCTTCCA CAAGCACTTC GTTGCCAACA ACGGGACAAT CTACACTTCT ATCACCCTCT AAAATATCGG ATCCCGCTCC TGCAACTAAC AATGTTGAGC CCGCTGCCGT TATTTCAACT GCAGCCATAA CTATGGCCAA AGCAGCTGTA CCACAGCCGC TTACAGCTTC TGCCAGCGTA CCACCGCCTA TCGCCCAAAA CAAAGAAGTC AAGGTGCCTC GCCGCAAACC TGGAGCGCGA GAGTGCATGC AAATCTCTCG GCGATTCGGT GTTAGAGCCA TTCCTCAAAA GTACATGGAT ATTATGACGG ATTATTGCAA GCGCGGAAAA GTTGAACATT TGATTCGGAT GCGCGAGCGA TTGGACGATC ACTCCCGCTT TCTAGAAGCA CAGTTGGCAG GATTGGAGGC CTTGGTCCTG GAAAAGGGAG AATCAAGTGT TGTGGTTCCT TCCATGCCGT CAGGTCCTGA TCGCAAGCTA GAACGCACTT TGGGGACAGA GTATGATTTG TAA
|
Protein sequence | MYIDCGRGYR TVVNEPIQNS PEFVLTTANS TERRSRRMFM FLFGDEGCIT SIMTSNSNSN TNTTTSTPAT SASIPLDLTS SSSAPRSHNR RYLCAAPRPD QYDEHASPVL QRLGHTRASR KTLLEATLRA LQTRHNLERK TGPGRKSNAE HAIHFYEYLL ELVENGEEEG EDIGDADSIS SASNKDEAEH ERNVESVNKN TLKDENNATE AQEDSVKSEL DEASSMDTAT FVKAHNDLCE VCDEAGDLLM CETCNLVFHV ACVRPALETL PEQDYKCAYC VLSTEPKNSR PRKQAAAAVR LMARLRNQFQ RNKRRGRNDA GDSKRDNEDE DQIVSSDLGD KPSEASGNEE ISKLEPSAGT DEEKSSEEEK DKTNDTKDTT KEIADPRKVT EDRTNIEEAT PLASNKRRKL ELYRITDAFS TPENMEDGRS KRNRKQPMLY DPQAGPARKW QSDEPKYWNS DSQSDNSISG ESETRDADEK DAFGTVVKKV GSARKENGEI HCSFCEDDPF IELCCFCGCR VCFGKHHQSK LLLCDECDDE YHTFCLSPPL KSLPASNAEW FCPSCSVSQQ RRQITTKSLS SRIGTRGAMS KSPSPTRISS RQAATKASHS TSLTEHVKRG PGRPRSKDRI LTVAVGKKRG RPPKSASQGS PDKKRAKTEP TTKLSSQTPK SDDGKARSKS LVGAVSDHDP VTDYISATAP IQEVKISRSG RPVKRGSFHD EIEQREQHLR SDRSHPISPS KAITNQTSST PATIPAEIIE ASTIADSESD SLKATKTTSV LQGSQQASAN ATALENASPS PVFSEPASPP APMNNKASKV SVKVSVSNPT PNVQPSVRIN TVPSIATIPP MVEPTPVSVP PTAMKPVSAP MSSSTSTSLP TTGQSTLLSP SKISDPAPAT NNVEPAAVIS TAAITMAKAA VPQPLTASAS VPPPIAQNKE VKVPRRKPGA RECMQISRRF GVRAIPQKYM DIMTDYCKRG KVEHLIRMRE RLDDHSRFLE AQLAGLEALV LEKGESSVVV PSMPSGPDRK LERTLGTEYD L
|
| |