Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49441 |
Symbol | |
ID | 7195924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 256360 |
End bp | 259512 |
Gene Length | 3153 bp |
Protein Length | 1045 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184214 |
Protein GI | 219128004 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0740088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAATACATC CTAACATGGC GTCTATGTTG ACGGCAATGT CTTCTTTACC CGCCAAGAAG CGAAGAGGTT CAAACGCTAG TAAGAGCAGT GTGGAAGGCG AAAGTAGTGA TGCGAACGAA GCATTGCCAT CGTTGTCGCA AAGGAACAAT CCTTCCGATA CAACTGCCGC AATGGCAAGC GGATCATCGG AAAAATCAAA AGCAGAATCT GCTACTCCTC GGTCTCCTGA AGGCCTTCAT GAGTCAAGCA ATCATGCAAA GCCTCTGAAA ATTAGTGAAA ACCGGCTGCC GAGCAAAAAG CGAAAGGGAT CACATGACTC CAATCCACAA GGCATCCACA GTCAATCGAA ACGGAAAGGG TCGCAAGAGT CATCCTCTTC ACTTCGGAAT CATCTTCAAG TACCGCTTCC AGCACTTCAT GATTCTAACG ATTGTAAACA ATCATCGCAA AGAGGCTCTG GTTGCTCTGA GATGCCTTTC GAAACAGATC ACAAAGCACG AAAGCGAAAA GGATCGCACG ATTCGAATAC CGTCAAGTTT GACAAGGACA CCAAAGTGCC CGCAGAACGG AAGGGATCTC ACGATGCATC GGTACAGTTT AAAGAGGACG TCAAATCGCA AAAACGAAAA GGAGCACAGG AAAAATCGTC TACGCCTTCA ATAGATTCGG CGGCGGAACA GAAACGAAAG GCTTCTCATG ATTTTACCGA ATCTGTTCTC CTTAACGGTG TATCGTCGTC GATACCGGGA CGCAAAATGT CTCACGATTT ATCAGTACAA TTCGTGACAG GCTTCAACGA ACGAAAGTGG TCGCAAGACA TTACCATTCC TAGTTTCGAG GGAGTGCCTC GCAAAGATAG CTACGATTCC TCTTTGAAAC TGGACGAGTT GCTACCGTTC CCCCCGCCAG AACCAGTTTC CAATCTTAGA CACCCGTCAT TCAACACTAC CAATACAGAT ACCCCAAGTG TGCTAACAGG CGTCCATCCG ACGGCAATTA GTGCTATGGA CCATCTCAAG GCACTCAGTG GCGCAAATGG TGATACAGCT GCGGTGGCAT CAAAAAATGA TCTAGAAGAT GAAGATAGTC GAAGCAGTTC AACGTTTGCA AGTTCGGGGC AACGCATCCT GCTAGAAGCC TTTCCCAATC CCTCGCAGGA AGGAAATTTT AGCTCGGGAA ATTTCGGGAG CCATTCAGAT ACCTCACACC CGACGGGAAG ACATCGTTTA GAGTCTTGGG GCGCCATGTC TGATTTGAGT GCCCCGTTGG CTGGTGGGGG CTGTTCAGAC TCCACAGCTG CTGCTTTGGC ACACTCAGCT CTCCAGCACG CGGACTTGGC AGACGATGTT ATGGATGCCG CCGCAGGATT AGACTCTATC AACGATTTGC ACGAGGCACC AGACGCAATT CCGAATCGTA TTTCTTTGGG ACGGGAACGG TTTAACTCTG TGGCGTCGTT GTCGGAAGTC TCGCTCTCGG GCCTTTTGTT CGACGGCATT GAGGTATCAG GGGACATGCA AGCTTTTGTA TCAGCGGCGA TGGCTACAAT GGGAGATCAG CTCGAAGCAT TGGCGGGTGC TGTGGAGACG GTCGCGAACT CAGCCGGTCC ACATGCATTG GATGCGCTTC GACGAGAGTT GGGCATGGAC AGTGATGCCG ATAGCGATGC GTCCCCTATG ATAGGCGCCA TGTCCGATAA CGGGCGGCAC AAAGGCCGTC CAAGATCCTG GTCGACTTCG TCCGGAAGAA TTTCGGTCGA TTACGAAGCT GTTGCGGCTG CGGTCGATGC GGGAGAAGCC GCTGACTTGT CCGGAGTTGC AGCTATCGAT CCTTCAAGTA GTTCAATATC AAAAGAACGC CAAAGGCATG CGAGTCGACG ACAGCTGCCG TTGCAACGAG CTCGTGACGA CAGCGATCTC TCCTTGAATT CCGACGAGCG GGCTCACCTA AATGCTTCGT TGGTCGGGTC TTCTCTGACA GATGATGAGA TCAAGCGTAT TCAAGAACGC GCTCGGAAGA AAGCTGGGTA CATTCCACCG ACCGCAAGAA GCAAAGCTGA GAATAAAGCG AACAAAGAGA AGACGGGGCC TTTCAAAAAG CGTGTCAAAC GCAATTCTCC TGAGCCGGCA CAGATACGGT CGGCTTCGCC CGGAGGAACT CACACACCGA AGGCTTCAAA TAAGTCGATG ACCATGTCTG ACAATCTGCC GCTGGTACCG GACTTGGTGC TCTCAGGGAG CACGGCAGCC AGCAAAGCTG CTAAAGGACA AGCAAGTCAA AAGTGGGAAA GCATGTTTGA ATGCCTTGTT GAGTACGTTG ACCAATGCAA AAAGGAAGAG ACGAAAGGAT TTTCTCAGAC CGAGGTTGAT CAGTGGCAAT GGGATGGCAA CGTACCCACG AGTTACAAGT CAATCGATGG CAAAGCGTTG GGTCGCTGGG TTAACAATCA GCGTTCAGCC AAAAGCAAGG GAACGTTAAA AGGTGAACGC GAGCAACGAC TTCTGGATGC AGGTTTAAAA TGGAGTGTGC TAACTTCCAA CTCTTGGAAC GAAATGCTGG AAGAATTACG ACTTTATATC AAAGAACAAG CGGCGAAGGG GAAGAAATGG GATGGCAACG TCCCGACGAA TTATCAGATA AAGAACCGAT CAAATGGCCG GTTTGCTGGC GAAGACAAAA ATCTTGGACG TTGGGTCAAT CGTCAGCGAA GTCAATTCCA TGCAGGAAAG TTGAGGAAAG ACCGTCAACT AGATTTGGAG AAAGTTGGTC TGAAATGGTC AATGCTTGCC ACCAACTCCT GGGATTCCAT GTACGAAACT CTTTGTGAGT ACGTGGACCA GAGGAAAAAG GAGGGCGGAG GATGGGATGG GAATGTCCCT GCCAACTATC GAACCAACGA TTATCCGCCT CGTGCGCTCG GCCGATGGAT CAACCGCCAG AGATCGGCGT TCGCAAAAGA CAAGCTAAAA AGCGAATACG TTGAAAAATT GAGCGAAACG GGCTTGAAGT GGAGCGTTCA CGAGCGTACT TGCTCTGAGA AGGACGACAT GGATGGCGAA GACCACGAGG CTTCCTCTCA ACCGAGCGTG AAGCCAGAGA CGGTAACGTC TACCAAATCT AACGAAATGG CGGTGGAAAC GATTAAAGTG TAG
|
Protein sequence | MASMLTAMSS LPAKKRRGSN ASKSSVEGES SDANEALPSL SQRNNPSDTT AAMASGSSEK SKAESATPRS PEGLHESSNH AKPLKISENR LPSKKRKGSH DSNPQGIHSQ SKRKGSQESS SSLRNHLQVP LPALHDSNDC KQSSQRGSGC SEMPFETDHK ARKRKGSHDS NTVKFDKDTK VPAERKGSHD ASVQFKEDVK SQKRKGAQEK SSTPSIDSAA EQKRKASHDF TESVLLNGVS SSIPGRKMSH DLSVQFVTGF NERKWSQDIT IPSFEGVPRK DSYDSSLKLD ELLPFPPPEP VSNLRHPSFN TTNTDTPSVL TGVHPTAISA MDHLKALSGA NGDTAAVASK NDLEDEDSRS SSTFASSGQR ILLEAFPNPS QEGNFSSGNF GSHSDTSHPT GRHRLESWGA MSDLSAPLAG GGCSDSTAAA LAHSALQHAD LADDVMDAAA GLDSINDLHE APDAIPNRIS LGRERFNSVA SLSEVSLSGL LFDGIEVSGD MQAFVSAAMA TMGDQLEALA GAVETVANSA GPHALDALRR ELGMDSDADS DASPMIGAMS DNGRHKGRPR SWSTSSGRIS VDYEAVAAAV DAGEAADLSG VAAIDPSSSS ISKERQRHAS RRQLPLQRAR DDSDLSLNSD ERAHLNASLV GSSLTDDEIK RIQERARKKA GYIPPTARSK AENKANKEKT GPFKKRVKRN SPEPAQIRSA SPGGTHTPKA SNKSMTMSDN LPLVPDLVLS GSTAASKAAK GQASQKWESM FECLVEYVDQ CKKEETKGFS QTEVDQWQWD GNVPTSYKSI DGKALGRWVN NQRSAKSKGT LKGEREQRLL DAGLKWSVLT SNSWNEMLEE LRLYIKEQAA KGKKWDGNVP TNYQIKNRSN GRFAGEDKNL GRWVNRQRSQ FHAGKLRKDR QLDLEKVGLK WSMLATNSWD SMYETLCEYV DQRKKEGGGW DGNVPANYRT NDYPPRALGR WINRQRSAFA KDKLKSEYVE KLSETGLKWS VHERTCSEKD DMDGEDHEAS SQPSVKPETV TSTKSNEMAV ETIKV
|
| |