Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46568 |
Symbol | |
ID | 7201849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 748626 |
End bp | 751205 |
Gene Length | 2580 bp |
Protein Length | 821 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180890 |
Protein GI | 219120297 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.927409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGAAATTGAA GTCGTTTTGG AAACATGAGA TGAGCAAGCA ATAGTAGAAG TGGATCGGTG ACGCTGGACT TCCGTTTTGT CCAGCGATGT CTCTTCTTTT GCGAGCGAGT AACTTTTGCC GCCCTTCTCC GACGAGAATC GATGCGAAAC TTGAGCTCAT CCATGCTGGC TGCGTCGGGG TCGACGGGGA ACAACCCAGT TCGGCTGGAA ACCCTTGCTC GGAAGCTCGG GTGACATTGG ATGTCCTCCG GACTCTGTGT ATCTTCTCTG GGAGTCTGGT ATGGATAGAA TACGATAAGA GACGGATTGC AGTGCGTTTG CAGGTTCTGG AGGACGAAGC GAGCAATTCT GATACGGACC AGCCGGCTGG ACGGAGGCCC GAGCGAGCGA TACAACCGTC TGTCTACGTA TCACATATTA CTGCCGCCAA TTTTGGCTTA TACGGTGATC GGGCTCATGC GATTTCGGTT GTGTTGAGCA TTGTTGAACA GGCGGCATGC GAGGCCGATT CGGTTTCTCT GTTGACCTGC GGGAGGCCAT TGCCGCAATC GTGGAGGGCC TGGAACGAGA CATCGATCTA TGCAACGGAA TCCTGGCCCT TACCGCCTGT CGAAACAATC TTGCAAACGT CGTTTCTGAT TTCAGTACAT GAGAATGAAC AACTTTATTT CTATCAAATT GCGGAGATAG ACGGAAACGC CACATCCGAT AATGTATTCG TGTGTGGTAA TTCAACAAAA TTCACTCTAG TGTCCCCCCT ACCTTCAATC TCTTGCCGCC ATCTCCCCGA TCTGAACGCA ACGTCACGAC TCTACAAACC CGCTCTGTCC GAGCTACCAC CACACCAAGA TACGGGGAGG CTTGTTGCGG CCTTAACATT GTCTCCCTCT GCTCACATGG AAGAGCGCGT CTGCCATGTC GTGGGAACGG ACGCGAACCA CGATGTTTGT TTAGCTATGG AAGCGGCCGC TCTATGTCTC GGTCGTCGAA TCCTTCATAT CCATGGATTA GCGGGACACG CGTATGCTTC GGGAAGGAAC ATTTCTAATG GATCGCTGGC GGATCAGTTA TCAGGTTTGG AGGTTGCCAT TGCCCAGGCC TATGCGCACG CCCCGTGTGT GTTGCATCTG GTTCGTCTCG ACCAGGAATG GGCAACAAAC GACCACGAAT TACTTTGTGA TACCCAAAAT CGAGTATGGT CGGTATTGAT GGATTGTTTG AGGGTGAGTG ACAACAATGC GGTCTCCAAC GGCTCACTTT GGAACGATAG GTCAATTTCA ATCCCGTCAG TGCTGGTTGT CATTTCCACC GCCCGTCTAT TACAGCCTGG CCCTATGCTT CAGAACCTGG TGTATCCATC TATTCAGTTG AGGCTACCAA ATCTTTTGTA CGTGGAGCAC CTCTGGCAGC GCACAACAAT AGACGCCACG ATTCCTTCCG ATGAGCTATT ATCCATATGC AAGCAACTCG AAGGGCGGCC AGTGCATGAG ATCTGTCTAT TGCAGAAGCA GTATCTGGAG ACAGATCGAG ATTCTACTGG GGACAATTCC ATGCAACTGT TGGAATCGCT TTGTGTTCGT TTGGACCAGT CGCGGCGGCA GCGTTCAGGG GGCCCCAAAA TTCCATCGGT ACAGTGGGAA GATGTCGGTG GCCTTGAACA CGTTCGCCGG GAGATTTTAG ACGCTATCGA ATTTCCTCTC AAGTATCCTC ATTTGTTCCT CGGCAGCTCT ACAGGACGAT CCGGTATATT GCTGTACGGC CCTCCCGGGA CAGGCAAGAC TCTTGTGGCC AAAGCCGTTG CAACGGAATG TCAGCTACCT TTTTTATCGA TTAAAGGTCC CGAACTTTTA GGATCGTTTG TTGGCGAGTC GGAAGGTCAC GTACGAGGTA TATTTGCCCA GGCACTACGG CTGGCATCAC AAAATACACC CAAAACGGCT TGTATTTTGT TCTTCGATGA GCTTGATAGT CTGGCACCCC GTCGCGGCGA CACTGCTAGT GGCGGCAACG TCATGGACCG CGTCGTTGCA ACCTTGCTCA CTGAACTAGA TCGACGGCAC GAGTTTACCG AAACTGTTTT TTGCATGGGT GCCACGAACC GCCCAGATCT ACTAGACCCC GCCTTGCTGC GGCCGGGTCG CTTGGACCGC CTCGTGTATT TAGGCGTTTC CAAAAGCAGT CAAACAGGAA TTCTTATGGC GCAAATTCGT AAATTGCGAC TGCAAGGCGA TGCAGCCCAA TTTGCTACAA AAATCGCCGC GGTTTTGCCA GACAACTTGA CCGGCGCAGA CTTGTCAACG ATTTCGTCGG GAGCACTCTC TCGAGCAACT CTTCGACTTT GTGATCAAGC GGATGCGGAA CTTGCAGTGC TGCGGGAGAG CAATCCATCG GCAGTACTGG ATGATGTACT GGAGTCCTGG GATGAACGTC AGTTGGAACC CACGGTGACG TTGTCGGATC TTTTGGAAGC TGCCGAGTCG GTCGTGCCAA GCGTCCGACC CGATGAGCTG GAACGCTATG AGAAGTTGCG TGATCAGTTC CAAATGAGCT AAAATGAACT TTCAGCTTTC CCGTGAATGT
|
Protein sequence | MSLLLRASNF CRPSPTRIDA KLELIHAGCV GVDGEQPSSA GNPCSEARVT LDVLRTLCIF SGSLVWIEYD KRRIAVRLQV LEDEASNSDT DQPAGRRPER AIQPSVYVSH ITAANFGLYG DRAHAISVVL SIVEQAACEA DSVSLLTCGR PLPQSWRAWN ETSIYATESW PLPPVETILQ TSFLISVHEN EQLYFYQIAE IDGNATSDNV FVCGNSTKFT LVSPLPSISC RHLPDLNATS RLYKPALSEL PPHQDTGRLV AALTLSPSAH MEERVCHVVG TDANHDVCLA MEAAALCLGR RILHIHGLAG HAYASGRNIS NGSLADQLSG LEVAIAQAYA HAPCVLHLVR LDQEWATNDH ELLCDTQNRV WSVLMDCLRV SDNNAVSNGS LWNDRSISIP SVLVVISTAR LLQPGPMLQN LVYPSIQLRL PNLLYVEHLW QRTTIDATIP SDELLSICKQ LEGRPVHEIC LLQKQYLETD RDSTGDNSMQ LLESLCVRLD QSRRQRSGGP KIPSVQWEDV GGLEHVRREI LDAIEFPLKY PHLFLGSSTG RSGILLYGPP GTGKTLVAKA VATECQLPFL SIKGPELLGS FVGESEGHVR GIFAQALRLA SQNTPKTACI LFFDELDSLA PRRGDTASGG NVMDRVVATL LTELDRRHEF TETVFCMGAT NRPDLLDPAL LRPGRLDRLV YLGVSKSSQT GILMAQIRKL RLQGDAAQFA TKIAAVLPDN LTGADLSTIS SGALSRATLR LCDQADAELA VLRESNPSAV LDDVLESWDE RQLEPTVTLS DLLEAAESVV PSVRPDELER YEKLRDQFQM S
|
| |