Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_32929 |
Symbol | |
ID | 7197513 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 1130105 |
End bp | 1132989 |
Gene Length | 2885 bp |
Protein Length | 794 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177759 |
Protein GI | 219112015 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.508293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAAGA ACTCCACCGG CGTCGCTTCC GCCCCGCTGG AAGAAGAAGG AAATATTCCT CCTCTAGTAG CGGCCCATGT GCCCTCCTAC GGAACAGGCG ACGAATCAGA ACCAGATCCT GTTCGAGTGA TGATCGACGA AGACGTTGCC TCGGAAGAAG TACCTCTCGT TGTCACTCAT GACGATGATA AAGATGCGAT TCCCCAGTTC CGGGATGTAC CGTTCGCCGT CCTGTTTCTC ATTCACGCCA CATTGATGGT ATGGCTGGGA ATCTTTGTTG CACCCAAAGG ATACTCCAAA ATCAATATTG ATTTTGACAT GATCGAAAAG GAAATGCGCA AAGGAGACGA TATGAGTGAG CAAGATATTG CGGATTTTGA GAGGTTTGTT GCGTTTGTGG GCAAGTACGC ACAAGTCTAC CCGAAACGCA TTCTCCTATC CTTTGTTTTT CCGACAGCCC TCTTGGCATT TGTGATTGCT CTTTTCACAA CCATCTACGT GGTTAAACCT TGTCCCAAAA CGCTAACCTA CGCAAGTTTG GTTGGATCCT TTGCCTTTAC GGCCATTGTC ATGATTTCAT CGTCCGTTCT GAATAATAGC CTGTTTGGTG CTGTGATGAC CATTGTCGCT CTCGGTGCGG TTGCATACTA CGTATGTGCT GCCTGGCGTA TGGTTCCGTT CGCGGCGGTC AATCTGAAAG TGGCTTTGAT AGGCATGAGC CGCAACTGCG GAGTGTATCT CGTTGCCTTC TTCGCATCGG AGCTCGGATT TCTGTGGCCC ATCTACTGGG TCTACGTACT GATTGGCGTA TCGGTCGACC GCAATGATAA GTGTGAAAAG GCACACCCGG GGGCAAATTT TGATATGAGC TCGAATGATT TTGACGATGT GTGCCATCCT CCACCGTTGG TGTTTCTCCT GTTCTTGCTT TCGCTCTATT GGACAAGCAC CGTCCTTTTG GTAAGGCGTC CCAGCTTAAA TTTCTGACTT TGTGTATTTC AATTTTGATC CCTCACAGTA ATTCTCTGTC TCTGAAGAAC ACGGTACAAG TATCGGTTGC AGGTGTTATG GCAACATGGT GCTTCGACAA GCGCGACGCT GATCATTGTT GTTCCCCAGC AGTATTTGGT TCAGTGTACC GTAGCATGAC CTATTCCTTT GGTTCAATAT GTCTTGGGTC TTTGCTACAA GCTCTCATTT CTGTATTTCG GTACATAGTG GAAAGTGCCC GAAGCCAGAG GGAGCGAAAT GACGGTGGGG GTGCTTGTGG CAACATTCTG CTCTGCATTC TGGAATGCTT CGCCAAGCTA CTCGAAGATG TCATCGATTA CTTCAATCAA TGGGCGTATG TATTTGTTGG AATCTATGGC TATTCGTATC TCGAAAGTGG CCGACGAGTG ATTGAGCTGT TTCGAGCACG AGGCTGGACG GCGATAATCA ACGACAATTT GGTGGGATAT GTATTGGGCT TCACAACCGT TCTGATCGGT GTTCTGACCG GTGCCACCGC TTTGCTGCTC GAATTTACTG TTTCTCGGAG CAAGCTTGAA GCGAATTCTG AGTACGAGTC TTACATTTTC GGTCCTATTC CAGGGTGGAG GTGGTGGGCT TTCGGGTACG TTTGGCTTGT ACATTCTCGT AAGCCTTTAG ATTGCTATGT ATGTCTCGCA CTGACAGGTT GCTCGCTTTG TTGTAGCATT GGATTTTTTG TGGGAATATG GGTTTCAAGC GTTGTGATGA ATGTTGTCAA GGCAGCAGTG AACACTTTGG TTGTGTGCTG GGCTGACTCA CCGGCAGTAG TGGAAATGAA CCACCCTCGA TTGACATCAG AGATGGCAGA TTCGTGGTTG CAAGTTTTCC CAGAAGCGAA TACCCAGATT CGACCTGCAT ACAATGCCAT TGTTGTCTGA AATCGTTATT TTCGAAGGGA TGTAGGTAGT AGCCGTGACT GTTTTGTATT TGATTTTCCG TACTAATGTC CATAGCTACG GTATCGCAAG GTCTGTACGG CGAGCGATTT TAATGTTTAC GACACGTGGA ATTTGTCACA GTCAAAATGC GCTCCTACTA ACCGTAAGGG CAACTGAAAG ATGAGAAATC TTCCATCTGC TACGATGCGG ATTTGGATGA TCCTGTCTCC AATCGACATC GCCTTGTGAT CAAGACTACG GAAAGCAGCA AAGCTACGAC TTTGTTCTCT ACTTTTCACG CTACTCTACA CAGAATGGCT AACGCCTGTA TATCTGGTCG GAACACGAGG AGGTATCAGA TCTCGTGCCG GCGATTCATT CAATTGTTAT TTTTGTTGGC GGCGGCTGTT TCCGCAATTT CGGGACATCG GCAAACCTTC GTCCGCGTTC ACAACCAACC GCAGATATCA TCAGCATTAT CATGGAATAT AGAAGCAGGA TCATACGACG AAAAAATCGC CCTTAGAATA GCTTCCTTTC CGAGAGGCGG CTCAGATACT GATGTCGAAG GAGGCGAAAG CGATGAGTAC GACGAGGATA GAGAAGTAGA AGAAGACGAG TACGACGATG AAAGTGACGA AGGGGATGAT GACGAAAAAG TTGTACCAAT TGAAATAAGC ATTGAAAAAT ACGATGAGCC ACTGGTCGCA TCGCCAATGA TAAATCTATA CGCCTCGTTC GGTGTGATGA TGCTAGCACG AAGAATCGAC CTTTTCAACC CAATCGTCGT TCGTGTCGCA CGGTACGTAC ATAAATCTCC CACTGGACCA TATCATTTGA CTTTGTACCT TAATATGCTT TCATTTGACT CTCTCAGTGC CATGTTTATC GCTTATCTTG TTCTACACCA GCTGTTCGTG CTCTATGTTC GTATTCAGGC AAAATCTGCC GATGATAGGA CTCCAATCAC TCTAA
|
Protein sequence | MYKNSTGVAS APLEEEGNIP PLVAAHVPSY GTGDESEPDP VRVMIDEDVA SEEVPLVVTH DDDKDAIPQF RDVPFAVLFL IHATLMVWLG IFVAPKGYSK INIDFDMIEK EMRKGDDMSE QDIADFERFV AFVGKYAQVY PKRILLSFVF PTALLAFVIA LFTTIYVVKP CPKTLTYASL VGSFAFTAIV MISSSVLNNS LFGAVMTIVA LGAVAYYVCA AWRMVPFAAV NLKVALIGMS RNCGVYLVAF FASELGFLWP IYWVYVLIGV SVDRNDKCEK AHPGANFDMS SNDFDDVCHP PPLVFLLFLL SLYWTSTVLL NTVQVSVAGV MATWCFDKRD ADHCCSPAVF GSVYRSMTYS FGSICLGSLL QALISVFRYI VESARSQRER NDGGGACGNI LLCILECFAK LLEDVIDYFN QWAYVFVGIY GYSYLESGRR VIELFRARGW TAIINDNLVG YVLGFTTVLI GVLTGATALL LEFTVSRSKL EANSEYESYI FGPIPGWRWW AFGIGFFVGI WVSSVVMNVV KAAVNTLVVC WADSPAVVEM NHPRLTSEMA DSWLQLRYRK VWQLKDEKSS ICYDADLDDP VSNRHRLVIK TTESSKATTL FSTFHATLHR MANACISGRN TRRYQISCRR FIQLLFLLAA AVSAISGHRQ TFVRVHNQPQ ISSALSWNIE AGSYDEKIAL RIASFPRGGS DTDVEGGESD EYDEDREVEE DEYDDESDEG DDDEKVVPIE ISIEKYDEPL VASPMINLYA SFGVMMLARR IDLFNPIVVR VARQNLPMIG LQSL
|
| |