Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43037 |
Symbol | |
ID | 7196840 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1868054 |
End bp | 1872104 |
Gene Length | 4051 bp |
Protein Length | 1122 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176858 |
Protein GI | 219110213 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000542513 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGAGGCGGA CTAACAGTAA CCGAGCTAGA GCAGATTGCT CACTCCGTAC ATCTCTCTCT ATATATTCAT TCCTACCTAC CTACCAACAC TAGATACATA TTGTATCTAG AGGTAGATTT CACCACAGTT CCGTCGGAAG CGAGAAGGAA ATCGACTTTG GGAGTCGAGC ACGATACTCT AGCGGTAGTA GAAGAGTATT GGAGGAACGG CACGACGAGA AAGAAGTAAA GGCATTGCGT GGATTGTGAC TGCGACTGCT ACTTCTCTAC AGCAACTACT ACTACTGCTG GAGAAGAAAG ACGTACTTAT CCGTTTGCGG GGATTGGTCA AGTTTCGGAC TGTAGTTTGG GACTCGTAAA ACACCATGCG GTCTGGGCTA CTCACATTGT TCGGTTGCGC GGCAGTGTCC GACCATGTCT CAGCGTTTGT CCCGCAATCA CGTCTCCTGC CCGTCAGCTA CAGCTCTCAC GGAGTGACTC CACTACCGCG TAGCTTGGCC TTACAGGCAA CGGCCGACAA CGGCGGTGGC TCGGGACTGG GCAACGCCAT TCGCCGGATG ATGGAAACAC CAATTCCGGG CACGGACAAG GTCGCCGCCA CCGCGTCAAC GCCAACCACG CCCGTAGTCG TTCCCGGCGT ACCCGTGAAC GTTCCCGATT GGTCCCAGAC GCTGAACGAC GCCGTAACAA CCGCAGCAAC GACGGCCACA AAATCGTCGC CGACACCCAC CAGCATCCCC GCGGACGCCG TCGGTACCAA TCTCAACGTG TGGAAATCCT ACCTGGCGCC CAATACGGCG GATCTGGCCT TGCCGGACAG GGCAGCCTTG CAAGCGGGCG CGGATCAAAT CGTGACATCG GTACAGTCCA TCCCAGTGGA CAAGTTGAAT CAAGCCGCGT CGGATATTGC CAACGTCTTG TCTACCGAAG GTTGGTCCAC TGCGGATATC ACCCAAGCTC TCAACATTGA CGAACTAGGA GTGTGGTACG CCGGAGCGCT CGGGGCGGGC ATTCTGTTGG CGGCGGGGCG CAACTCGACT TTCACCAAGG CAACAGCTCC CCCTCCTCCG CCCCCACCAC CCAAACGCAC GTTTCCCGAT CCACCCAAAC TACCCCCCTT GCCCAATCTC CCGCTACCGG AAAACGTACC GGTCCCCGTA GTCGCCACCG CGGGTGGCAT TGGGGCACTC ACCTTCGCCT CACTTTTGGG GTTTGGGGAT TCCATCAAGA ACGCCATCAA GTTTGCCCTC GTGCCGAACG CGGCCATGAC TGCCAGGAGT GCAGCGGCCG TGGCTGCTTC CACCAACACC CAGACAGCAT CGGTCCCACT GGCTTCCAAT CCACCGCCGC CACCCCCTCC CACCGTCACA CCGGAACCCA TTAACCAAGC GATCGAAGCG GCTTCCACAA CAATCGGGTC CAGTGCATCT GCGGGATTGT CCTCGTTGAA AACCTACCTC GTACCGGATC CCAAGCTCTT GGAACTACCC GACAAGGCCG GTCTTCAAGC GACGGCGGGG AAATTCATCG ACGCCGCCGT CGCCATACCC CAATCAATTC CCGTCGACAA GTTTGGGACG GCCGCCGTCA CACTCGCACG GGTGTTGCGA ACGGAAGGAT GGTCCGCTGC GGACGTCACT AATGCGCTCA ACGTGGAAGA ACTGGGGGTG TGGTACGCCG GTGCACTCGG CGCCGGCGTA CTCTTGGCCA GTCGCAACGT CACGAACATT GGAACGTCCG CATCGGGACC GAAAAAACCT GCGTTTCGTC CCACCCGCGT CGTGGCTCCC GAACCCGTGC CGACACCGTT GGAGTCCCTG CAGAGTCAAG TGCGGGAAAT CAGCAAAGTC GAAATTTCAC CGGCCGCCAA AGTCACAATT GGTACAGTCG GCGCATTGAC GTTTGCTTCT CTTGTTGGTA TGGGAGATTC CGTCCGGTCC GCCATCAAGT TTGCCTTGGT GCCGAACGCC AAGAAACCGG CCGTATCGGT TCTAGCGCAA TCTGTCACGC AGGAGCCTCC AGAACCACCC AGCCTGCCGG ATCCGCCGTC CTTTTATGCA CCAGCAGTGT CTTCGGAGAT CACTGATAAG CCGGTCACGG AACAATTGGC TGGGAGCCTG GTGGAGTCCA CCAAAGGGGT CATGTCCGAC ACGGTGGACT CGTCCGTTGC TTCGTCCAGC TCCGTCCAAG CGGCCATCGC TTCAGTACAG TCGAACTATC AGAACGCGCT CAATTCGATC AAAGAGGCCC CAGGGACTTC GGGCCAAGCC TCGAAACTTT CCGAATTTCT GAAAGAAAAA GTACCATCCT TCAGCACAGA TTTCGGCAAA CTGGATCTGC CCAAGCCTGA TTTTAATGTC GATCTGAGTG GTTTCGACCT TACAACGGAA CGCATCAAGG CTTCATTAGA ATCTATTCCA GTCGACAAGT TGTCCAGCAC GTTTGACAAT CTGGGGAAAA GCTTGCAAAA TGGCGGCTTC ACCGCGCAGA GTATTCTGGA GTCAATGAGC TCCGCCGAGA AGGGTTGGTA CTTGGCAGCT GGTAGCGTCG TGTTGGCAGC GGTCGGGGCC GGGATTCGCA ACACATATGA AGACCAGCTC GAAACCACCA CGTTAGAGAC CAAGGAAAAG CAAGCCAATG CGAAAGAGCC TGCCAAGATT TCTGAAGCTG ATAGTGTTGA AGACGACATG GTCTCTCAAA TCAAGGAACT GAGTCAGATG ACAACGGCAT TATCGAACGA GCTCAAGCAG ATTAAGACTC AAAAGTCAAA GAAGGACTAC GATGTGGCCA CAATGCAGAG CGATGTGCGC GAACTGCAGA ACGCAATGGA CGCACAAAAG AAGTCTGAAA AGGCGTTGAA ATTGCAGCTT GCGCAGACGG AAAAAAAGCT GGCGGCGGAG ACCGCCCAGC TACAGCAAAA GCTAGAAGAA GCAAACAAAA AGTTCAAAGA CGAGAAGGAT GCAATGAAGA AAACAAACAA AAAGCTTCAA AAAGACTTGG ATGCAGCGCT GGCGTCAGTT GCGGCCTTAG AAGCGGAAAA GGTTGTGTAT TTGTTTTGTT TGTTTGTTCG TTGTATCAAT ATTACTTTGT TAGACGTGCT CTAACCCATG ATTATTTGAT CAACGCAATG CAGGCTGAAC TACAAACACA GCTCAACGCA CTAGGTATTG AAGAAGTGGA GGCACAGCTC GCAGAGTTGG GGCTCAATGA AGTGGCGAAA AAGCCCAATC GAAAACAAGA ACCCGAACCT GTCGTTGAGA TAGAACCAGA ATCGAATTCA GAAGCCCAGG CAGCGACAAC CACAACCAAA AGCAGCTCAT CACGTCCACA GGACACTTTC TTTGCTAATT TTACCGAAAT TCCCTTGGAC GAGCTTCCAG CAGTAGCTTC GGAGTCGACA GAGAAGAACA AAAGCTCTAA AACGAGAGCC GCTTCTAAGC AACCGCCTTC AAAGAAAACA AACATAAAGA AAAATTCGCC GAAGAAATCC GCCACAAAGG GTGCCATTAA GAAGCAAGTT GAGCAGAAGC CGGAAGAGAA AAAGGCTGAA ACGAAAAAGA CAGAAGCGAA AAAGGTTGAA ACAAAAAAGA CAGAAGCGGA AAAGTCTGAA ATGAAAAAGA CAGAAACAAA AAAGAAAGAA GCGGCTGACT CTGTGGTGTC AGGCGGATCG GAGAACTGGA ACAGTTTGTC AGAATCAACG CTGAAACGGA AAACGGTGAA GGAATTGACT TCGTATTTGG AAGAAAAGGT ACGTCAATAA TTGTTTTATT GCCTCTACCT TGTTTAATAG CCAGTTTGAT TTACCTTGCA TCCATTTGTT CTCTTCAGGG ACTTACAACG ACCGGAGGAG ATGGTAAAAC ACTGAAGAAA GCGGATCTGG TAGCCGTCGT TCTGTCGCAA TCGTAGGAAT GTTGTAATGG CTTCGTTTTT TTTGGCTATA TACTGGATCG ATGCAATTAA AATCGCTACC TTAAAACACG ACCCAGAAGA TTACTGCGTT GCACCACCCG TTCTTAATAT AGCGTATGTT TGTAAGGAAA CGACATGCTG G
|
Protein sequence | MRSGLLTLFG CAAVSDHVSA FVPQSRLLPV SYSSHGVTPL PRSLALQATA DNGGGSGLGN AIRRMMETPI PGTDKVAATA STPTTPVVVP GVPVNVPDWS QTLNDAVTTA ATTATKSSPT PTSIPADAVG TNLNVWKSYL APNTADLALP DRAALQAGAD QIVTSVQSIP VDKLNQAASD IANVLSTEGW STADITQALN IDELGVWYAG ALGAGILLAA GRNSTFTKAT APPPPPPPPK RTFPDPPKLP PLPNLPLPEN VPVPVVATAG GIGALTFASL LGFGDSIKNA IKFALVPNAA MTARSAAAVA ASTNTQTASV PLASNPPPPP PPTVTPEPIN QAIEAASTTI GSSASAGLSS LKTYLVPDPK LLELPDKAGL QATAGKFIDA AVAIPQSIPV DKFGTAAVTL ARVLRTEGWS AADVTNALNV EELGVWYAGA LGAGVLLASR NVTNIGTSAS GPKKPAFRPT RVVAPEPVPT PLESLQSQVR EISKVEISPA AKVTIGTVGA LTFASLVGMG DSVRSAIKFA LVPNAKKPAV SVLAQSVTQE PPEPPSLPDP PSFYAPAVSS EITDKPVTEQ LAGSLVESTK GVMSDTVDSS VASSSSVQAA IASVQSNYQN ALNSIKEAPG TSGQASKLSE FLKEKVPSFS TDFGKLDLPK PDFNVDLSGF DLTTERIKAS LESIPVDKLS STFDNLGKSL QNGGFTAQSI LESMSSAEKG WYLAAGSVVL AAVGAGIRNT YEDQLETTTL ETKEKQANAK EPAKISEADS VEDDMVSQIK ELSQMTTALS NELKQIKTQK SKKDYDVATM QSDVRELQNA MDAQKKSEKA LKLQLAQTEK KLAAETAQLQ QKLEEANKKF KDEKDAMKKT NKKLQKDLDA ALASVAALEA EKAELQTQLN ALGIEEVEAQ LAELGLNEVA KKPNRKQEPE PVVEIEPESN SEAQAATTTT KSSSSRPQDT FFANFTEIPL DELPAVASES TEKNKSSKTR AASKQPPSKK TNIKKNSPKK SATKGAIKKQ VEQKPEEKKA ETKKTEAKKV ETKKTEAEKS EMKKTETKKK EAADSVVSGG SENWNSLSES TLKRKTVKEL TSYLEEKGLT TTGGDGKTLK KADLVAVVLS QS
|
| |