Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_34712 |
Symbol | |
ID | 7200179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 201683 |
End bp | 203671 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179155 |
Protein GI | 219116721 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.183935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACTCG ATTCGATAAA CGAGAGCGGT AGCAGTAACA ACACAACCAA CAAAAACAAT CATACGAATA CAAACGATGG TCCCGCGGAA AGTGGCCAGA ACCGGCGACG GAGACGTCGC TGGGGGGAAG TTCCGCCGGT TCCCGCCACC TCCCTTACAA CAACAACAAC AACAACAACG AACGCTCCGT CCCCACCGGA ACTGGATCCC AAAGCCAAAG TGCTGGCAAT GCAAGAAAGT ATTCGGGCCC GGTTAGCGGC TGCCAAAGCC AAACAAGAGT TACAAAAAAC CATTGCGGCA ACACCCATTG CGACGAATGA AGCAGCATCG GGTGCGAGCA AACGGCCTCT GGCGTACGAA CCCCCAAACG GTCTGCCGAA AGCAACCAAA CGTGCCAAGA CGTACAATAT TGACTTGTCG GTGACGGCAC CCACTTTCCA AAAAGGTACA AAAGCAGTCG GCGACGCGAC GACTACCGGT ACCAGTCCCG CACCCGCGGT GCCGCTCCAA AAGGTGAACA ACCCGTACTT GGCTCATTTG GAAGAGCACG ACACCGAAGC CGTCGTGGAC GATCGCTTGG ATCGAGCCTC AAAACCGCGG CGGCCGCACA AACCCTTGCA TTTTATCGAA CCGGGCACCT TTGCCGCGCT GGCCGAGCGC AAACGCGAAA AAGCCGCACT CGCGCAGGCG AGTGGGTACG TGTCGGGGCG CAAGACGGGG CACACGATTG TTTCCGCCAA CCTGGCGTCC GTGTACGGTC CGCAACACGG GAGAGATGAA GAGGAAGACG AGACACTCAA ACCGCGGTGG GACGCACATC CGGATACCAA AATGCCTTTG ACGATGGAAT GGTGGGATAC GGAATTGTTG CCCCAGAAAT TGAAAAAGCA AGTGGCGGCC GCGGAAGCAA AAGAACTCAA TAGTCAAGCT CAAGCGCAGT TGGTGAATCT AGACGACACG GACACTGGTA CCAAGGCAGT GACCAGTGTT GCTGCAGACG GAAATTTCGA GGCCTTGCAA AAAGCGTGCT TTGATCAAGC CGCGCTATCC TACAGCAAAA CTGCAGCCTT GGTTCAACAC ATCGTGCCAA TCAAACCACC GAACGCACCG TCCGGTCCGG CGAAACAGGC GGTCTTGCAT TTGACACGAA AGGAACTCAA ACGTCAACGA AAGCTGCGCC GGCAAGAGAA GCAACGTGAG ATGCAAGATT TGCAAGCTGC TGGTCTAGTA CCCGCACCCG AGCCTCGCTT GACCTTGTCC AATTTTATTC GTGTGCTGGG TGATCAAGCA TTTTTGGATC CATCGCATAT GGAACAAAAG GTTGCTCAGC AAATGCAGGC TCGGCAACAA GCCCATTTAA AACGGAATGA AACCAACAAA TTGACGAAAG AACAGCGGGC CGCCAAGCGC ACCAAAAAAC TACAGGAAGA CACTTCGCAA GGGGTCACTG TCGCGCTGTT TCTAGTGCAG GACTTGTCCC ATCCCTTGTT GCGCGCGAAC CTGGATTTGA ACGCCCAGCA AAACAACATT TCTGGAGCAG TAATCGAATG CCAAGATCCT TCACTAGGCT GTGTGATTTG CGAAGGCGGT CCCAAAGCAA TCAAACGCAT GACGCGACTC ATGTTGGTAC GGATGAAATG GCGGGGCCCT GATGACGGAG AAGAATTGGA GTACGAAAGC GATGAAGAGG ATGAAGCTGG TGAGTATTCT ACACACAAAT ACAATCCTGA TAACAAGTGC GAATTGGTCT GGCAAGGAAT GGCGGTCAAG CGAGTCTTTA ATGGCTTTCT CTTTCAGTCG GTCGAAACTT CCAATCAAGC GCGCAAAATT CTCAAAGCCA AAGGGGTATC TCACTATTGG GATCAACTTT TGATATGCAA AAGTGGTCGC GGCGAAGTCT TTCGCTTGAA GCTAGGCGAT GACAGTGAAG ACGAAGAAGG CAACCCGTTT GAGAAGCTGG ACGAAGACAT TGTCATGGCT GACGAGTAG
|
Protein sequence | MTLDSINESG SSNNTTNKNN HTNTNDGPAE SGQNRRRRRR WGEVPPVPAT SLTTTTTTTT NAPSPPELDP KAKVLAMQES IRARLAAAKA KQELQKTIAA TPIATNEAAS GASKRPLAYE PPNGLPKATK RAKTYNIDLS VTAPTFQKGT KAVGDATTTG TSPAPAVPLQ KVNNPYLAHL EEHDTEAVVD DRLDRASKPR RPHKPLHFIE PGTFAALAER KREKAALAQA SGYVSGRKTG HTIVSANLAS VYGPQHGRDE EEDETLKPRW DAHPDTKMPL TMEWWDTELL PQKLKKQVAA AEAKELNSQA QAQLVNLDDT DTGTKAVTSV AADGNFEALQ KACFDQAALS YSKTAALVQH IVPIKPPNAP SGPAKQAVLH LTRKELKRQR KLRRQEKQRE MQDLQAAGLV PAPEPRLTLS NFIRVLGDQA FLDPSHMEQK VAQQMQARQQ AHLKRNETNK LTKEQRAAKR TKKLQEDTSQ GVTVALFLVQ DLSHPLLRAN LDLNAQQNNI SGAVIECQDP SLGCVICEGG PKAIKRMTRL MLVRMKWRGP DDGEELEYES DEEDEAGEYS THKYNPDNKC ELVWQGMAVK RVFNGFLFQS VETSNQARKI LKAKGVSHYW DQLLICKSGR GEVFRLKLGD DSEDEEGNPF EKLDEDIVMA DE
|
| |