Gene Information |
Locus tag | PHATRDRAFT_24120 |
Symbol | |
ID | 7199274 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 165477 |
End bp | 167618 |
Gene Length | 2142 bp |
Protein Length | 568 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185438 |
Protein GI | 219130576 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
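The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the stated gene length) and that the coding region includes one stop codon. The function names are hypothetical and not part of any database API. Attributing the remaining bases to introns is also an assumption, based only on the gene being marked as spliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record: Start bp, End bp, Protein Length.
length = gene_length(165477, 167618)   # genomic span of the gene
cds = coding_length(568)               # coding bases incl. assumed stop codon
noncoding = length - cds               # remainder, presumably intronic

print(f"gene span: {length} bp")
print(f"coding bases: {cds} bp")
print(f"non-coding remainder: {noncoding} bp")
```

Under these assumptions the span agrees with the listed Gene Length of 2142 bp, and the coding portion is shorter than the span, as expected for a spliced gene.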
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCGCT ACCGTGGCTT TTGGTGGCAC CCCGACTCAT GCGGTATCCT GGTCGCCCGC GTCGACGAAT CGAGCGTCCC CGTTTACCGA ATCATGCACC AGGAAGTGGA AGATGGTCAC GCGTACGAAG ATCATCGCTA TCCTTTTGCC GGAAAAGAAA ATCCCACCGT CCGACTCGCC TTTGTACCGA TCGATTCCGT CTCCGTGGCA CAATCCTTGA CCGTGAATCG GAAAACGTCC ACTCCACCGA CAGAAGCCAC CGCCACGCAC TGCCACGACG ATGATACGCG GCAAGACGTG GACGGCAGTG AGGCTGGCGC CGATCCCATG GACGAAGACG ACGACCACGA ACTTTCGGCG CAAAGCATTG CCGATCTGGC TTGGGACAGT GTGGTATGGT TGGATCCACC CCGGGAGGCC ACGGAGTATC TCGCCCGCGT CTACTGGTTG CCCGATGGTT CTGCCGTGGC CCAGTGGCAG AATCGGACCC AAACCGTGAC GGTCTGGCAA CGGATACCCG TACTGGAAAC ACCGATACGA CCGCGGACGC TCGTCATGGA ACGCACCGAC GTGTGGATCA ATCTGCACCA CATGTTCCAC GTTTTACCGG AACCGGTTGC TCCCCAGCAG TGCGGAACGT CCGGACGCGA CCCTCAAATC CCGCCCTTTC CTGCACCCTT ACCGCCCGGA GCCTTTTCCT TTCTCCTCGC GTCCGAACGC AGCGGCTTTC AACATTTGTA CCTATACACG TATTGTCCGG GCATCAACGG CGAACAGGCA GTGCTGTTGC GGACCATCTC GGCGGGGGAA TGGATTGTGG AAAGTATCGT TGGTGTGGAT ATGGACAGGG ATGTCGTGTT CTTTACCGGC ACGTACGATT CAGTCCTGGA ACGACACTTG TACGCCTTGC CACTTACGTA TCGAGACGAA GTGGAGGGAT CCAATGAAGT ACGCGGTGAG GAACATCCAA CCGATCACAA CGGTGTCCGG CGAGGCTTGA GCAAAGTTAT GAGCGCTTTT TCTCCGGGAA AGCACCACAA GGTCAAAGCG AACGGTATGG ACTCTGACGG CGGGTCAACT GTACGCCCAA TTCGCTTGAC CGTGGACTCG GGAATGCACA GCATCGTGAT GGACGAGGAT TGTGAAATCT TTGCAGACAC TAGTAGTGAT CTAGATCGAC CAACGTCGGT CAAGATTTAC GAAGTGTCCA AGAACATTCT GGCGTGGAAC CAAAAGCCGA GCAGGACTCC CCCTGTCCAA TTGCTATTTA CGCTCTACGA TGCCATGAAT GACGACAAAT CCATGGTCGC CGAGGCGCTG GCGGCACAGT CCACGATTGG TCGCAGTGCC GGTAGTCGGT TACTAGCCAA TTTACCCGCA CCAGAACTGT TGAAATTCCC CACGTCGGAC GGGTCGGAAA TGCTGCACGC GGCACTTTAC CGCCCCGATG CTCGTATACA CGGCCCAGGT CCGTATCCCT TGATTTGTGC TGTTTATGGC GGTCCCCATG TCCAACGCGT GAACCGTTCG TGGTCACAGT GTGCCGATAT GCGAGCACAG CGACTGCGGA GTCTGGGTTT CTGCGTCGTC AAATGCGACA ATCGTGGTTC CTCCCGGCGA GGCCTGGCGT TCGAATCGGC AATTTCGCGG CGACTTGGTC GTCTTGAAGT GCTGGATCAA GTGGCTGCGG TACGACAACT CGCCGCCCGA GGTGTGGCCG ATCCTAACCG TGTGGGTATC TATGGCTGGT CCTATGGAGG CTATCTGTCG GCCATGTGCT TGTGTCGGGC ACCGGATGTG 
TTCCACGCAG CCGTTGCCGG TGCTCCGGTG ACCTCCTGGG ACGGCTACGA CACCCACTAC ACCGAGCGGT ATATGGGCCT ACCGTCCGAT AATCCGGCAG GATACCGCGA ATCGGCCTTG TTTGAACACA TTCCGAACAT GTCTGGGTCC TTGTTAATGA TCCACGGTTT GATTGACGAA AATGTACACT TTAGACACAC GGCGAGGCTC ATAAATAAGT TGGTGGCGTC CGGTAAGTCC TACGAGCTAT GCATCTTTCC CGACGAACGA CATTCGCCCC GACGATTACG GGATCGCATT TATATGGAGC AGCGCATTGG CGACTTTTTC GTAGAACGTT TG
|
Protein sequence | MDRYRGFWWH PDSCGILVAR VDESSVPVYR IMHQEVEDGH AYEDHRYPFA GKENPTVRLA FVPIDSVSVA QSLTATEYLA RVYWLPDGSA VAQWQNRTQT VTVWQRIPVL ETPIRPRTLV MERTDVWINL HHMFHVLPEP VAPQQCGTSG RDPQIPPFPA PLPPGAFSFL LASERSGFQH LYLYTYCPGI NGEQAVLLRT ISAGEWIVES IVGVDMDRDV VFFTGTYDSV LERHLYALPL TYRDEVKANG MDSDGGSTVR PIRLTVDSGM HSIVMDEDCE IFADTSSDLD RPTSVKIYEV SKNILARLLA NLPAPELLKF PTSDGSEMLH AALYRPDARI HGPGPYPLIC AVYGGPHVQR VNRSWSQCAD MRAQRLRSLG FCVVKCDNRG SSRRGLAFES AISRRLGRLE VLDQVAAVRQ LAARGVADPN RVGIYGWSYG GYLSAMCLCR APDVFHAAVA GAPVTSWDGY DTHYTERYMG LPSDNPAGYR ESALFEHIPN MSGSLLMIHG LIDENVHFRH TARLINKLVA SGKSYELCIF PDERHSPRRL RDRIYMEQRI GDFFVERL
|
| |