Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42720 |
Symbol | |
ID | 7196117 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 889474 |
End bp | 891661 |
Gene Length | 2188 bp |
Protein Length | 694 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177184 |
Protein GI | 219110865 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGGTT GGGCAACGCA ACAATTTGAG AAACTTTCTC AGACGGTTGC GCCACCACCG ACCGACCCAG CCTCCCGCTT TGTCTTCTGC TGCCAAAAAC TCGATGAAGA CGGTGCCATT AAATGTGTCA GTGAACTGTA CGGCGTGGCA ACGATCGTTG TAGCTGTCAA GGGTCAGGTG CCATTGCACG TTGCCTGTAC GTACGCATTG CCTACTTTGA TACGCCATAT TTTGTCCCAA CCGGGAGCCG ACCCGAACGT CGTCGATGCG TCCGGCAACA CACCTTTGCA CTGTGCCGTC ATGTCCAACA ATCAAGAGAC CACGTTGATG GTTGTAAAAA TGCTGCTGCA AGAGTACCAG GCTTCAGTCT TGGCAAAGAA CGCTTCTGGG CAAACACCGT ACGATGTTGC CAGTTTAAAT ATGGTGCGGC AGTTTTTACT GCCGCTGCAG TTGCAACAAG AAACGCAAGC TGCTCTAGAC AACGGAGGTG TTGGATTGCC GCCAGGAATT GATATGGGTG GTCTAAAGAT TTCGAATGCG CATCTGCCGC CACCCCCCCA TGGCCCCCAC GAGCGGCGGA GGCATGTCGT TGACGAACAC TGGTGGGACC CGCTACGCAC CACCACCCTC CTTGGTAACT GGACCCCCAG CAACATCCAC TGCGCCATCG GCACATTATA CATCACCGTC ATCGGTACAC TCGAGCGTGT CTCCGCACGG TTCTACCCTA GTCCCTCCTG AATCTGGGGG CACCGTTTCC GGAGCTCCTG GACAGGGGCA TCAATTGGCG TCTCAAACAT CGCAGCCAGC TGTGAGCACG ACATCTCCAT CAGTTGCAAC AGTCTCTCCA TTGGCGATTC CCGGCCCCCC TTCATCGGGG AACTCGGACT ATGCTCGAAC GGGATCATCC TCGGCCGCCT TCTACAAACC ATCAAGTCAG AGTGGGCGAA GTTTCATCCG ACCCGACGGC TTCCACTCAT CAAGTTCCGA CAAGTCTTTG CAGCAAAAGT ATGGACACGA TACCACTGCA TATTCCAATG CTCCGCCCCC GCCGATGAGC TCAGGAAACG CAATAACTTC GGCGCCGTCT TCACTGAACG GGAACGCCGG TCCAAATCCA TTTGCCGGTG GAGCCTCCGC GATGCGGCAA CGGGGATCTG GCCGCTACGT TGCGTACGGT CCCGTGAGTA TTCATCCCCA TTCTTTCAAT GGAGCTACAA ACGCGGGGTC GGTGCCGATT CCCGCTTATA CGAATTTCGC TCCTGCCCCC GTCACGGGAC AAGACTCAGA ATTTTCGCAT GCCTCTGTGT ATTCTCATCA GGAAACACTG TCTGGCGCGG GTTTAATTTC TACAAATACA CCAAATGGAT CCGCACAAAA TATACCTTCT CCGGTCGCCA CTGGTTCCGT TACGGCAAAC TTTCCACCAC CACCCTCCAG ACAACAGCCC GGTGCACTCG CTGCTCAAAA TTCTCAACGG GAGGTGTCAG GTTGGTCCCA AACATCTCGA TCTTCGAGTA TGGGTTCGAC TCCTAGTGGT CACGGTATTG CCCGCGTATC AAGCACGAAT TCGGCTGCCG ACGTTTTTGC AACGCCATCG CCTGATAAGG CCCATCGACC TGCTGCTGAG CAGTCCACTA TACCATCGCC CCAAAATCTC CATCAATCCA TGTCGGAACA AAGTCCAGCG TCAGCCTTGT TTTCCAAACC GAGCCCAACG GTGGCAGCCG ACATCTTTGG GGCCCCTAAA CCGGAACCCT CAGTCACTGA AACGTTTGCA CAAGGCAGCG GTGCACCAAT CGTGACTGAA GGCTCCGGGG GAACAACTAC TGTTCGACCC TCACAATCAG TTCAGTCAGG AAACTTTCCG TTGGTTGCAC CTGGCCCGGG AAACATTGCT GCCGATCTTT TCTCAAAACC TGCTGTGGCT AGTGCAATTC GAAATCCAGT CGCTGTGGCC GAAGCAGGAT CAGGTTTCCC ATGTAGTCAG AACCAATCAT ACGTCCCGTC TCCAGATGAC TCAAATGTTC CCGCGGAAGA CGGTGAGGAT ACCATGCACG AAGTGCCTTT AACACCATCT ATCGAACCGC CGAAGCAGGC AACTGGCACT GCCTCCGATT TGAAGGGTAC AGAAAATGCC TTGTTCCAAG CCATAGGTAT GCCACCTCCT CCCTTTTCCA AGAAGTAG
|
Protein sequence | MFGWATQQFE KLSQTVAPPP TDPASRFVFC CQKLDEDGAI KCVSELYGVA TIVVAVKGQV PLHVACTYAL PTLIRHILSQ PGADPNVVDA SGNTPLHCAV MSNNQETTLM VVKMLLQEYQ ASVLAKNASG QTPYDVASLN MVRQFLLPLQ LQQETQAALD NGGVGLPPGI DMGGLKISNA HLPPPPHGPH ERRRHVVDEH WWDPLRTTTL LGNWTPSNIH CAIGTLYITV IGTLERVSAR FYPIATVSPL AIPGPPSSGN SDYARTGSSS AAFYKPSSQS GRSFIRPDGF HSSSSDKSLQ QKYGHDTTAY SNAPPPPMSS GNAITSAPSS LNGNAGPNPF AGGASAMRQR GSGRYVAYGP VSIHPHSFNG ATNAGSVPIP AYTNFAPAPV TGQDSEFSHA SVYSHQETLS GAGLISTNTP NGSAQNIPSP VATGSVTANF PPPPSRQQPG ALAAQNSQRE VSGWSQTSRS SSMGSTPSGH GIARVSSTNS AADVFATPSP DKAHRPAAEQ STIPSPQNLH QSMSEQSPAS ALFSKPSPTV AADIFGAPKP EPSVTETFAQ GSGAPIVTEG SGGTTTVRPS QSVQSGNFPL VAPGPGNIAA DLFSKPAVAS AIRNPVAVAE AGSGFPCSQN QSYVPSPDDS NVPAEDGEDT MHEVPLTPSI EPPKQATGTA SDLKGTENAL FQAIGMPPPP FSKK
|
| |