Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38493 |
Symbol | |
ID | 7203470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 238150 |
End bp | 240015 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182644 |
Protein GI | 219124718 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGAGTA GCGACGACGA AGATTTAAAT CTTGCCTTAC GGGCCTCACA AGAAACGCTT ATCGTCGAAA GGAATCGTCG ACAACCCGTA CTCCAGCCAG ATGATGACTC CATTAGTACG GATGGTGAGC AAGCCCTGGT GGTTCTGCCG CCAGCTAAGC GTGCACTCAA GTCCAACCTG CAAGATACAT CGACCTTTCC GCGGCTGTCC GCTACCGGGG ACAAGCCTCG AGATGCGATC GTATTGCACG ACGATGAATC CGACTGCTGC GATGACGATG CCGCCAACAT GGAACTATCC CTCGTGGCAC GACTGCAACG CAAGCACCAA ATAGAATTTG AGGAGGCACC ATCTGCTAAG AAACCCGTTA CTTTGCCACA AGGTACTAAC GGTGCTCTTG AAGTCGCAAC GGTAGGCCAC AGTCCGTCTC CAACAAATAC AGCACCTCCA CGAACCGAGT TAAAGCAGCA GAAACACACA GCTCCAAAAA ACGCTCCGGT TGCTACGTCC ATCTTATTAA ATTCAGACGA TTCGTCTGAC TCGGATGACT CTATAAAAGC TTTAAAATTA CGATTAGCTG CATCGTCGGA GCGCTCCCGT CCACAGTGCT CACAGGATAT CGACTGTTCC GGGTCAGCGT CCCCATCTCC ACCAGCGCGC AAATCCACTA ACGTCTCCAA GCGTAAAGAA CGTCCGGCGG TCGAAACGAA GCGCGTCCAA GAGATCAGTA AGCGCCAGAA GGAGATTGAA AGGGAGCGTT CTCGAGAAGA GAAAGCCCGG TGTCAACAAT TGAAGGCGGC TGAACGTCAA CGGCTCAAAG TGGTAAAGCA AACTTCGGTA GCTCTAGAAA AAGCTCGCAA ACAGAAACAG CGTCAAGCAT CGGATCAAGC ACGCGGTAAA TTTTGCCACA AAGAAATTGC TGTTTTGATT GAGCAAGATT GGGTCCAGCA ACCTGTCTGG AAGGAAGCGA TCACTGACGG CTTAGTGGCA TCGGATGATC ACCCCTACAT TTGGCATGAG TACGCCACCT TGCTTGGTTG TCCCACTGTC CAATGGATTC GCAAAGATTA CTTACTAGGC GGTGCTACTG ACGCTTGGCA GCAACTTCGC AAAGGGAATC ACGCAGGCTA TCACCATATC CCACTCCTAT GTGTTATTGT TGAACCTGAC ATTTTCTTGA AGCTGTTGCA TCGAGACAAC AGCGAAGATG ACGATTATCC CGAACTTGAG AACTGGTTGA AAGGAATACA GGCTGGCTGG AAAGGAGCTT GGAGCCATCA ACAAAAAGGC ACCCCGCGAA TTATTATTCT ATTGTACAAG GTCAGAGAAA CGCTGGACCG CTTGTGGGTT AAATACAAGC GAGAAAGCCG TGGGCGTCGA GTGAGCTCTT CACCGCAACC ACCAACGGCC GAAGAATTGC ATGACGCGCT AATCTGGATG ATGATTGACT TTCAAGTAGA GTGCATTCAT TGTTCATCGT CAGAACAGAC TGTACACGAG TTGAGCAAGA TGACTCGCCT CTTGTCGGAA AAGCCGTATC AGAAGCACGT AACGGAGCTT GATTGTGTTC GCAAGCTCAA ACCACGAGTG GACGAGAATT CCACCCTCCA AGAACGAGCT GAGGATTGCT GGTTTCGACA GTTACAACAA ATCCCACGAA TAAGCCTTAC GGTGGCTCGT GAATTTACGC AGCACTATCC GACTGCTCGA TCGTTATGGA TTGCGTACCA GAATCCCGCA CTTTCGGAAG AGCAGAAAAG GGTTCTCTGT AAAAATTGCT TCTCGCAGAA GGCCTCGCAC GCCAAACTTT CAACTTGGAT GTACAAGACA ATGACGGGAA ATGACCCCAA TGATTTACTA CGATAA
|
Protein sequence | MWSSDDEDLN LALRASQETL IVERNRRQPV LQPDDDSIST DGEQALVVLP PAKRALKSNL QDTSTFPRLS ATGDKPRDAI VLHDDESDCC DDDAANMELS LVARLQRKHQ IEFEEAPSAK KPVTLPQGTN GALEVATVGH SPSPTNTAPP RTELKQQKHT APKNAPVATS ILLNSDDSSD SDDSIKALKL RLAASSERSR PQCSQDIDCS GSASPSPPAR KSTNVSKRKE RPAVETKRVQ EISKRQKEIE RERSREEKAR CQQLKAAERQ RLKVVKQTSV ALEKARKQKQ RQASDQARGK FCHKEIAVLI EQDWVQQPVW KEAITDGLVA SDDHPYIWHE YATLLGCPTV QWIRKDYLLG GATDAWQQLR KGNHAGYHHI PLLCVIVEPD IFLKLLHRDN SEDDDYPELE NWLKGIQAGW KGAWSHQQKG TPRIIILLYK VRETLDRLWV KYKRESRGRR VSSSPQPPTA EELHDALIWM MIDFQVECIH CSSSEQTVHE LSKMTRLLSE KPYQKHVTEL DCVRKLKPRV DENSTLQERA EDCWFRQLQQ IPRISLTVAR EFTQHYPTAR SLWIAYQNPA LSEEQKRVLC KNCFSQKASH AKLSTWMYKT MTGNDPNDLL R
|
| |