Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49641 |
Symbol | |
ID | 7198273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 282064 |
End bp | 284495 |
Gene Length | 2432 bp |
Protein Length | 779 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184436 |
Protein GI | 219128471 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTCGT CGCCGCCCAC GGGGGTGGAC GACCACACTG GGCAAGCCTT GCCCCGTCGT CGACGACGAC GACGACGACG ACGACGACGC CGGACCGTTT TCTACACGAG TAGTAATACT CCCTACCGTA TCCTCCTACT CTGTGGACTA GTGGCATGCG TACACCACGG ACTAGTGTCC CAGTGGGTGG GCACCCACGC GTGGGTCGCG GTCCATACGG TCCTCCACCC ACGACGGTAC GGGGTACCGT ACGGACGCGA CTGGTCAGGG CAGTGCACGG CACATCCCCC CGGACAATTC CCTCCACCAA TATTGGCGGA CGACGAAGAC CCCGACCATG CCTCACTCGC GTCGTCGCGA TCCAACACCA ACCAGAACGA AAGTGCTGGA TGGAAAGAAA CCCAAAGAAA AGTCGAACAG CAACAGCAAC AGATTGATCT CTTGCTCCAA CTTGTGCAGC AACAACAATC ACAACAATCA CAGCCATCAC AACAATACTC CCCCCGTGCA CCCGCACCCG TTTCCAGCCA CGAGGTCACC GCAACCCGGG CACCAGAGCG AGACCCAGAG GCGCTTTCCC CGTCGACCAA CGGCAACTAT TCCAGATCCG TTCCGTCGAC GGACAGCATC GCACCGGCTA CTGCCAACAC GGGTGTCGTG CCTCTCAAAG CCATGCTCTT TATTGACGGA ACGTGGTTGT ACTACAGTAT CTACGAACGC ACCGAAGCCC GCTGTCCCAT TATCCAACGC TACGGTCGTG GTTGGCAGAA TCGGTACGAT TTCCATTGGG CCGCCCTGCC ACGGATTCTC TGCGAAAGCT TGCGGGATCC CGGATGGAGT ACCAACACGG CCGCTCCCGC ACACACGACA ACCCACGAGT CTGCCACCAA AAACGCCCGT CCCATGGAAA TTGTCCGCGC CAGCGTCTTT ACGTCCTACA AGGCCGACAC ACCCACGTCC TCCTTTCGGT ACCAAATGTT TCAAGACATG CAGGCCGCCA ATTACGACGT CCACATGATG GAAACGGTCG GCCGCGGCGA AAAATGCGTC GACATACAGT TGGCCGTGGA AATGATGCAC TACGCCACCG TACCCAACGC CTACGACGTC GCCTTGTTGC TCACGGGCGA CAAGGATTTC ATGCCCGCCA TGATCCGGAC CCGCCAAAAA GCCCGCAAAG TCGGTCTCGT ATCCATGAAA ACCGGCTGCA ATCGGGCTCT GTACGAAACA CCCGGATTGA AGGATTACGA CGTCGTGTGG CTGGAAGACC ATTTGGACGA ACTCATCGTA CCGAAACGGG GAAAAGTTAA CGGCTCCAAC CATGTGGAAG CAGTGGTTTC CGTCTTTACG CTCATGAAAA TTTTGTACGA TTTTGTCACC GAATCGGGCC TGGAACGAGT GACGAGCCGG GATATCGGAC GGTATCTCAA GATTTTGAAA CTTGGGTCGC GGTCGGTTTT AGAGGAATTG AAATTGTCCT ACGGTGGACT CCGACAATTT TTGACCATGT CGGGTGTCTT TGTCATTGAA ACAAGGGACG ATCATTACCA AAAGGAAGAT CCTAGTGACA AGGCGTACTG GGTACGAGTA CGACTACCCG AAGCCACGGT CGCATTGACC GAACGGGCAC GGAGTACTCG TTTGAGTGCT GCCGAAAAAG ACTTTCTCGA AACGTACTCG CTGACTATTC TTCAAGACAA GGCCACAGCC TATTATCACT CTTTGCTGCT TTTGGATACT CTGCCCGATG CACCCAGTGT CTCCCGAGAT GCGGCCAACG CTTTGCGTGC AGACGGTGTG GAGCTGCCGG ACGATCTGAC TCGCGATTAT AGCCTTTGCA AGGTTGCCGA GCTCAAAGAC TGTTGTCGTG CTCGTGGTTT ACCAATTGGT GGAACCAAAG CAGTGCTAGT GGACCGTATT CGAAGTGATG TTGAGCAAGA AATTGCACGC TTACAAACAG CAGCGCACTC ATCACCCCGT AAGTACCAGC ATTTGAATAT GCCTCACTTA TTGTCCGACC CCAGTGAAGA AACGGGCACT ACTGTCTCCG ATGAGACGGA TACCTATCTT AAGGAACTCG TCTTTGAGTA TCTGAGAGCC AGTCACGGCC AGGCTAGTTC TCGTAACGTT GGACGGTATC TCGCCAGTAA CAAGTCTTCG ACTGGAGAAT ACAAGAAAGG TCGGCAGTCG GCACTACACG AGCTGAAAGC ACACTACGGA GGCCTGGCGA GTTTTGTAGG CCACCACGAT AAGCTTTTTG AACGACAGGA TACGTTAGGA TCAGACAACG ATCCAGCCTC AACCTATGAA TTTGGGGTCG GACTTCGAAA AGGAGCATGA TCCAGAATAG ATTACTTTTC TGTGCCGGCC TTACCTACTT TTTGACTGCG TGACGGACTG CCCCAGACAA CTTCCACTCG TCGTGGTTGG GT
|
Protein sequence | MPSSPPTGVD DHTGQALPRR RRRRRRRRRR RTVFYTSSNT PYRILLLCGL VACVHHGLVS QWVGTHAWVA VHTVLHPRRY GVPYGRDWSG QCTAHPPGQF PPPILADDED PDHASLASSR SNTNQNESAG WKETQRKVEQ QQQQIDLLLQ LVQQQQSQQS QPSQQYSPRA PAPVSSHEVT ATRAPERDPE ALSPSTNGNY SRSVPSTDSI APATANTGVV PLKAMLFIDG TWLYYSIYER TEARCPIIQR YGRGWQNRYD FHWAALPRIL CESLRDPGWS TNTAAPAHTT THESATKNAR PMEIVRASVF TSYKADTPTS SFRYQMFQDM QAANYDVHMM ETVGRGEKCV DIQLAVEMMH YATVPNAYDV ALLLTGDKDF MPAMIRTRQK ARKVGLVSMK TGCNRALYET PGLKDYDVVW LEDHLDELIV PKRGKVNGSN HVEAVVSVFT LMKILYDFVT ESGLERVTSR DIGRYLKILK LGSRSVLEEL KLSYGGLRQF LTMSGVFVIE TRDDHYQKED PSDKAYWVRV RLPEATVALT ERARSTRLSA AEKDFLETYS LTILQDKATA YYHSLLLLDT LPDAPSVSRD AANALRADGV ELPDDLTRDY SLCKVAELKD CCRARGLPIG GTKAVLVDRI RSDVEQEIAR LQTAAHSSPR KYQHLNMPHL LSDPSEETGT TVSDETDTYL KELVFEYLRA SHGQASSRNV GRYLASNKSS TGEYKKGRQS ALHELKAHYG GLASFVGHHD KLFERQDTLG SDNDPASTYE FGVGLRKGA
|
| |