Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42424 |
Symbol | |
ID | 7196636 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 18350 |
End bp | 20109 |
Gene Length | 1760 bp |
Protein Length | 551 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176998 |
Protein GI | 219110493 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.960008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGAAT CTCCCTCGCG TAGCCTTTTC AAGAAGGCTG GTTTTGTGTT CACAATCGCG TTATCCCTAT GGGCGATTAC TGACGAGAAC ATTTCTGTCG GAAAAGCGCT GCAACAGCAC GAACCGAGTC TGCGAGTGTT TCGATCCTTG CTCGAAGTGA ATCTTTTATT TTTCTGTACC GCGGCGGCAT TGTTTGTCTG GTCCAAGACC ATAGGACAAT CAACAATTGA AGCGCTCTTG TTTCAACCAC TCAAGCTTTC CGGGGCACAC CCGGAGAACG CAGACCGGCA TGTGTACGCC ATGACCGAAG TCAATGACGA CGAAGTTCTC CAGGAAGACG ACGCTCTAGA TGCCGATTAC GCCAGTGATG GAGAAGTCGA TCGGCAAGAA GACTTAGACG AGAAACTATA CATCCCCACC GCGGCTTCTG TTGCCAATGC TGCCTTAAAC ATGCTCTTTA CTATTCTGGT CGTTCTGTTC TTGTTCACGT TAAGCTCGAT CTCCACCGCC AAGCATTCAG TCGAAGAAGT AGCATCCAAC ACGAACAGCA GTGGCCTCTG GGATCTCTTT TCACGTGTCA CGGCTCCAGT TTTCCCTCTG TTGCTCTTTC TTTACTTTTT GCTACGGGCT ATCTTCCCCT GGAGGCGTAA AAGATCGTTT TGGGCAGTTG TTTTCATGAC GATGTCCGCC CCGTGGCATC CAGTAGACTT TCGGGACGGC TTTATCGGTG ATATCATTAC TTCTTCAGTA CGACCGATGC AAGACATTGC TTTTACCGTA TTTTATATCC TATCGGGTCT AAGAGGATGG TGGTCACGAG AATATCGAGA CGGCAACTTT ATCGATTCCG CGGATGCGAG CGTTCCAGCA ATGGAACGAT CATGGCTGTT ACACACTGTC GTACTGCCAA TGTGCATGGT CAGGTACGCA TACATGCATA TACACGGGTA GCAAATACGT GTTAGTTGCT AACGTTGGGA AATCAGTATC TAGCCTCACT TACCGTCCCC TTTGCTCATG GTGGTAGTCC CCTCTGGTGG CGATTTCTTC AAAACCTTCG ACAAAGCTAC GATAGCAAGC AGCGCTGGCC GCACCTTGGC AACGCTCTTA AATACTGTTT CGCCGCCCAA ATTGCAATGT TTGGTGTATT CAATCCCGAC CAAAAAAAGA GCGTTCTCTG GTTAACAAGT TTTGTTGGCG CTACTTTGTA TCAGCTTTGG TGGGACATCT TTATGGACTG GTGCCTATTG GTTCGTGTGG ACGAGCGCTG GAAACTTCGT AGTACACGTC TGTACACCAA AACATCTGTA TATTGGATTA TCTGTGGGGC AAACTTAGTT TTGCGTTTTT GCTGGACTCT GAGTTTTGTC CCGCCGCGCT ATCTAAATGC CTCCGGCGTT CTGAAAGAAA GCTTCTCAGG CGATGTGAAG AATATCCTGG GCCCCTTTAT TGCTTCCGCC GAAATTGTGC GAAGGGCTCT ATGGGGACTG CTGCGTTTTG AATGGGAGGC GACGAAGAGA TACAGTGATC GTAAATCATC GTTTGACGAA AGTCAAGACG GTTTGAGAAA TGAAATCGAA CTTACACCGA TGAAAATAAA ACAAGATGAG TATCGCAAGT CTTCCAATGC TTTCTCCGTA GGCCATTCTT GGAAAATGTC CTCGATGAAT GAGGTTCAAA TAATTGGCGA GCTTGGTGTA TACGCGACAG CTTTCTGGTT GATTGGGACA CTAGCCGCCG CACATCGAGG AACTTTGTAG
|
Protein sequence | MVESPSRSLF KKAGFVFTIA LSLWAITDEN ISVGKALQQH EPSLRVFRSL LEVNLLFFCT AAALFVWSKT IGQSTIEALL FQPLKLSGAH PENADRHVYA MTEVNDDEVL QEDDALDADY ASDGEVDRQE DLDEKLYIPT AASVANAALN MLFTILVVLF LFTLSSISTA KHSVEEVASN TNSSGLWDLF SRVTAPVFPL LLFLYFLLRA IFPWRRKRSF WAVVFMTMSA PWHPVDFRDG FIGDIITSSV RPMQDIAFTV FYILSGLRGW WSREYRDGNF IDSADASVPA MERSWLLHTV VLPMCMVSPL WWRFLQNLRQ SYDSKQRWPH LGNALKYCFA AQIAMFGVFN PDQKKSVLWL TSFVGATLYQ LWWDIFMDWC LLVRVDERWK LRSTRLYTKT SVYWIICGAN LVLRFCWTLS FVPPRYLNAS GVLKESFSGD VKNILGPFIA SAEIVRRALW GLLRFEWEAT KRYSDRKSSF DESQDGLRNE IELTPMKIKQ DEYRKSSNAF SVGHSWKMSS MNEVQIIGEL GVYATAFWLI GTLAAAHRGT L
|
| |