Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49422 |
Symbol | |
ID | 7195796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 212128 |
End bp | 213992 |
Gene Length | 1865 bp |
Protein Length | 537 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184206 |
Protein GI | 219127988 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000496723 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATGTTTGTT TATCTGATCG CGACAATTGC CGTCTGGCCA CCATTGTAAG CAGCAAGCTA CTTCTACACG CTTGCCAAAG CTTTATTTTC TCATGCCTAG TAGAAGACGG CGAAGCAACA ACCTATTTTT TGAAGCCTAC TGCGTCTTTC TTCGCTACCC CAGAACAATC GCTGTAATTT TTTCCTTGCT TAATCCGAGC ACTTCATTCC AGCTACGCTC AGATCAAAGG ATCAGGCTTC CTGCCATGGG TCTCGCGGCG TCGACCCCGA AACAAGAGAA GAGAAGTGAA CCAGAACAAG CTCCAGAAAC GAAACAGATC AAACGAAGAC ATAAGACCGA GGAATGGAAA ATAGCCGTTG ATACGTTTGC AAAGCAAGTT CTGGATCCCT CCTATCTTCA CCTCCCAGTT CGAGCGAGTA CTATGGAGGC AGCGTCCTTT TATCCGTCGG AAAACAACGA GCAAGCATGC TTGATGCCCG GGACGCACAA GCATCTCGGT GGTGCGTACG ATGCTACGGA TGGTTGCATT TACGGCATTC CAGCGAGTGC AAAAGCCGTC ATGTGCTTGT ACCCCGATGG TGATAAATAC AAGATGACCA CAATTCCATT ACCGGAGGAT GCAGCAAAAA CAACTTACAA ATGGTTGCGT GGAATTTTCG CAGATGGCTA TCTCTGGGCG ATTCCGGCAT GGGCGGACTC TGTACTCTGC GTTGACGTGG ATGCTTTCTG GGGACGACGT CCAGCTTCTG GCGAGGTTGT CCAATTGATT CCTCTTCCAG ATGAACACCC TAAGGATATG CGGTGGCAGT GGCACGGTGC TGGGATGAAT AGAGAAAAGA CGGCAATCTA CTGTATTCCA TCCAATGCCC AAAGAGTGCT CCGGGTCGAC TTACGAATGA AGACTACTTC GCTGATACCA ATCGAGTACA ATCCTGACAA ATACCCGAAC TTTCGAATTG AATTGGCAAA CAAGTGGTAC GGTGGTATTC TTGGGGAAGA TAATGCTGTT TATGGAGTAT GCTATCGCAG TTGCGCCGTT TTACGAATCG ACTGCGAATC TGATACCGCA TCACTTGTTG GCCCCGATTA CGGATGTGCA GGATACAACT GGCATGGTGG AGTCAAGACA AATGGAAAGA TATACGCGCA TCCCTCACAC GCACCCGATT CAGTGTTGGT TATCAACACC AATCCCGACG ATCACAAAGA GGACCCATGC TCGGAACTGC CAATCAAACG CGCCGAATAC GACAAAGACA TCAGAATAAA TTACAAATGG CTCGGTGGGG CCATAGGGGC AGATGGCAAT ATTTACTGTC CTGCATGCGA TACGTCCTCC GTACTCAAGA TTGATACACA GACTGAAGAG TGCACAACTT TTGGGTTTGT GGGAACTTTG AAGAATAAGT GGCAAGGTGG TATTTTGGGT CGGGACGATT GCATATACTG CATACCTGCG TCCGGACACC ATGTTTTGCG TATCTTTACT AGTCCAGACA TAGTTGGAGA GAATCCGATA CAATTAATCG GGCAGATGCC GGTGCACAAG GACAAATGGC AAGGAGGCCA CGTCGGAAAG GATGGGGCTT TGTACTTCAT TCCTGAGAAT GGATACCGTG TTTTGAAAGT TACGCCACCA AACGCGCCAC CAAGTCTTGT CAATGGAAAG CTGCCGGAAG GCGATGTGAA AATAGAAATG ATGTAGGTGG TAGCGACTTC ACTGCCAATT TCTGGTGTAG AATGCAATCC ACACCACTTG TTCCTCTGAT TATAACACAA TAACAAAAGC ATGTGCTCCC AGCCACAACT ACGGCCGAAA GCAATACTTT CCCATTTAAA GGAATAATAA CATCCAAGAC AACTG
|
Protein sequence | MPSRRRRSNN LFFEAYCVFL RYPRTIAVIF SLLNPSTSFQ LRSDQRIRLP AMGLAASTPK QEKRSEPEQA PETKQIKRRH KTEEWKIAVD TFAKQVLDPS YLHLPVRAST MEAASFYPSE NNEQACLMPG THKHLGGAYD ATDGCIYGIP ASAKAVMCLY PDGDKYKMTT IPLPEDAAKT TYKWLRGIFA DGYLWAIPAW ADSVLCVDVD AFWGRRPASG EVVQLIPLPD EHPKDMRWQW HGAGMNREKT AIYCIPSNAQ RVLRVDLRMK TTSLIPIEYN PDKYPNFRIE LANKWYGGIL GEDNAVYGVC YRSCAVLRID CESDTASLVG PDYGCAGYNW HGGVKTNGKI YAHPSHAPDS VLVINTNPDD HKEDPCSELP IKRAEYDKDI RINYKWLGGA IGADGNIYCP ACDTSSVLKI DTQTEECTTF GFVGTLKNKW QGGILGRDDC IYCIPASGHH VLRIFTSPDI VGENPIQLIG QMPVHKDKWQ GGHVGKDGAL YFIPENGYRV LKVTPPNAPP SLVNGKLPEG DVKIEMM
|
| |