Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48825 |
Symbol | |
ID | 7195126 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 364500 |
End bp | 366170 |
Gene Length | 1671 bp |
Protein Length | 443 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183474 |
Protein GI | 219126458 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGACCATA GCGCGTACAG CATTCACTGT CAAGAGCAAT TCTCCAACCA GCAACTCTGA CTTTTCTCAA CGACGAGTAT GAAAAAATCT GTAGATATTA GGGCAAAAGC TTCCTCAGCC AATAGTGCTC TGTTTCGCGT ATTGGTGAGT TCCATAGGGT GCTTGTTGAT GACTTCCTTT CTCATCAACG TGCTCTATCA TTCGAGATTG CCGTCTCCAC AATCCGATCT CTCCCAACTG ATTTCTCGCG AGTCATCCTA CGTACAACAC CTTAATAAAA CGTTGTGGAG CGGCACAAAC AAGGCCCAGC AGGTGGAAAA ACGCAACAAG AAAGGACAAA TTTTACACCA CAGTATTACG GCGATCAAAC AAGAGGATCT ACGGGGCGTC GGCGACGACG CAACCATACC AAAGAGAGAA TCAGATCAAG CCAATAGGGA CATGGGTCAA GCACGCCAGG GTCGCGAAGA GTTGCTGGCA ATTCTGAAAG ATGCCGGAGT CGAAGACATT GACGTGGCGA GCGTTTTGAA ACTCCCATTG TGGTCCGAAG TTCAGGCTTT GTACGGTGAC GGTCCGGTCG TCTATGGCCT CGATACTTGC GAAGATTTCC GCGCGAATAT TCCTAGAGAA GATGCTTCTA TTGGCACGGC TGGACTGTTT AATACCGGCA CCAACCCCTT CAACATGTAC CTGGAAGGTA ACTGTATCAT GCCCGAAAAC GAACACGACC ATCACGGCGG CATGAGGTGG CAAGTTCCGT GGGGTAAGCA TATGCTGGCG AGCCGCAAGT GGAACAACAC GGCTGGGCAC GACCATAAAG TCAATAAATC CAACGTGCTA CCCGTTGTAC TGATTCGAGA TCCCTACTCC TGGATGCAAT CCATGTGCAA ACACCCATAC GCCACGTCCT GGTCCAAGAA TGTGTCCCGG CACTGTCCCA ATCTTGTGCC TACCCCGGAA GACCGAGCAG AGTTTTCCGA GTTGGGCACG AACGAGTCCA TTCCGGTGCG TATCAACTAC CCCAATAAGC CGGCCGATTA CCCGTCGTTG GCCCATTTTT GGAGCGACTG GTATCAACAA TATTTGCAGG CCGAGTATCC CAGGCTTTTG ATACGCTTTG AAGATTTGAT TTTTCACCAG AAAAGTCTAC TTTCGATTGT TTGTGCATGT GCCGGAGCTA TACCCAAACA AGATACGTTT TCGTACGTCG TCGATGCCGG TAAATGGGGT TCCGCTCACA AAGGAAGGTA AGCAGCAGCT TCCAGCAGAT GGTGCTTTGT GATTTGATGT CGCAACACAT CCAGTAACAC AATTTTGGGT ATTCTGTATG TACGATAATT GCTAGCTCCA ACATGATCTC CGCCATGGTG AAGTACGGAA GCAACAAGAA ACGCTTGAGC GGTTTGACCA ATGAAGATTT GAGCTACGCC CAGGAGCATC TGAGTGCACA CCTGATGACG CTCTTCCAGT ACGCCCAGCC ACCAGAAGGA GGGGCAGCAC TCTCATGATA CTAGAATTTG AGCTAGTGCT TTGAACGGAA CGCTTGAAAG CTGCCTTAAA AGCTTTGCCC TTGAACGTTT CAGAATGGAA CGAAAGCTCG CTCTTTCATA GTACAAAATA TCTCTAAACA ATAGCTATGT ATCCAAGATT TCACCATCAA CCCAAGGTTT C
|
Protein sequence | MKKSVDIRAK ASSANSALFR VLVSSIGCLL MTSFLINVLY HSRLPSPQSD LSQLISRESS YVQHLNKTLW SGTNKAQQVE KRNKKGQILH HSITAIKQED LRGVGDDATI PKRESDQANR DMGQARQGRE ELLAILKDAG VEDIDVASVL KLPLWSEVQA LYGDGPVVYG LDTCEDFRAN IPREDASIGT AGLFNTGTNP FNMYLEGNCI MPENEHDHHG GMRWQVPWGK HMLASRKWNN TAGHDHKVNK SNVLPVVLIR DPYSWMQSMC KHPYATSWSK NVSRHCPNLV PTPEDRAEFS ELGTNESIPV RINYPNKPAD YPSLAHFWSD WYQQYLQAEY PRLLIRFEDL IFHQKSLLSI VCACAGAIPK QDTFSYVVDA GKWGSAHKGS SNMISAMVKY GSNKKRLSGL TNEDLSYAQE HLSAHLMTLF QYAQPPEGGA ALS
|
| |