Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42643 |
Symbol | |
ID | 7195998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 684151 |
End bp | 687876 |
Gene Length | 3726 bp |
Protein Length | 1238 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176629 |
Protein GI | 219109751 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.107643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGAG CAAAGAAGAA TCCCGCCAGT AGCGTGAAAG TCCGTTTCGA CGGACTCGAC GTGACGGCCA TGGTGTCGCA CGTACAACGC CGCTTGCTCG GACGCAAAAT TATCAACGTC TACGATGGCG ACAACGGCGA AACGTACGTC TTCAAGCTGG ATAGTAGTGG TGGGACTACT ATCAGCAACA ACAACAACAA CACTAGCAAC TCTAAAGAGT TCTTGTTACT GGAGTCGGGA ATTCGCTTTC ACCCGCTGGA GCATTTCGAG TCAAACTTGC CCATGCCGAC ACCGTTCTGT GCCAAGCTGC GCAAGCATTT GCGGGGACTC CGACTGGAGC AAATATCGCA AATTGGGACC GATCGAGTGA TACTCTTGCA ATTTGGTTCC GGAGCTTCCC GGCACGCTTT GATACTGGAA CTGTACGCCA AAGGAAACAT TATCTTGACG GAGGGGATTC ATTACACCAT ACTGGCACTT TTACGATCGC ACGTCTACGA AAAGGATCAG GTCGCCGTCC AAGTTGGACA GGTCTATCCT GTTACGTATG CCACATCGGT ACAAAAAGAC AACCAAACCG TAGCGAATGC TGTTGCGGCT ACCGATACTC AACCCGAAAA CGATCCGTCA CCAACTTCTA GAATCATGGA CACTGCATGC GCTGCCAAAA ACAAAAATGG TATTCTAAAT ATGTCGATTG AGGAGATTCA AGCATCCTTG GCGCTGCTAC TCGAGCCGGC ACCGGTATCA GCAACGACCA AAAAAGGGAA AAAAGGAAGC CCGCTCAACT TGAAAACGCT CTTATTGCAA CCCCAATGGG GTGTCTCTCA GTACGGACCC GCACTACTGG AGCACTGTAT TTTACAGGCA AATCTGCTAC CGCATGCATC GATCAAGGAG ACCGTGCTGC AGGCGGCTGA TTGGGAACGA CTGCAAACAT CGCTAAGCGA ACAAGGTCCT GCCATCATGT ATAATCTACA CTCGGCAGCG ATCGACACGC CCGGCTACAT TCTTTATCAA CCTCGCGTGG AGGAAGATAT CGTTAACGGC AAGCCGCATT CTGAAAATCT GTCGTCGGCA GTTGCAGTCG TGGCCAAAGA ATTAGCACAC GCTGATAAAG TACTGCTCGA ATTCCAACCC CACTTGCTTG CCCAACACCA GAATTGTCCC CGGTTGGAGT ACAAACACTT CGGCGCCGCC GTGGCTGACT TTTTCGCGCA TATGGTTGCC CAGAAACGCC TTCTCAAGGT CCAAGCCTCG GAAATGGCCG TCCAAGAAAA ACTGCGGAAA GTACAACAAG ATCAAGCCGA TCGCGTGATG GCTTTGGAAC GCGACCAGCA AACGCTACAA GCTTATGCCC AGGTAGTCAA GAACAACGCG GAAAACGTTG ACAAGGCCTT GCTAGTGATA AACTCAGCTT TGGATAGTGG TATGGATTGG GATCAACTGA TTGAACTTGT GAGTGTTGAA CAGGCAAATA GAAATCCGAT TGCTAATTTG ATTGTCCGCT TGGAATTGGA AAATGAAATC ATGATACTAC GACTGCCTCG AGACCCGTTC GACGAATTGT CTGACGTGTT GAATGTGAAT GTGTCGTTGA AAGATTCGGC GCATGCCAAC GCCAGTGCGC TGTTTGCAAA GTATAGGGCA TCCAAGGAGA AGACACAAAA AACTCTTGAA TCGTCAAGTA AGGCTTTACA GGCGGCCGAA GAAAGCGCCC AACGGCAATT GATCGAAGCC CAACGACGCA CGAAACAAAC TGTCGCTGCC GTCAAGCGCA AGCCAGCTTG GTACGAAAAG TTTCACTGGT TTGTCACTAG TGACAACTAT CTGGTGCTAG GCGGTAAGGA CGCCCACCAG AATGAGTTGT TGGTCAAACG ATACTTGCGG GCCGGGGACG CTTACTTGCA TGCTGAAGTG CACGGAGCTG CCTCGTGTAT TCTTCGTGCT AAACGTCGAC GACTCCCGAA CGGAGCCACC CAGAGTATAC CCTTGTCTGA CCAGGCCCTG CGGGAAGCGG GCAACTTTAC AATTTGCCGG TCTTCAGCAT GGGCGAGCCG CATGGTCACG TCCGCTTGGT GGGTGGAATC GCACCAAGTA TCCAAAACTG CACCGAGTGG AGAATTCTTA ACCGTAGGGT CATTTATGGT ACGAGGTAAA AAGAATTTCT TGCCTCCGAG TCCACTAGAA ATGGGCTTGG CCGTGCTGTT TCGGTTAGGC GACGACGATA GTATTGCCAG GCACAAAACC GAACGTAGAG ACTTTGCCCT GATTGAGTTA GAGAATTCTA GCGTGGATGT GCTCGACGCC GTATCGTCGT TTCAGATGGA GCCGAAGACA AATATTGAAG GTCAAGAGGC TACGACACAC AGAGACACAA CAGAGCACGA AGGATCCGAT TTAGTATCGG ATGAGGTCTG GATGACGCTT CCGAAAGTCA TCGTCTCAAA CAGCACGTCT AGCGCTGAAA ATCTGATCAA CGATCCTACG CGCGACGACG GTAGTTGTGG AAGCGATGGC AACGAAGAAG CCAAGAAAGG GTCGACCACA AACGAAGGAA ATGGACGCCG TACAAAAAAG GGCCTTTCGG TCAAAGAAAG GAAGCAAATG AAGAAATACG GTTCGCTCGG CGAAGCTAGG AAGTTGCACT CAACAGTTGC AGTTGACAAG TCATCCACAG AGGATACCCA CGGTCAGCAG CCTGTTTTGC CCTCCTTGGA CGGCCTCATT GACGCGAGCA AACTGAAGAG AGGCAAGCGA GCGAAAGCGA AACGTGCCAT GCTAAAGTAT ATGGATCAGG ACGACGAAGA TCGAGAGTTG GCGATGCTGG CACTGCAAGG AGGTGAAGGA AAAAATCGAA AAAAGGGCAA GAATAAACGC AGCCAAGGAC CTGTGTCAGC AGCGCAAAGT CAAGTCGCAG GAGAAACTGC TGCATTATTA GTAAGAGATA CGTCCGAGAC CATTGAACAG CTACCCGGTC AAGTCGTATC TATTTTGCAA GAATGCCTAA CTGCAAACAA TGGACTAGGC AAGCATAACG AAGCGATTCG CTGGGATAAA CTTGACTCCG ACACTGTGGA GCAGCTTGTT GCGCTAGAGT CCTTGGACGC GCAAGTAGCT GCCGCAACAC GTCTTTTGAA TTTAAAAACG AGTACTCGAG TGGACAACTT CTCAGCGAGT TTAGGTGGTA TTATTCGTAC TATTCGAAAA TACGGATACA GTTGTCTCGA TGATGAAAAG ACCGAAGTAC TAGAAAAGCC AAAAAGGAAG ACGAAAGCGC AGAAAGATGT TGAAAGTACA CAGTGGAAGC AAACAATGGA AGAGGAGGGA GTTGTTGGTA GCGACCTGGA TGAAGATGCG GTCGATGATA CGATCGAGCT GAGCAAGTTA TCTGGAATGC CTCAAGCGGA AGATCTTGTT CTCTATGCAG TACCAGTCTG CGCACCTTAC CAAACTCTTT CAAAGTACAC ATATCGTGTC AAACTCACAC CTGGTAGCAC GAAGAGAGGA AAGGCTGTCA AGCAGTGTGT GGACATGTTT TTAAAGAACA TGGTTTTGAA AGAGCCTTCC GCCTCGGAGC ATTGTACAGA ACTCATCAAG AAGCTCGGAG ACAACGATTG GGTACAAGTT ATTTGTGCAG ATGTAAAAAT ATCTGCACCG GGGGCTAGCA AGACAGCGAA GAAGCACAGA GCAATCACTA AGAAGAAAAA CAAATAGTGA ATACTT
|
Protein sequence | MQRAKKNPAS SVKVRFDGLD VTAMVSHVQR RLLGRKIINV YDGDNGETYV FKLDSSGGTT ISNNNNNTSN SKEFLLLESG IRFHPLEHFE SNLPMPTPFC AKLRKHLRGL RLEQISQIGT DRVILLQFGS GASRHALILE LYAKGNIILT EGIHYTILAL LRSHVYEKDQ VAVQVGQVYP VTYATSVQKD NQTVANAVAA TDTQPENDPS PTSRIMDTAC AAKNKNGILN MSIEEIQASL ALLLEPAPVS ATTKKGKKGS PLNLKTLLLQ PQWGVSQYGP ALLEHCILQA NLLPHASIKE TVLQAADWER LQTSLSEQGP AIMYNLHSAA IDTPGYILYQ PRVEEDIVNG KPHSENLSSA VAVVAKELAH ADKVLLEFQP HLLAQHQNCP RLEYKHFGAA VADFFAHMVA QKRLLKVQAS EMAVQEKLRK VQQDQADRVM ALERDQQTLQ AYAQVVKNNA ENVDKALLVI NSALDSGMDW DQLIELVSVE QANRNPIANL IVRLELENEI MILRLPRDPF DELSDVLNVN VSLKDSAHAN ASALFAKYRA SKEKTQKTLE SSSKALQAAE ESAQRQLIEA QRRTKQTVAA VKRKPAWYEK FHWFVTSDNY LVLGGKDAHQ NELLVKRYLR AGDAYLHAEV HGAASCILRA KRRRLPNGAT QSIPLSDQAL REAGNFTICR SSAWASRMVT SAWWVESHQV SKTAPSGEFL TVGSFMVRGK KNFLPPSPLE MGLAVLFRLG DDDSIARHKT ERRDFALIEL ENSSVDVLDA VSSFQMEPKT NIEGQEATTH RDTTEHEGSD LVSDEVWMTL PKVIVSNSTS SAENLINDPT RDDGSCGSDG NEEAKKGSTT NEGNGRRTKK GLSVKERKQM KKYGSLGEAR KLHSTVAVDK SSTEDTHGQQ PVLPSLDGLI DASKLKRGKR AKAKRAMLKY MDQDDEDREL AMLALQGGEG KNRKKGKNKR SQGPVSAAQS QVAGETAALL VRDTSETIEQ LPGQVVSILQ ECLTANNGLG KHNEAIRWDK LDSDTVEQLV ALESLDAQVA AATRLLNLKT STRVDNFSAS LGGIIRTIRK YGYSCLDDEK TEVLEKPKRK TKAQKDVEST QWKQTMEEEG VVGSDLDEDA VDDTIELSKL SGMPQAEDLV LYAVPVCAPY QTLSKYTYRV KLTPGSTKRG KAVKQCVDMF LKNMVLKEPS ASEHCTELIK KLGDNDWVQV ICADVKISAP GASKTAKKHR AITKKKNK
|
| |