Gene Information |
Locus tag | PHATRDRAFT_43447 |
Symbol | |
ID | 7197161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 476673 |
End bp | 478688 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177627 |
Protein GI | 219111751 |
COG category | |
COG ID | |
TIGRFAM ID | |
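The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based, inclusive coordinates, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 476673
END_BP = 478688
GENE_LENGTH_BP = 2016
PROTEIN_LENGTH_AA = 671


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the reported Gene Length (2016 bp) matches End bp minus Start bp plus one, and the reported Protein Length (671 aa) matches 2016 / 3 minus one stop codon.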
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGTGCCG CTGCCAAGAA TACGTCCTCA AGGAGTCACC ATCCCAAAAA GCAACGCGTC TCGGCCAAGC ACACGGCACA TGATCAACCC CCACGAATGG CACCCACCGA AAAATTGGAA CAAGGACGTC ATCAACACCA ACACCACCCT ATCGTCGAAC ACGACGATGA CCTATCCCAC GCAACGTCTG ATCGAAACAC TGGGACGTCC TTGTCCGAGA CTACCAAAGC GACCCGTACA GTGTCGGAAA CGTGTGATTA TATTGCCGAG TTGAGTGAAG CCATACTGGA ACAACCGGAC AAGGCCTTTA TCAGCAGCGA AATTCCCAAT CCGGCCAATC CGCGGTATCC CAAACAGGGG CCGTCCAAAA TGAAGCAGTT GCTCGTTCTC GCCAACGCAT CCGTAGTGCC TCGGCACAAC AACCACAACA CCGATAACGA CGATCCGTCG TCCCACAGCG CGTATACCTC ACAGCTAGCG ACAATGTCGT TGCTCGCCAT TTTTCGAGAC ATTCTGCCCT CCTACCGGAT CAAGCTCCCG ACCACGCAAC AAGCGGCCGT CAAAGTTTCG AAAGAAACCA AAGTACTTTG GGATTACGAA CGTGCACTCC TGCAATCCTA CCAGGAATAC CTACAAATCC TCGAACACTG TTGGGATGCC ACCCGCACTG CTCCGCATCC GTCCCAACTA GGGGTCACGA GTATCCTCAG TCTCTGCGAA CTCCTCAAAT CGGCGTTTCA TTTCAATTTC CGCTCCAACC TACTCACGGT CGTGAGTCGC CATACGAATC ATCCCAGTAC CGTGGTCGGC GATGCGTGCT GCGCGGCCAT AGCCTACGTC TTTGCGCACG ATGCACAGGG CGAAGTTGCG CTCGAAGCTA CCCGGCTGCT GGCCAAGTTC GTCAAAGATC GGGCCTTTAA AATTCGACCC TCCGTTCTCC GGACCTTTAC CAGTCTACCC CTCCGCGTGC ACGTGGACGA AGCCCAAGCG GCGAAACTGG CGGCCGCCGC CAACGCCAAG AAACGCAAAA AAGACAAGGA ACTCGCCGAA ATTGACGCCG AACTCAAAGA AAGTGACGCC AAGGTGGACA AGATTATACT CGCACGGTGC CAATCGGAAA CGCTTCAACA CGTTACGCTT ACGTACTTTC GGATTCTGAA GCACGATAAT TTGCAAGCGG CACACGTCGA GACTCTGTTG CCGGCCGCGC TGGAGGGTTT GGCCAAGTTT GCTCATCTCA TCAATATTGA TACCGTCATG GATTTACTCG GCGTTTTGAA GGATTTGCTG AAAAAGATGA ACGCACTACC TCTGGAGGCC GCGCTCAATT GCATTTTGAC AGCGTTTCAA ACCTTGCAGG GGCCGGGGAA GGAAATGAAC ATTGACGTCA AGGAATACAT TGTTCCGCTC TATACTCAAT TACCGCGTCT GGTGGGGGAC GTTAATTGTC GTCGGCACTT GCCCACGGTA CTGCTCTGCT TGAATGCCGC CTTTATCAAA CGCCGTGAAT ACTCAACGAT TCGAGTTTCT GCCTTTTGGA AACAAATCCT GACCGTTTCC TTGCACGTAC CTCCGCACAC GGCGGTTCCG TTGATAGCCT TTGGACGGCA ACTTCTCCAA CGATATCCCG TCACACACCA GATGCTGGAA AATGAACAAG ACGTGATTAC GTCGGGAGAG TATACACCCG ACGTGGAGGA TCCCGAGCAC AGCAATCCTT TGGCCACGTC GGCCTGGGAA TTAGCCTTGG CCAAATTCCA CGTGCACCTT TCGGTTGTTC AGCAAGCACA GAGTACCGCA 
ACGTTAAGGC TACCCAATCT CCCGACCGAG AGTCCCGAAC GCTTGTACCA GGAACTGTTT CGTGCGGAGG ACGAGCTCTT TTTCTCCTTC CAGCGTGTGC GTAAAAAGCA TCCGTTGACA CCGCCGAAGC AGGATGGTAG CAAGAAACGG AAGCAGTACC GTTTCCTCAC GCCGCGGGCG ACGGAATCAT TCTTGTTGAA AGCGAACGCA TTGTAG
|
Protein sequence | MGAAAKNTSS RSHHPKKQRV SAKHTAHDQP PRMAPTEKLE QGRHQHQHHP IVEHDDDLSH ATSDRNTGTS LSETTKATRT VSETCDYIAE LSEAILEQPD KAFISSEIPN PANPRYPKQG PSKMKQLLVL ANASVVPRHN NHNTDNDDPS SHSAYTSQLA TMSLLAIFRD ILPSYRIKLP TTQQAAVKVS KETKVLWDYE RALLQSYQEY LQILEHCWDA TRTAPHPSQL GVTSILSLCE LLKSAFHFNF RSNLLTVVSR HTNHPSTVVG DACCAAIAYV FAHDAQGEVA LEATRLLAKF VKDRAFKIRP SVLRTFTSLP LRVHVDEAQA AKLAAAANAK KRKKDKELAE IDAELKESDA KVDKIILARC QSETLQHVTL TYFRILKHDN LQAAHVETLL PAALEGLAKF AHLINIDTVM DLLGVLKDLL KKMNALPLEA ALNCILTAFQ TLQGPGKEMN IDVKEYIVPL YTQLPRLVGD VNCRRHLPTV LLCLNAAFIK RREYSTIRVS AFWKQILTVS LHVPPHTAVP LIAFGRQLLQ RYPVTHQMLE NEQDVITSGE YTPDVEDPEH SNPLATSAWE LALAKFHVHL SVVQQAQSTA TLRLPNLPTE SPERLYQELF RAEDELFFSF QRVRKKHPLT PPKQDGSKKR KQYRFLTPRA TESFLLKANA L