Gene Information |
Locus tag | PHATRDRAFT_23314 |
Symbol | KRP2 |
ID | 7195867 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 569970 |
End bp | 572332 |
Gene Length | 2363 bp |
Protein Length | 602 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184278 |
Protein GI | 219128139 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATGAACCAT GATGCAACAA GACTATATTG ATTGCATTTG TCTTGACTGA GACACCGGTG CACACTGACA TTGACTAATT CCCCCCTTTT TGCTTGTGCA CGGCCTTTCA TTTTTGGATA CTATGAGTAA TCGACGCCGA GTACGTGTTG TTATGGTTGT CATGGTAGCC GATACTCGCC AATACTTCTT CGCGTTCCGA TGCGGCTCAA GGTTCCTGAC ATTGCTAACT TTCAAAAACA CATTCCTTCA ATACGTACCG ACTCACCACA ATTTGATTCG CTTCTTTGTT CTCGCTAGTC CTCAGTAGTT CCCCCCAAAG ATCAAACTCG TCTCAAGATT GAGCGGATGG AAGCGGAACG AGAAGAACGT CGTAGGGCTA TGGTTGAGGT GCGTCACCAG CGAATCTTCT GTATCCTGCT TTACCAACCA ACACGACCCA CGGGTCCTGT CTTGTCATTT CGATTTGTCT CTCACCAGAC CTGGTTTTCT ATCTCCGTCG TTAGCGCAAG AAAGAGCGCG CTCAGGAAGA ACAGCGAAAC ATTGCCGCTG GAAATCCAGG CGACGTCGAT TTCATTGGAA TGGTGCGACG CTGGCGACAG CAACACGCGC AAGAAATCGA GCCACACGAT TCTTGCGACG CGCATCCTCG TATTTGCATC TGTGTACGTA AACGTCCCAT TTCGGAAAAG GAGCGGGCCA GAAACGATCA CGACTCTGTT ACTTGCCTCA ATCCGGCCGT CTGGATACAC AGCTCGAAAG TACGGGTAGA CGGCATCACA AAATACCTCG ATCAAACGTC ATTCACTTTC GACCACGCCT TCAGTGAAGA GGCGTCTACC GAAGACGTTT ACAAACACAC CACCATGCCA TTGCTGGACT TTGTTTGCAG CGGGAAAGGG GGACGGGCAA CGGTGTTCGC CTACGGACAA ACCGGCTCGG GGAAAACGCA TACCATGAAT GGTATTCAAG CGATTCTTTG TGAGGACTTG TACCTGCAGC TTTCCGATCA TGCAAACGCT GGTGGATGTT CTTTGAAGAG CACAAAGATC GTGCTAAGTT TTTTTGAAAT GTACGGAGGG TTTGTGCAGG ACCTTCTGAA TGATCGCAAT CGACTCAAAG TTCTCGAAGA CGGCCAAGGG GAGATTGTTG TAACAGGACT TCAGGAAGTG GAGGCCGATA CTGTCGCAGT CTTTCACGAA ATCGTCAACA CGGGTAATTC CTTTCGGACA ACTCATACAA CGGAGGCCAA CGATACGTCC TCACGATCGC ACGCTATTTG TCAAATACTG TTGCGTGACA AGCGCACAAA CAAACTGCAG GGGAAACTGT CCTTGGTTGA TTTGGCGGGA AGTGAACGCG GCACTGATAC AAAATCTCAT AATTCCCAGC GGCGAGCAGA AAGTTCTGAT ATTAATACGT CGTTACTGGC TTTGAAGGAA TGCATTCGTG CCTTGGATAA CAAAAACAAG TCCGGAGCTA AACACGTCCC GTATCGTTCC TCGAAATTGA CTTTGATTCT CAAAGATTGT TTCACCTCAC CTGCCGCCAT GACCACCATG ATTGCAACAG TCTCTCCAGG AGCATCCGCG ACGGATCATT CTTTGAACAC GCTACGATAC GCTGGCAGGA TCAAAGAGCA ACGCGCCGGC ACCAAAGCTG GTGATATTGC CAAGAGTCCG ATGCGACGGA AAGTGTCGCC TCGGAGTGCA GACACGTCAT CTACAGCATC GGATCAACTT CCAAACAGTC TGGGTATAGA TGAAAAGACA GCAGTGGTTG AAGGGAGTGC AACCCGATTT 
ACACCGAGGT CAGGGACGAA TACAATGCTT CGTAAGGAAG TATTAGTACG TCGTAGCCCG TCGGATGATT TAGACGCCGT CTCAGGTGCG ACACAGGAGG AGACCGAGCT GCGACGCACT GTGCAAGGTT TATTTGAACA AGAAGAAGCA ATCTTGAGTA TGCACATGAG GTAAGAGGCT GCTTTTTGAA GCCGTCTCAG GGTTTGCAAT GATTCAGCCG TCCTGATGCA TTTTCTACCC TCACGTTTAG AAATATTCAG GAGAATGCTG AATTGCTCAC GGAAGAAGGA AAACTTTTAC AGGAGGTGCA AAGGGACGGA GTCGACAATG AAGCCATCGA GGAGTAGTAA GTTGTTTCTG GATCGCTGCC TCATGCAAGA CGGGAAAGCC AAGCCTCACC CATTTTTATG TGCAAACACG ACAGCATTTG CGCACTCGAA AGCGTTGTAG AAAGAAAAGA AACAATGATT TTGTCTCTGC AGGAAAAGCT TTTGGTGTTC AGCGATGCTT TAGAGAAGGA GCAGGCACTT TCTAAAAAGG TGGGTTCGCT GGCACAGTAT TAA
|
Protein sequence | MSNRRRSSVV PPKDQTRLKI ERMEAEREER RRAMVERKKE RAQEEQRNIA AGNPGDVDFI GMVRRWRQQH AQEIEPHDSC DAHPRICICV RKRPISEKER ARNDHDSVTC LNPAVWIHSS KVRVDGITKY LDQTSFTFDH AFSEEASTED VYKHTTMPLL DFVCSGKGGR ATVFAYGQTG SGKTHTMNGI QAILCEDLYL QLSDHANAGG CSLKSTKIVL SFFEMYGGFV QDLLNDRNRL KVLEDGQGEI VVTGLQEVEA DTVAVFHEIV NTGNSFRTTH TTEANDTSSR SHAICQILLR DKRTNKLQGK LSLVDLAGSE RGTDTKSHNS QRRAESSDIN TSLLALKECI RALDNKNKSG AKHVPYRSSK LTLILKDCFT SPAAMTTMIA TVSPGASATD HSLNTLRYAG RIKEQRAGTK AGDIAKSPMR RKVSPRSADT SSTASDQLPN SLGIDEKTAV VEGSATRFTP RSGTNTMLRK EVLVRRSPSD DLDAVSGATQ EETELRRTVQ GLFEQEEAIL SMHMRNIQEN AELLTEEGKL LQEVQRDGVD NEAIEEYICA LESVVERKET MILSLQEKLL VFSDALEKEQ ALSKKVGSLA QY
|
| |
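The Gene Length field appears to follow from the Start bp and End bp fields if the coordinates are read as 1-based and inclusive (572332 - 569970 + 1 = 2363). The record does not state this convention, so it is an assumption here. The sketch below shows that arithmetic together with one way a GC fraction could be computed from a sequence printed in space-separated blocks. The function names `gene_length`, `clean_sequence`, and `gc_fraction` are hypothetical helpers written for illustration, not part of any database tool, and the record does not say how its GC content value was derived.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among unambiguous A/C/G/T bases.

    Ambiguity codes such as N are ignored (an assumption of this sketch).
    Returns 0.0 when no A/C/G/T bases are present.
    """
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        return 0.0
    return sum(base in "GC" for base in counted) / len(counted)


if __name__ == "__main__":
    # Coordinates copied from the Start bp and End bp fields of the record.
    print(gene_length(569970, 572332))

    # Paste the full Gene sequence field in place of this short fragment
    # (the first two blocks of the record) to get a value for the whole gene.
    fragment = clean_sequence("CATGAACCAT GATGCAACAA")
    print(f"{gc_fraction(fragment):.0%}")
```

Because the Strand field is `-`, the same length calculation still applies: strand affects the orientation of the sequence, not the span covered by the two coordinates.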