Gene Information |
Locus tag | PHATRDRAFT_44749 |
Symbol | SAE2 |
ID | 7199872 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 124330 |
End bp | 126535 |
Gene Length | 2206 bp |
Protein Length | 643 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | sumo-activating enzyme 2 |
Protein accession | XP_002178935 |
Protein GI | 219116280 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.97194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTAAGACTG CGGTCGGCAG TGCAAATCGT GTTGCACTTC GTTCGGTCGT GACGCCGACT GCGTTTCAGA CCATTGCAAT CTCGTTGGCG TAGTCACAGA CATCTAGTGC GCTTCGTTGG TGTCACCGTA AGAATTACTT TTGCTCGTTT CACACGTCTG TTGTTTTTCT ACTTCCATGA CAACCGCATC GTCGTCCGTG CCGCACTTAC TGAGCGGCAT GGAAGCCACT CTCGGAACGA ACATGCTGAC CAAGATTCAA AGCAGTAAAA TCTTGCTCGT GGGTGCCGGT GGGATTGGCT GCGAATTGCT CAAGAATCTC GCACTTACCG GCTTTCGACA CGTCCAAGTC ATTGATCTCG ACACCATCGA CGTGTCCAAT CTCAATCGCC AGCTCTTGTT TCGATCCCAG CATGTGGGCA TGCCCAAATG TACCGTGGCT TGTCAAGTTG CCACGCAAAT GGTACAAGAC CCTTCTCTGG TTTCGTATAC AGCCCATCAC GGGAACGTCT GTGACAACGA CACATTCAAC GTGCAGTTCG TCCAACAGTT TGATCTCACC TTGAACGCGC TGGACAACGT CGTCGCCCGG CGTAGGGTCA ACCGACTTTG CTTGGCCGCC GGAGTACCAT TGATTGAAGC GGGTACCACG GGATACCTTG GTCAAGTCAA CGTCATTGAC AAGGAAAGTG ACGTTGCCTG TTACGAATGT CAGACTCAGG AAACACAAAA GGTGTACCCC ATTTGTACCA TCCGATCCAC GCCGTCCATG CCAGTCCACA CCATTGTTTG GGCGAAGGAG CTGTACAAAC TATTGTTCGG CGACAAAGTG GAAGAATCAA TGTTGTTTGA GGATACGACG GCACCGGATG CCGAGCCATC GACCTACATG TCGGCGGTGT TGAGTTTTCG TCGGGCGCGG GCTGCACGGG ACAGCGACGT CGTGCGTACC GCGGCCGGGG AAGTTGTCAC CAAACTGTTC GTGGACGAGA TTCAGAAGCA ACTCGACATG GGCCGATACA AGACGGCGCG CAAGACACCA GCCGTCTTGC CGACGAGTGT CATTGTGGAC GCCACCACTA CGGTACCACC GACGGCCAAG CCGTCCTACC GGACGACGGA TCTGTGGACG CCGACTGAGT GCGTGGCCGA GTTCATCGCG TGCTTGGAGA ATGCGGCCAC CGCAGCCACC GTCTTACCGT CTTTCGACAA GGATGATACG CTAGCAATGA GGCTGGTGAC AGCGTCTTCG AATTTGCGCA GTTTTGTCTT TGAGATTGAA CCTTTACAAA GCTTTTACTC GGCCAAGGGG ATTGCCGGCA ACAGTACGTG CACGGCCATG CGAAACATAG GTATTTGATG TCTACGCCTG CTCACACACG CCAATTTCTT TTTCCGTTTT TTCCCTGGCG TGCGTTTACA GTCATTCCGG CGATTGCCAC AACGAATGCG ATTGCGGCCG GGTTGCAGAT CCTACAGGCC TTTCAAGTCC TCCGCGCCCA ACTCGAAACC GGCACCAAGT CGGCCGGCAA GCTGGGTGAG TACTGCTCCT ACATTAACTG CCTGCGCAAC TCGACGCGGA ACGGTCTCTT CTTGACAGCG TCGAATTTGG AAAAGCCCAA TCCACGGTGC TTTGTCTGTC GCAACGCTAC CGTACCACTC GCGCTGAACG TGAACAACTG GACTTTGCAA GACTTACTCC AGAAGCTAAT CAAGAAAGAT TTGGGCTTTG AAGAGCCGAC GATTACGCTG GATGGGGACA TTGTTTGGGA AGAAGGGTCA GACGCGGACT CGGAGGCGTT 
TGCCGTGAAT TTACCCAAAT TACTGCCACA ACTCCCTTGT GGTGGTATTC AGCACGGAAC GGTTTTGCGC ATTGAAGACT TTTCGCAAGA TTTGACCGTG GACGTGGCGG TGACACACCA AACGGTATGG GAACGGGGCG ACGAGGAGGA TGACGACGAT GATACGTACC AGTACGTGCT GAAGGGATCC AAGCCGACCG CTTCGGCGCT GCACGTTCCC TCCAACGGTG CGCTCAACAA CGGGGTGGGT ACGAAGGTGG AGGAAGCGGA GGATGACGAT GATATTGTGG TGGTGATGGC AGCGGACGCG AAAGGCAAAC GCAACCGGGA GACGAACGGG GACGGCCCCG TGAACAAACG GCAAAAGATG TCCATTCTTG AAGCCGACGT CATTGAGATT AGCTAG
|
Protein sequence | MTTASSSVPH LLSGMEATLG TNMLTKIQSS KILLVGAGGI GCELLKNLAL TGFRHVQVID LDTIDVSNLN RQLLFRSQHV GMPKCTVACQ VATQMVQDPS LVSYTAHHGN VCDNDTFNVQ FVQQFDLTLN ALDNVVARRR VNRLCLAAGV PLIEAGTTGY LGQVNVIDKE SDVACYECQT QETQKVYPIC TIRSTPSMPV HTIVWAKELY KLLFGDKVEE SMLFEDTTAP DAEPSTYMSA VLSFRRARAA RDSDVVRTAA GEVVTKLFVD EIQKQLDMGR YKTARKTPAV LPTSVIVDAT TTVPPTAKPS YRTTDLWTPT ECVAEFIACL ENAATAATVL PSFDKDDTLA MRLVTASSNL RSFVFEIEPL QSFYSAKGIA GNIIPAIATT NAIAAGLQIL QAFQVLRAQL ETGTKSAGKL GEYCSYINCL RNSTRNGLFL TASNLEKPNP RCFVCRNATV PLALNVNNWT LQDLLQKLIK KDLGFEEPTI TLDGDIVWEE GSDADSEAFA VNLPKLLPQL PCGGIQHGTV LRIEDFSQDL TVDVAVTHQT VWERGDEEDD DDDTYQYVLK GSKPTASALH VPSNGALNNG VGTKVEEAED DDDIVVVMAA DAKGKRNRET NGDGPVNKRQ KMSILEADVI EIS
|
| |
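The coordinate and length fields in this record are related by simple arithmetic. The sketch below is a rough consistency check in Python, not part of the record itself. It assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it agrees with the reported 2206 bp) and that the coding sequence ends in a single stop codon. The function names `gene_length`, `coding_length`, and `gc_fraction` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Spaces and line breaks (as used in the record's 10-base blocks)
    are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    return sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
span = gene_length(124330, 126535)   # Start bp, End bp
cds = coding_length(643)             # Protein Length in aa
non_coding = span - cds              # bases not accounted for by the CDS

print(span)        # 2206, matches the Gene Length field
print(cds)         # 1932
print(non_coding)  # 274
```

The 274 bp that the coding sequence does not account for is consistent with the "Is gene spliced | Yes" field, but it is only an upper bound on intron content: the displayed gene sequence begins before the ATG that encodes the first methionine, so some of that difference is upstream sequence rather than intron. Passing the full gene sequence from the Sequence section to `gc_fraction` should give a value that can be compared with the reported GC content of 54%.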