Gene PHATRDRAFT_44749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44749 
SymbolSAE2 
ID7199872 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp124330 
End bp126535 
Gene Length2206 bp 
Protein Length643 aa 
Translation table 
GC content54% 
IMG OID 
Productsumo-activating enzyme 2 
Protein accessionXP_002178935 
Protein GI219116280 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.97194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGTAAGACTG CGGTCGGCAG TGCAAATCGT GTTGCACTTC GTTCGGTCGT GACGCCGACT 
GCGTTTCAGA CCATTGCAAT CTCGTTGGCG TAGTCACAGA CATCTAGTGC GCTTCGTTGG
TGTCACCGTA AGAATTACTT TTGCTCGTTT CACACGTCTG TTGTTTTTCT ACTTCCATGA
CAACCGCATC GTCGTCCGTG CCGCACTTAC TGAGCGGCAT GGAAGCCACT CTCGGAACGA
ACATGCTGAC CAAGATTCAA AGCAGTAAAA TCTTGCTCGT GGGTGCCGGT GGGATTGGCT
GCGAATTGCT CAAGAATCTC GCACTTACCG GCTTTCGACA CGTCCAAGTC ATTGATCTCG
ACACCATCGA CGTGTCCAAT CTCAATCGCC AGCTCTTGTT TCGATCCCAG CATGTGGGCA
TGCCCAAATG TACCGTGGCT TGTCAAGTTG CCACGCAAAT GGTACAAGAC CCTTCTCTGG
TTTCGTATAC AGCCCATCAC GGGAACGTCT GTGACAACGA CACATTCAAC GTGCAGTTCG
TCCAACAGTT TGATCTCACC TTGAACGCGC TGGACAACGT CGTCGCCCGG CGTAGGGTCA
ACCGACTTTG CTTGGCCGCC GGAGTACCAT TGATTGAAGC GGGTACCACG GGATACCTTG
GTCAAGTCAA CGTCATTGAC AAGGAAAGTG ACGTTGCCTG TTACGAATGT CAGACTCAGG
AAACACAAAA GGTGTACCCC ATTTGTACCA TCCGATCCAC GCCGTCCATG CCAGTCCACA
CCATTGTTTG GGCGAAGGAG CTGTACAAAC TATTGTTCGG CGACAAAGTG GAAGAATCAA
TGTTGTTTGA GGATACGACG GCACCGGATG CCGAGCCATC GACCTACATG TCGGCGGTGT
TGAGTTTTCG TCGGGCGCGG GCTGCACGGG ACAGCGACGT CGTGCGTACC GCGGCCGGGG
AAGTTGTCAC CAAACTGTTC GTGGACGAGA TTCAGAAGCA ACTCGACATG GGCCGATACA
AGACGGCGCG CAAGACACCA GCCGTCTTGC CGACGAGTGT CATTGTGGAC GCCACCACTA
CGGTACCACC GACGGCCAAG CCGTCCTACC GGACGACGGA TCTGTGGACG CCGACTGAGT
GCGTGGCCGA GTTCATCGCG TGCTTGGAGA ATGCGGCCAC CGCAGCCACC GTCTTACCGT
CTTTCGACAA GGATGATACG CTAGCAATGA GGCTGGTGAC AGCGTCTTCG AATTTGCGCA
GTTTTGTCTT TGAGATTGAA CCTTTACAAA GCTTTTACTC GGCCAAGGGG ATTGCCGGCA
ACAGTACGTG CACGGCCATG CGAAACATAG GTATTTGATG TCTACGCCTG CTCACACACG
CCAATTTCTT TTTCCGTTTT TTCCCTGGCG TGCGTTTACA GTCATTCCGG CGATTGCCAC
AACGAATGCG ATTGCGGCCG GGTTGCAGAT CCTACAGGCC TTTCAAGTCC TCCGCGCCCA
ACTCGAAACC GGCACCAAGT CGGCCGGCAA GCTGGGTGAG TACTGCTCCT ACATTAACTG
CCTGCGCAAC TCGACGCGGA ACGGTCTCTT CTTGACAGCG TCGAATTTGG AAAAGCCCAA
TCCACGGTGC TTTGTCTGTC GCAACGCTAC CGTACCACTC GCGCTGAACG TGAACAACTG
GACTTTGCAA GACTTACTCC AGAAGCTAAT CAAGAAAGAT TTGGGCTTTG AAGAGCCGAC
GATTACGCTG GATGGGGACA TTGTTTGGGA AGAAGGGTCA GACGCGGACT CGGAGGCGTT
TGCCGTGAAT TTACCCAAAT TACTGCCACA ACTCCCTTGT GGTGGTATTC AGCACGGAAC
GGTTTTGCGC ATTGAAGACT TTTCGCAAGA TTTGACCGTG GACGTGGCGG TGACACACCA
AACGGTATGG GAACGGGGCG ACGAGGAGGA TGACGACGAT GATACGTACC AGTACGTGCT
GAAGGGATCC AAGCCGACCG CTTCGGCGCT GCACGTTCCC TCCAACGGTG CGCTCAACAA
CGGGGTGGGT ACGAAGGTGG AGGAAGCGGA GGATGACGAT GATATTGTGG TGGTGATGGC
AGCGGACGCG AAAGGCAAAC GCAACCGGGA GACGAACGGG GACGGCCCCG TGAACAAACG
GCAAAAGATG TCCATTCTTG AAGCCGACGT CATTGAGATT AGCTAG
 
Protein sequence
MTTASSSVPH LLSGMEATLG TNMLTKIQSS KILLVGAGGI GCELLKNLAL TGFRHVQVID 
LDTIDVSNLN RQLLFRSQHV GMPKCTVACQ VATQMVQDPS LVSYTAHHGN VCDNDTFNVQ
FVQQFDLTLN ALDNVVARRR VNRLCLAAGV PLIEAGTTGY LGQVNVIDKE SDVACYECQT
QETQKVYPIC TIRSTPSMPV HTIVWAKELY KLLFGDKVEE SMLFEDTTAP DAEPSTYMSA
VLSFRRARAA RDSDVVRTAA GEVVTKLFVD EIQKQLDMGR YKTARKTPAV LPTSVIVDAT
TTVPPTAKPS YRTTDLWTPT ECVAEFIACL ENAATAATVL PSFDKDDTLA MRLVTASSNL
RSFVFEIEPL QSFYSAKGIA GNIIPAIATT NAIAAGLQIL QAFQVLRAQL ETGTKSAGKL
GEYCSYINCL RNSTRNGLFL TASNLEKPNP RCFVCRNATV PLALNVNNWT LQDLLQKLIK
KDLGFEEPTI TLDGDIVWEE GSDADSEAFA VNLPKLLPQL PCGGIQHGTV LRIEDFSQDL
TVDVAVTHQT VWERGDEEDD DDDTYQYVLK GSKPTASALH VPSNGALNNG VGTKVEEAED
DDDIVVVMAA DAKGKRNRET NGDGPVNKRQ KMSILEADVI EIS