Gene PHATRDRAFT_45122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45122 
Symbol 
ID7200188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp265297 
End bp266662 
Gene Length1366 bp 
Protein Length421 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179169 
Protein GI219116749 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAATCACTGC CGCCGCTCTA GTGCCGTCGA ATAGTGAAAC CATTCGTTTT ACGTTTCAGT 
AAAGAAAGGT TCAGTATGAG TAGCGAAGAC ACGAGCAGTA GCGCTAATAA TTCCGAAGAA
GCTGTGGCTC CCGCCGAAGC CCCGGCGGAG AATGTGGCTG CGCAACAGGA GAGATCCGAA
CGCGAGCAAG TGCTCGAGCA GTACAGAGCA AAAATCCGCG AGCATCGTGA GGTGGAAGCC
CGGTTGAAGC GGATGCGAGA AGATGCGAAA GGTCTGCAAG GGCGTTTCCA AAAGACAGAA
GACGATTTAA GCGCCTTGCA GTCGGTAGGA ATGATTATTG GGGACGTCTT GAAGCGTCTT
GATCCTGAAC GATTCATTGT CAAGGCCAGC TCTGGACCAC GCTATGTTGT TGGTTGTAGG
GCTCGTCTGC AGCACAATCT TCTCAAGCCA GGCACTCGTG TTGCTCTCGA TATGACGACC
TTGACGATCA TGCGAATTCT CCCTCGCGAG GTGGACCCAA CTGTTTTTCA CATGCAAGCC
GGCGAAGAAG AAGGTGGCGT TTCCTTTGGC GACATTGGTG GACTCAATGA ACAAATTCGT
GAGCTCCGGG AGGTCATTGA ACTTCCCCTG ACCAACCCCG AGCTATTTAT CCGTGTTGGA
ATTAAGGCTC CGAAGGGTGT CTTACTCTAC GGACCTCCCG GAACGGGCAA GACACTTCTG
GCCCGCGCAC TGGCGTCGAA CATTAGCGCT ACCTTTCTCA AAGTAGTCGC TTCCGCTATT
GTCGACAAAT ACATCGGCGA ATCCGCCCGT ATTATTCGCG AGATGTTTGG TTTTGCCAGG
GATCATGAGC CCTGCGTGAT TTTCATGGAC GAAATTGACG CCATTGGTGG TTCCCGTTTC
TCAGAGGGTA CCTCTGCAGA CCGAGAAATC CAGCGTACGC TGATGGAACT CTTGAACCAA
ATGGACGGCT TTGAAGAGCA AGGTCAGGTC AAAATGGTCA TGGCCACCAA TCGCCCGGAT
ATTCTCGATC CAGCCTTGCT GCGTCCCGGC CGCCTCGATC GCAAGATTGA AATCCCAGAA
CCCAACGAAT CGCAGCGGCT GGAGATTTTA AAAATTCACG CGTCCGGCAT TACCAAAAGG
GGTGACATTG ACTTTGAATC CGTCGTGAAG CTCGCGGATG GATTGAACGG GGCGGATATG
CGGAATGTAT GTACCGAAGC GGGATTGTTC GCCATCCGGT CGGATCGAGA TTATGTACTC
GAAGAAGACT TTATGAAGGC AGCCCGGAAG ATATTGGACA ACAAGAAACT CGAATCCAAA
CTCGACTATA GCAAAGTGTA AATTGTAAGG CAACTTGCAT AAGTCC
 
Protein sequence
MSSEDTSSSA NNSEEAVAPA EAPAENVAAQ QERSEREQVL EQYRAKIREH REVEARLKRM 
REDAKGLQGR FQKTEDDLSA LQSVGMIIGD VLKRLDPERF IVKASSGPRY VVGCRARLQH
NLLKPGTRVA LDMTTLTIMR ILPREVDPTV FHMQAGEEEG GVSFGDIGGL NEQIRELREV
IELPLTNPEL FIRVGIKAPK GVLLYGPPGT GKTLLARALA SNISATFLKV VASAIVDKYI
GESARIIREM FGFARDHEPC VIFMDEIDAI GGSRFSEGTS ADREIQRTLM ELLNQMDGFE
EQGQVKMVMA TNRPDILDPA LLRPGRLDRK IEIPEPNESQ RLEILKIHAS GITKRGDIDF
ESVVKLADGL NGADMRNVCT EAGLFAIRSD RDYVLEEDFM KAARKILDNK KLESKLDYSK
V