Gene PHATRDRAFT_49212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49212 
Symbol 
ID7195519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp242101 
End bp244243 
Gene Length2143 bp 
Protein Length561 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183838 
Protein GI219127221 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCCAACACTG GCGTGTTATT TCCTTGTGGA CTACGTACGA GACTTTTTCT CTCTTCCGTA 
TCGAGGTCTC GACGATTGGA TCGACGAGAT TATCGCTACC GAACGTGAAA AGTGGAGAGA
AAGGGGTAAT TTGAGTACAC GGAATCGCAG TTTCAACCAC AGGTTGCTGC TAGTCTCTTT
CCGTTGTTTC ACCGGTACTC TGCAAGGTAT GCGGCAGTGC TGAAGACGAG GTGATTATTT
CTTTACAGTG AGTCATCCGC CGGAATATTT GGACAGTAAT AGAATCATAA GTATGAGCAG
CAACAACAGC ACCGATTGGT CGCGCAAGCG CAGTCGGGGC GAGGAAAAAC TAGACGAGAA
GCAGGGAGAC AAAGATCTCT CTTCCACGGC TCCTCGCGAT CCCCATTCCC CTCCGCTTGG
AGGCATTGGA CCTCCTCCTC CTCCACAGGG AGGTCCTCCA CCAGGAAATT CTAGAGGAGG
CGGCCCCCCG CCCCCGTCTC ACCAAGGATA CGGAGGATAC CCCCCGCCGC ATATGGGTTA
TCCTCCTCCT TCCGGTCCGG AAGGACAGGC CCCCTATCCT CCCAGAGGCT ACGAAAACGC
GCCTCCTTAC CCAGGCCCTG GGTATGGACA GCATCCTCCG GGATATCCTC CCTACGGAAT
GCCACCTTAC GGCATGTACC CGCAATACCA GCATTCCATG TACGGACCCC CTCCTCCGCA
CCACCCCTAC CCACCGCACG GACAACCCCC ACAGCAACCC TACTATGGGG ATCCCAACAT
GCCCGGCGGG GGCAACAGCA ACGGACCCCA TCCTCCCTCA TCACGCTACG ATGCACCACT
GGAGGAACGG AGCGGACCGT ACGGTGGCAA TGCCGCTTCG GGTGGTCCCA TTGCACCCCG
CAGAGGAAGT GGCGTCGATC GGAAACTATC CCGGGCCTCG GCCGAGTCCT CCCGTGCCGA
TGACGACGGA GACACGGAAG GCGATGACGA CAACATTGTC CCTGGCGCGG TGACGGCGCG
TCTCAAAACC TACATCAAGC CGCGGACTCC GTCCACTCGC GAAGTTCTGG ATCGCCGAGC
ACGCAAGAAT GCACAGTCTC GAGCACGCGC GGCGAAATTG CGTTTGCGCA TTGCGGAAAT
TGAAATGAAG CCAGAAGACG AACGCACGGA AGAAGAAATT CATTTATGGC AACAGTACGA
ATCAAGGAGG CAGCGGAAGA ATGATCGGTC CCGCGAACGC GCACTGGAAA AGAAAGAGGA
AATTGATCGC ATTTTGGCCA AGCCCGATAA GAAGCGAACT AAAATCGAAC GCCAATTTTT
GGAAACCGCT TTGTCAGCGA AAAAACGTAA AAACGAAGGC GATCGATTGC GGCGACAGAG
ACTTAAGGAG CTGGGTTTGG CAACGAAAGG AACTGGAATA AAACCTGGCA TTAGTGCTCG
TGGCCCACTT CCGCCTCAGT ACCAAGGCAT GGTGAATCCG CCGCCGCACC ATCATCCCAT
GTACGGACAG ATGGGCGGTA TGGGCGGACA CCCGCATCAC CCAATGGGAG ATATTCCCAT
GTCACCGCTA CCCAGCATGC CCCACCATCA ACACCATCAG TATGCTCCCA TGCAGTCGCC
ACACTTTGGA TCGCCTACTA TGATGACCCA CCCGCATCAC CAACCATACG GCTACCCGAG
TCCGCAGCGC AGAGGCCCTC ACGGAACCAC CAGTCGCCAT GAGGGTGGTA TGCCCTACAT
GCCACCAGCG CAAGGCTACG ACGCAAGTCC GCCTTCACGT CATCCTCCTC CGGTGCAACA
GCGTCGCAAT CCGGACGGCT CGATGAGTAT TTCGATTGGA GGTAGAAGTG GACAGCAACA
AGGAGGGCCT CCTTTTGGCG GGGAGGTCCG AGGAGATGAC ATATCAAATA TGATGATGGA
CAACGACAAC CGTGGTGGAT ATGATCGGAG TGGACCAAGA GATATCAAGG ACGAGTAGGT
AAAGACGCTG CGTGCGTTCT GAATGTTTCA AAAACAAGTT TGTTTTGTGA AAGAGTGACG
AAGCCCTCAG GCATTGTTGA ACAAAGCTTA AGGGTAGAAT TTAATGTTCG AGACCATTGT
GGCCTCGCAA CTAGTATTTA TGTAAATCTG TTTCTGCAGA GTT
 
Protein sequence
MSSNNSTDWS RKRSRGEEKL DEKQGDKDLS STAPRDPHSP PLGGIGPPPP PQGGPPPGNS 
RGGGPPPPSH QGYGGYPPPH MGYPPPSGPE GQAPYPPRGY ENAPPYPGPG YGQHPPGYPP
YGMPPYGMYP QYQHSMYGPP PPHHPYPPHG QPPQQPYYGD PNMPGGGNSN GPHPPSSRYD
APLEERSGPY GGNAASGGPI APRRGSGVDR KLSRASAESS RADDDGDTEG DDDNIVPGAV
TARLKTYIKP RTPSTREVLD RRARKNAQSR ARAAKLRLRI AEIEMKPEDE RTEEEIHLWQ
QYESRRQRKN DRSRERALEK KEEIDRILAK PDKKRTKIER QFLETALSAK KRKNEGDRLR
RQRLKELGLA TKGTGIKPGI SARGPLPPQY QGMVNPPPHH HPMYGQMGGM GGHPHHPMGD
IPMSPLPSMP HHQHHQYAPM QSPHFGSPTM MTHPHHQPYG YPSPQRRGPH GTTSRHEGGM
PYMPPAQGYD ASPPSRHPPP VQQRRNPDGS MSISIGGRSG QQQGGPPFGG EVRGDDISNM
MMDNDNRGGY DRSGPRDIKD E