Gene PHATRDRAFT_49082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49082 
Symbol 
ID7195319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp531230 
End bp532354 
Gene Length1125 bp 
Protein Length351 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183766 
Protein GI219127069 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCGAAGGCT GCAATCTCTG TAGAGCATTC AAGTTTCTGA GAGATCTCAA GAGCCCTATC 
CGGTAGACAA TGGCAATTCC AAAGCTTACT AAAGCAGGCT TCAAAACGTC AAAGAGGAGC
GCCCCGACAT CGATCGACTT TCGTCGAAGA GCACCACCAA TCATGATTTG TTGTCTCCTT
GTTGTGTTAA CCGCCAGAAA GCTGACGCAA CTCTATAATT TACCGCACGG TCCGACATTA
CTTCGCGGTC TCGCCAAAGA TACAAACAGG GAAACAATGC CAAACTTGTC CAGTTTGGGG
TTCAAGGACC CAGCGCCGTT CGAGCGCTTG TCTTCTCTGA CCTGGGAGCT GCACGAGTAC
TCCAAAATGA GTCGACAGTC GAGCACTCCT CCAGCCGACA GGACAAGGCC TCGGCTGTTA
ATTGCACAAA CAACCTCCGA CTTCTACAAA CCGTTATTCG ACATTGGCCG GCCATTGAAT
CAAGAGTATG CACGAGTACA CAGCCACGAC TTTGTCGTGG CTCGCGGCAT CTACTTGTTG
GACGAGACCA AGAACGAAAC GGAGGCCAGT GTCCCGGAGT CTCGAGCAAC GTACAATAAG
ATTGCTCTTT TGAGCTATGC GTTGCGCCAA GGAAAGTACG ACAAGCTGCT GATTTTGGAT
TCAGACGCTG TGATTCGTGA CTTTGACTTG GATTTCGCGA CTTACGGCCC GGATCCTCGA
GTGATGTTTT TTGGCCACCC TGTGGAAGCC CATCAGCCAG CGAACAACTG GAATGTAAAT
ATTGGAGTTG GGCTTTGGAA TCTTCACCAA GACTTGCTTC GTCCAACTTG GTACGATTGG
TACAATCGCT CAATGAGTCG AGTGTATCAT GGTATGACCG ACGAAGACCA AGAAGTCTTA
CACGATATAT TTCGTGAGAT GCCGGATGAG AGTCGCCCAG TCCGTGGCGT CAACAATCAT
TTTTGCGGCA ACGAAGGGGC TCTCATTCAG CATTTCATGA GAATGAACCA CGCCACTTTT
GCAGTGATAG AAGAATCTCG CGTGGAGATC ATGAAGGCCG CTGCCCAAGA AACAATGGCT
TTTTATAGAG CGCTCCTGGA CTCGGGAAAT AATCAAGCAC AATAA
 
Protein sequence
MAIPKLTKAG FKTSKRSAPT SIDFRRRAPP IMICCLLVVL TARKLTQLYN LPHGPTLLRG 
LAKDTNRETM PNLSSLGFKD PAPFERLSSL TWELHEYSKM SRQSSTPPAD RTRPRLLIAQ
TTSDFYKPLF DIGRPLNQEY ARVHSHDFVV ARGIYLLDET KNETEASVPE SRATYNKIAL
LSYALRQGKY DKLLILDSDA VIRDFDLDFA TYGPDPRVMF FGHPVEAHQP ANNWNVNIGV
GLWNLHQDLL RPTWYDWYNR SMSRVYHGMT DEDQEVLHDI FREMPDESRP VRGVNNHFCG
NEGALIQHFM RMNHATFAVI EESRVEIMKA AAQETMAFYR ALLDSGNNQA Q