Gene PHATR_43839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_43839 
Symbol 
ID7203966 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp198693 
End bp200260 
Gene Length1568 bp 
Protein Length409 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186013 
Protein GI219112859 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.416143 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TGTACGGCAT CGCGTCGCTG CAATTACAGA TACGTGCCGA TTGTTTCCAC ACAATCCACA 
GGGAAAACGG AGAGAAAGCA CACCGTAACA CCGATTATGA AGAATCGACG CAACAGTGAC
GAAATAAACG ACGACGCAGA CGAAAACGGT TCCTTCACAC CTCCATTTCT GCCCCTGGAA
GAATCCAATC CTGTCTCGGC AATGTCCCTC TCGTCCTCTC TCGCCGCCGC CAACACCACC
ACCGGTACAT CCACAAGCAG TACGACTGGA AATGTCTCGA CCAAACACAA TCTAAACCAA
CGCACTAAAG CCCATTCCTC GTACAATAAG GCTTCCCCGT CTCTAACTCG TTCCATGTCG
AACCGGGGTG GAATCGTCTC TTCTTGCAAC GGCAAAAAAG TCGTCTCGAT CGTCATGCTC
TCACTTCTGG CGCTCGTCAT TTGGGACGCC GTCTTGACGC CTCCGGAACG CCGCTGGATC
AAACCGGATT TCAGTGAAAC CTTCCTATTG TGGGTACAGG ATCATCCCAT CCGGGGTTTG
CTCGCCTTTC TCGTTGTCAT TGCCGTGGCC GTTGTTTTCA TGGTACCCAT CGGGACGCCC
CTCACTCTGG GATGCGGGTA CGTCTATAAG GGAGCCTACG GATGGAGACT GGGCTTGACC
ATTGCTACAG CAGTATCCAT GGCTGGATCC GCACTCGGCG CCGTCGTTTG CTTTCTCCTC
GGACGATACC TCATGCGAGA CCAAGTACGC ACCTGGATTC GCAAATATCC ACTCTTTGAC
GCTATTGATG CCGGTACGTC TCGTTGTGAT ATTTCTTCTG TTTTTTTTCC GTGTCCTGTA
TTGTAATGAC ACCATCCACG TGTCTACTGG TCCACACTCA ACCCGGTGCT TCCCCCCCTT
TTTTTTGTTC TCAGCCGCTG CCGAACACGG CCTCCGGATC ATGGCCATGC TTTACTTGAC
TCCCATTCTA CCACTCGGTC CCGTCTCATA CATGTGTGGC ACCACTTCCA TGGCCTTGTC
CAGCTTTGTC TTGGCCAAAA TTGCCTCCCT CCCACTCATG CTACTCTACG CCTTTATCGG
GGCTAGTACT GGGGCCTTGC TCGGGCAACC ATCGCAAACG CAGTTGGACG GTAGCGTCAC
GGCACAACAA CAGGAACAAC AACATTCGAC CGCCAACGAA TTCAAATCCA TCGAAGAAAA
TCAAACTCTC ATTTTGTCCG GTATTTGCTT GAGTTTTGTC ATGATTGCGG GCATTACGCA
CAATATTAAA CGAGAGCTCA ATCTGGTAAG CAAAATCAAA AGGGTCGTAT GGGCCTTGCC
GACGCACCCA GTCGTGTCCA TTCTAGCGGT AGTTAGATTG CCTCGACTAT CATTCTTATC
CACTCTGACT CGTTTGTGTT TTTTAGATTC TGGAGCGGCA AAAGAAAGCC GGCGAAAGAG
ACCACGGGAG TGACCTTATG GGTGACAGCT CGATCGGGAG CAGTAGTAGT GCCAACGTAA
TCGATGCCGA ACAATCCGCC ATTGAAATGG GTCTCAGCAA GCCGGCTCAC CGCAGACGAG
TGGCCTAG
 
Protein sequence
MKNRRNSDEI NDDADENGSF TPPFLPLEES NPVSAMSLSS SLAAANTTTG TSTSSTTGNV 
STKHNLNQRT KAHSSYNKAS PSLTRSMSNR GGIVSSCNGK KVVSIVMLSL LALVIWDAVL
TPPERRWIKP DFSETFLLWV QDHPIRGLLA FLVVIAVAVV FMVPIGTPLT LGCGYVYKGA
YGWRLGLTIA TAVSMAGSAL GAVVCFLLGR YLMRDQVRTW IRKYPLFDAI DAAAAEHGLR
IMAMLYLTPI LPLGPVSYMC GTTSMALSSF VLAKIASLPL MLLYAFIGAS TGALLGQPSQ
TQLDGSVTAQ QQEQQHSTAN EFKSIEENQT LILSGICLSF VMIAGITHNI KRELNLILER
QKKAGERDHG SDLMGDSSIG SSSSANVIDA EQSAIEMGLS KPAHRRRVA