Gene PHATRDRAFT_47594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47594 
Symbol 
ID7202810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp218571 
End bp220370 
Gene Length1800 bp 
Protein Length599 aa 
Translation table 
GC content62% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182027 
Protein GI219123428 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGGACA CACAACAACG ACACCGACAA CCACCACCAC CACAGCGACG ACGACGACAA 
TGGCGGACGG GTGTGACCCT GGCGTGGGTG AGTCTCAGTC TCCCGTCCCG GGCCGCGTGG
ACTTCCCCCC GTCTACGTCG CGTACGACGG GCACTACCCA CCTCTCTTTG TCCCCGCTAT
ACCCTATTGT CATCATCATC ATCATCATCG TTGCAAGCCG CCGAACAGTA CAACTCATCT
CCCCCCATCG ATCCGCCCGG TCCCTACCGT GGCGTCTTTC ACGAGACACT CGTCTTTCCC
ACCCACCGTG AACTCCTGGC GTTGGCGTAC GGCGAGTCCG TCTCCAGTGC ACCCTTTGCC
ATTCCCACCC ACGACGGACC GATCTACCGG TTTCGGGTCT CCGTCTATCC CCGCGGCGGT
GGACACGCCG GATCCGCGGA CCCGCGCGGC TGGGGACGAA AGCGTGGCCC GGAACGCGTC
GGAGTGTATT TGCAATTCTT GCCGGATGCA ACGGACGATA CGGTCGATGC GTCCTTTGTC
TTTACTCTTC GGGGACGACA AGACCCGAAA CCCTTTGACG TCGAATGGCG GGCCGGAATG
CGGTTCGTCA GTCTCGAACG ATCCCGTTTG GCCCAGGGAC GAGCCAACGA CTTTGGTGCC
CACCTATTGT CCACAACCAT GCTCCGAGAC TTGCTCGGGG GAGCCGACGA TGACCATGAC
GGCACGACGT CGCCGTCCTT GCACATTCGA GTTACCGTGT CCCTGCACGC GACGACCGTG
CCCACTGCCG GCGTCGGGGA TTCTCGGCCG TGGGCGCCGA CTCGTCTCCT CGACGACATT
CGGAAAATCG ACTCGGCGCC ACCGTCCACC AACAATATTG CCACCACCAC CAACGAACGG
GTCCGGGTCG GTACCATTGT AGTCCCCGTC CTCCAAAAGT TGGCGCAACG ACCCCGCATG
TTTCAACAAG GCGCCTACCC CGGCGTCGAA TACCGTATCC TGCGCATCAT TGACCCGCAC
ACCAACCGTG ACCTCTTTTA CAGTCAACCC GGTGCCGACT ACGAACTCAA GCCGGTTTAT
CCCCTTGTTC GGCAGCTCGA GCGCCCTTGG CCAGTCCGCG TCAACGAACG CGATATTCCC
AAACTCCTCA CCCCCACCAT GTACAATACC GTATCGGCCG TGGGATCGCT ACTCACCGCA
CTCACCGGCC TCCTCGTCGC ATTCGTGCTC TCCCAAGCCG TCTCGCTCTT TGTCATTCCC
AGTCTCAGCA TGGCCCCCAC GCTCGCCAAA GGCGACGTCG TCCTCGTCGA CAAACTCACC
CCGCGCTTCT GGGGTCCCCG GACCAACATT CCCGTCGGCG ACGTGGTCTT TTTTCACCCT
CCCGAACCTC TCCAAGACAT GGTCGTCCGC AGCACGGGCC GCCGCTTGGC CCCTCGCGAT
TTGTTCGTCA AACGCGTCGC CGCCGGACCA GGGGACGTCC TCACGGTCGA CCCTTCCGGT
TCCGTCCGCG TCAACGGCGC GACGCCAGCC GTTGCCCGCG AAACCTGCGA AGCGGAACCC
TTGCGCTTGA TCGAAGCCTA TCTGAAAAAG GCGTCGCCCG ACAATCCGGA CGGGGCCAAC
GTACGGATCG GACCGGGACA AGTCGCCGTC CTCGGGGATT GTGCGTCCGT ATCGATCGAC
TCGCGTGTCT GGGGACCACT CCCGCAAAAC GATATTGTGG GCCGGCCCGT CGTGCGGCTA
TGGCCCCCTT CGCGGTGGGG ACCCGTCCCT GGACTTTTGC ACGCACCGGA TGCATTGTAG
 
Protein sequence
MVDTQQRHRQ PPPPQRRRRQ WRTGVTLAWV SLSLPSRAAW TSPRLRRVRR ALPTSLCPRY 
TLLSSSSSSS LQAAEQYNSS PPIDPPGPYR GVFHETLVFP THRELLALAY GESVSSAPFA
IPTHDGPIYR FRVSVYPRGG GHAGSADPRG WGRKRGPERV GVYLQFLPDA TDDTVDASFV
FTLRGRQDPK PFDVEWRAGM RFVSLERSRL AQGRANDFGA HLLSTTMLRD LLGGADDDHD
GTTSPSLHIR VTVSLHATTV PTAGVGDSRP WAPTRLLDDI RKIDSAPPST NNIATTTNER
VRVGTIVVPV LQKLAQRPRM FQQGAYPGVE YRILRIIDPH TNRDLFYSQP GADYELKPVY
PLVRQLERPW PVRVNERDIP KLLTPTMYNT VSAVGSLLTA LTGLLVAFVL SQAVSLFVIP
SLSMAPTLAK GDVVLVDKLT PRFWGPRTNI PVGDVVFFHP PEPLQDMVVR STGRRLAPRD
LFVKRVAAGP GDVLTVDPSG SVRVNGATPA VARETCEAEP LRLIEAYLKK ASPDNPDGAN
VRIGPGQVAV LGDCASVSID SRVWGPLPQN DIVGRPVVRL WPPSRWGPVP GLLHAPDAL