Gene PHATRDRAFT_47593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47593 
Symbol 
ID7202809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp213921 
End bp215912 
Gene Length1992 bp 
Protein Length663 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182026 
Protein GI219123426 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCAAGA AACCGGAGCC GAAACCCCAC GAAAAGGAAG TTTCCTCGTC GGATGATGAC 
GATTCCGTCG CCGAGTCCAG TCCCAAAGAA GACACCAGCT TTCCGGCTGT CGCTCCGACG
CACTCTGCCA AGGACCGCAA CACCGATTAC GACTGGTTGA TTTTTCTCGG ACCCGCTCTG
TTTGCCAAGT TCACTCCGGT TTGGACGATT TACGCCATTG CCATCGCTCG CATCGCGACG
ACGCACCTGA TTCTTTCCCT GCACTATCAG TTCGTCGACA AGGACAACTA TTTCAACAAG
CTCACGCAGA AACAGTTACG TCGGGAGAAG GATGACTACC TGACCGGATT TTACTTGCAC
ATGTACACGC AAATCGCTCT GCAGCTGATT TTCCCTTCTA TGTTCTTTAG CCCCAACGAA
CAGATCTGGA GTTGTGCCAA GGAAGTTTTC CTCTCGCACG TGCTCGTCGT CGAGCCGCTC
TACTACCTGG CGCATCGCTG GTTGCACGTG CCCAAACAAA TGAAAGCCAT GCATGGCTTT
CATCATTTGA GTATACACAC CCTACCCTCA ACGTCTTTGG TGCAAAACTT TCACGAGCAC
TTTGTCTACC TTGCCGTCTT TGGCCCGGCC TTTATGCTGC CTTTTTTGTT ACAGGGGAGG
CAGCACTGGG CCGTCGTGGG AGCCTACCTC GTTGCCTTTG ACGCCATCAA CGCCTGGGGT
CACACCAACG TGCAGATTCG TTCCTGGTTC TTGACCAGCC CCTGGTCGCC TTTGACTTAT
CTCTTTTACA CCCCCGAGTT CCATCTCGGA CACCATGCCT ACTTTAACGC TAACTACGGC
CTCTTCATGC CCCTGTGGGA TCGCTTGTTG GGAACCTACC GCGAATACCA CAAAAAGCCG
CGGGCTATGC TGCCGGCCGA TCAACAGGAC TTTGTGTTCA TCGGACACAA CGGAGGATTC
GGCCACTTTC TGACCATTCC GGAAATTTCC GTATACAACG TCTTTGACCA ATACCTGTTG
ACCGGACTCC CACTGAAACT CGAGTTTTTC CTCATGCACT TGGTGGCCCA AGTGTGTAGG
TTGTTCATGA GCTTTTACTA TTGCTCCCGG ACCTGCGTCG CCAATGAGTT CGTGGCGCGC
ACCATTGTGT TGGTGCGCAC GCCGTGGGAC TACATGTCCG GTCCTAGTCG CTTCGACGCC
ATCAACCGTG AAATGCTTCA ACTGATGCGG AACGAGCACC AAAAATACGG AACCCGCAAA
TTCGGTTTCG GGAATCTCAA TAAGATGAAG CAGCTCAATG ACGGCGGCAT GGATTTGACC
AATATGATTG CACAAGACGA GTACCTTCAC GACAAGAATA TTCGAGTGTG GACGGGCGAT
ACCATGACGG TCGCTTCCGT TTATAACCAA ATTGTCGAAG TTCCCAACCT GGATCGGCTC
TTTTATATCG GTGCCGGGGG TAAAGTCGGC ACGGCTGTGT GTGAGCTGCT AACCACCAGT
CGACCGGGCT TGAAAATATG CATCTTTTCA CGCCACCGTG TTCTGAATCA CCCGAATATT
TCCTACACCA ACAACCTCAG TGACATGGCC GACTACCGAG TCGTACTGGT GGGAAAAATA
TTGTCCAACG CTATGTACGA GAAAGCTTTG CGGACGGTAG ATCAGGTCCA AACACGATTC
ATGCTGGATT ACACCGTTCC GGTACTACCC ATTCCAGCCT TAGAGTCACG AGGAGTCGGA
ATGATTCGGC ATATTCGCAT CGGTCTGCTT CAAACACGGC CCAACAACGC CTTTCTCAAA
GGCCACTACG ACTGGTGTAT GAGCCACGGC GAGAATCAGA TTGTCCCGTG TCATTTCGGC
TGTCTGTTGA ATACGGTAAA TGGTCGGGAG ACCAACGAGG TGGGGGAGAT CAATCCCTTA
CAGGTCGAAC AACTTTGGAA ACAGGCCAAC GCACGAGGAT TTTACAACAT TCCCATTGAC
TATCAGACTT AA
 
Protein sequence
MCKKPEPKPH EKEVSSSDDD DSVAESSPKE DTSFPAVAPT HSAKDRNTDY DWLIFLGPAL 
FAKFTPVWTI YAIAIARIAT THLILSLHYQ FVDKDNYFNK LTQKQLRREK DDYLTGFYLH
MYTQIALQLI FPSMFFSPNE QIWSCAKEVF LSHVLVVEPL YYLAHRWLHV PKQMKAMHGF
HHLSIHTLPS TSLVQNFHEH FVYLAVFGPA FMLPFLLQGR QHWAVVGAYL VAFDAINAWG
HTNVQIRSWF LTSPWSPLTY LFYTPEFHLG HHAYFNANYG LFMPLWDRLL GTYREYHKKP
RAMLPADQQD FVFIGHNGGF GHFLTIPEIS VYNVFDQYLL TGLPLKLEFF LMHLVAQVCR
LFMSFYYCSR TCVANEFVAR TIVLVRTPWD YMSGPSRFDA INREMLQLMR NEHQKYGTRK
FGFGNLNKMK QLNDGGMDLT NMIAQDEYLH DKNIRVWTGD TMTVASVYNQ IVEVPNLDRL
FYIGAGGKVG TAVCELLTTS RPGLKICIFS RHRVLNHPNI SYTNNLSDMA DYRVVLVGKI
LSNAMYEKAL RTVDQVQTRF MLDYTVPVLP IPALESRGVG MIRHIRIGLL QTRPNNAFLK
GHYDWCMSHG ENQIVPCHFG CLLNTVNGRE TNEVGEINPL QVEQLWKQAN ARGFYNIPID
YQT