Gene PHATRDRAFT_40744 details

Gene Information

Locus tag: PHATRDRAFT_40744
Symbol:
ID: 7198616
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 282788
End bp: 284616
Gene Length: 1829 bp
Protein Length: 583 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002184770
Protein GI: 219129173
COG category:
COG ID:
TIGRFAM ID:
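
The gene length follows from the start and end coordinates when both are treated as inclusive. The short Python sketch below checks that arithmetic against the values in this record. It assumes, as a labeled assumption rather than something the record states, that the gene sequence includes the stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 282788
end_bp = 284616
protein_length_aa = 583

# Inclusive coordinates: length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene sequence ends with a stop codon, so the
# coding portion spans (protein length + 1) codons of 3 bp each.
coding_bp = (protein_length_aa + 1) * 3

# Whatever is left is not accounted for by coding sequence.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)  # 1829 1752 77
```

Under that assumption, 77 bp of the 1829 bp gene are not explained by the 583 aa protein, which is consistent with the record marking the gene as spliced.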


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGCGTA CTCGATATAC ATTGATAGCA TGGGCTGTCT CAACTCTGCG AGTCTCGCAG 
TCCTTGGCTC CACCGGACCC CGGTTTTGTG GGTGGAGGTC GAAGTTGGCA AGACGTCCAC
GAGTTTCGAG CCATGCACAA TATTACGTTC AAATACGAAC CGTTACATCT ACAAAACGAA
CATTGTCGCT ATTTGACTGA AGCCGAATGC CAACACGACG ATGAAGCTTA CTGGCAATCA
AAATTCAGAC CCCAATCCTC TAAGGAACGA CGACTGAACC CTTCAATTGG TACTTTCCGC
GCTTTGGTTA TTTTAGTACG ATTCACAGAC CACGCCAGCC GACAACTTCC CAGCCCTGCT
TATTTCGATG AACTCTTTAA CGGAGCTAAG GGCTCGGTCA ATGAAGTGGG ATCTGTACGA
GAGTACATGC GATTCAATAG CATGGGAAAG TATAATGTTC AGTTCGATGT ACTGGACTGG
GAAAATGCGG AGAATACCGA ATCGTTCTAT GCCGAAGGGA AATCAGGCCG AGTAGGAAAT
GTTCGCATTC AAGATATGTA CGGTTCGGTG TTGGACAAGT TGGACAGGGC AGGAAAAATC
AATTGGTTTG ACTATGATAT TGGCGGTGAC CCTGAAAATC CTGAGTGGGG CGACGGACTG
CTTGATCATA TAGTTGTTGT ACACTCGGGT TACGGAGCAG AGCAGTAAGT ATAGAGAATG
GTGCGGCTTG ACACCTCAAT TCCCAACAAT TAGGCTCACG CAGCCTTCTT TACTCCATTT
GTTTTGTAAG CGGTGACAAA CAGTGCCTTC CAGGTAGCTA CCTCGATCGC ATATGGTCAC
AAGGATCGGC AAGTAGTAAC GGAGGATGGC GATCATTGGA TGGTAATTTG GAAATTGGTG
GGCATACGAT CGCGTCAGCT TTTGCGAATC CGCGCTGCGA CAGGAACAAT GACTTTCAGC
TTCTTATCGA ACCGAACACA ATGGGAGTCT TCACCCATGA ATATATGCAC GGCTTTCGAA
TGATCGACCT ATACGACAAC GACGGAGACA GCGCGCCAGT TAGGCTTGGA GGGGTGGGGC
ATTTCGACAT CATGTGCAAT GCCTATGGAT GGTTTCGCTC TGGCACAATA CCGGGCTATG
CCAGCCCGTA CAGCAAAATG ATCGCTCAAT GGCTCTCACC TATCGAAATC ACTATGGATG
GAGTGTACGC TGTCCAACCA GCAGAAATTT CCAGTCAAAT TTACATGATC AGTACACCAT
ATCCAGCGGG CGAGTATTTG CTAATTGAAA ACCGGCAGCC GCTGAAATGG GATAAAGACT
GGCCTGGACG AGGCATCGTA ATATATCATA TAGACGAGTT GGCTCCGCGA CAAACTGCGC
GAGGATACCC AGGAGGACCT GGATGGCCAA CCGATCACTA TCAAGTCGCG GTTGTCCAAG
CGGACGGCAA CTTTGACCTC GAAAAAGGTG AAAATGAAGG AGACGAGGGC GATTTCTGGA
CGCGCGGCAT GACATTAGGA GCCGACACGA ACTCGCAGCC AAATACGGCA GCATACCAGA
GCGGAAATCT CCGGTCAACT GGGATCTCTA TCACTATCTT GTCCGATCCT GGTTTCATCA
TGAACTTTCA AGTCGAAGGG TTGGGCGGAA TGCGAGCTCC TGGGCAATTC TGGGACGACG
ATGAGTCACC ACTGGCGAAC AGCGCCCCCG ATTCTATTCT ACCCGTGTCG ACCGATCCTG
GCGGTGGGAC GGGCAAAACG CTGGCCTGGA TTTTCTCAAT GATTGGGGGA CTATCGCTGG
TTGTCGGTCT TATCGCAATA CTACTATAG
 
Protein sequence
MVRTRYTLIA WAVSTLRVSQ SLAPPDPGFV GGGRSWQDVH EFRAMHNITF KYEPLHLQNE 
HCRYLTEAEC QHDDEAYWQS KFRPQSSKER RLNPSIGTFR ALVILVRFTD HASRQLPSPA
YFDELFNGAK GSVNEVGSVR EYMRFNSMGK YNVQFDVLDW ENAENTESFY AEGKSGRVGN
VRIQDMYGSV LDKLDRAGKI NWFDYDIGGD PENPEWGDGL LDHIVVVHSG YGAEQLTQPS
LLHLFCSYLD RIWSQGSASS NGGWRSLDGN LEIGGHTIAS AFANPRCDRN NDFQLLIEPN
TMGVFTHEYM HGFRMIDLYD NDGDSAPVRL GGVGHFDIMC NAYGWFRSGT IPGYASPYSK
MIAQWLSPIE ITMDGVYAVQ PAEISSQIYM ISTPYPAGEY LLIENRQPLK WDKDWPGRGI
VIYHIDELAP RQTARGYPGG PGWPTDHYQV AVVQADGNFD LEKGENEGDE GDFWTRGMTL
GADTNSQPNT AAYQSGNLRS TGISITILSD PGFIMNFQVE GLGGMRAPGQ FWDDDESPLA
NSAPDSILPV STDPGGGTGK TLAWIFSMIG GLSLVVGLIA ILL
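
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch of that step; the helper names `join_blocks` and `gc_percent` are hypothetical and not part of any database tooling, and the example input is only the first two blocks of the gene sequence.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, as displayed in the record.
example = "ATGGTGCGTA CTCGATATAC"
seq = join_blocks(example)
print(len(seq), gc_percent(seq))
```

Pasting the full gene sequence into `join_blocks` should give a string whose length can be compared with the Gene Length field, and `gc_percent` can be compared with the reported GC content.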