Gene PHATRDRAFT_42501 details

Gene Information

Locus tag: PHATRDRAFT_42501
Symbol:
ID: 7196682
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 233806
End bp: 236993
Gene Length: 3188 bp
Protein Length: 755 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002176550
Protein GI: 219109591
COG category:
COG ID:
TIGRFAM ID:
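
The Start bp, End bp, and Gene Length fields above are consistent with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic follows; the function name `gene_length` is illustrative only:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length in bp.

    Assumes 1-based, inclusive coordinates (an assumption; the record
    does not say which convention it uses).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information fields above.
print(gene_length(233806, 236993))  # 3188, matching the Gene Length field
```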


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGAAAATTTC CTACGGCACC CAAATCCATC CTCTTTGAGT AAGCAGTAGC ACTCTCTCTT 
TTACCACTGC CTCTACCACT ACGGTCAGCG TTCTTACTAT CGCTTCCCAC ACCAATACCA
TTATCAAGCA ACAACGGTAC ACACACAGCT AGAAGCAGCA GGATCATTCC AGGATCATGA
CGAAACAGTC TTCAACACTC GCTCGTCTAT TGCTCTTACT CGCGCTGTGG TGCGATCCCA
CGTACGGTCA GAACGACTGT GCTCTGTCCT TTTGGTTGTT CCCTTCACAG CAGGACTGTC
GACTCCGCTC GGGCGATCCG TCCGCCATCG AAACCTTCGT TGCCGACGAC GTTTGTCGTG
TGATGAGTAA CCCATCTCCA TTCTTGTTGG GGCGCTACAC GGCGAGTTGC GTCTCCTCGG
ACACCGTGCG GATATCGCAG TCCGGCTGTA CCCGATCGGA CTGTTCGAGC ACGTCAGGTG
GCAGCGTTTG CGATCGCGAC CTCACCAGCG TTTCCTCCTT TTACTCGCTT CTCAGTACGC
CTGAGTACAA CGTTCAGGAC CCCGCGACAC AATCCGGAAC TTACCAGTGC TTTACGCTTC
GCGGAGATTC GGAAGCCGTG ACCTTCGCCA TTTTTGGAGA TTGCGGCGCG TGTTTAGGGG
ACGGAGGCGA ACCCATGGAA TCGTCACCAT CCCTCGCCCC CGTGGTCATG CCCGTAGTCA
TGCCGACCAA TCCACCAACA CCGACAGGAC AAGCGTCGGT GGCCCCCGTA GGCATGCCGA
CGCCTCGCCC GACGGCAGTT CCCGTAGCCA ACCCAGTGAG TGCCCCCATC AACACTCCCA
GTTCAAGCCC TGTCGGGACC ATGCCAGATC CAACGATGGC ACCTTTCGGG ATGACCCCGC
GACCGACGGC TTCACCGGTC GCATCACCCA CGCGAGCTCC GACGTTCGCC GAAACGGAAG
AACCGACGGA AGATGACACA CCGGTCGGCA TGTCGATTCC GCCGACAGGT ACAACCCTAC
CACCGACCGG ATCGACACCA CCGCCATCCG TTGGTGGGAT GAATACATCG GCACCTCCGA
CCTCGAGTGT TTCGATCCCC GACACTCCTA CAGAGATTCC AAATATGACT GTTTCAACTC
CACCGGTCGC GACTACTCCC ACTCCATTAC CTACCCCTCA AGGAACCAGC AGCGACTTGG
TCGGACTTGT CATGCTGTTG ACCAATACGA ACGGCGTCCT GGGGGAAAGC TCCACGATAG
CCTGGGAATC CGCCACTGCT GCTCACATCA CCAACACGCT TGCGAGAGAG GCCCCCCTCT
CCAAGGCCAC GGTGGAGGTT GATGTTGTTA GCCAAACCCG AATGTCGCCC AACTCGTCGC
TCAACGGTGT CCGTCGAGTC CAAGAAGTCG ACGAATTACG GCCACTTCGG GTGGCCTTTG
ACGTCGTGGT CCGATTTCTC GCAACGACTT CGGCGCCCAA CGACGCTGAA GCCGCTACTT
TGTTGGGGGA AGCCTTTAAT AGCCAAGAAG ATCGGGAGCT CTACATAAAT AGACTTAGGG
CTACCGATAA TACTGCGTTT GAGTCTTTGG ACAGAATACA AGTTCTTGTG GATGGTTTCA
CTCCCGCGGA GGAAGGATTT CAAGGAAGTA ATGACAGTGG TGGTTCAAGT ATGGGTATTA
TCATTGGGGC GGCCGTCGGC GGTATCGCGA TTCTCGTGAT AGCGGCACTG GCAGCATTCT
TCGTCTTTCG CAATCGCGAC AATCAGAACG ACCGCGTCAA TAAGAGACTC TCGGACGACG
ACGAGCACAT GCAAAATGAC GCGCGAGAGC TTTACTCACC GCCCACAGAA TCGGCCTCGC
CCTTAGGCGC TGCCAACGAA ATTTCCGTGG ACCGGCAAGA CGACGTCAGT ACACTAGGGG
ATTCCATCTT GGCCGGCATG GCGGTATTGC AGGACGGCGA AGATGAAAGG ACGGCCAGCA
TTGATGGTGG CTACGACTAC GCACAAGAGC AATTTCGTGG CGACGGACCC TTGTCGGTCT
CACGCGGTGA AAACTCCACA ATGCTGTCGT CATACCCAAG CGTAGGACAA ATGGGAGGGC
CTTCTCTTAT CGAGGACGAC GCCTCGTTTG AAGAACTGTA CGGGGACGTC GATGATACAT
CGCACCCAGA TCGCTTCGAA GTCGATGTGC CACCCGGAAA ACTCGGCATG GTAGTGGATA
CGCCCAATTA TGGAATCCCA CAGGTACACG CAATTAAAGA AGATAGTGTT TTGGTCGGGC
GCGTAAAAGT CGGCGACCGA CTAATGCACG TGGACTTGAT CGATGTGACT CGAATGTCTG
CCATTGAAGT GTCAAACTTG ATCCATAAAA AATCGAAGAG TGCCCGGGTG CTCGGGTTTG
CCCGAAAAGC ATCGCCATCG AACGATCCGT ATGCGCTATC GTAGAACCTG AGGTTGCGCT
ATGCCTGACT TGTTGACACG ATCTTTCTCG GTTCTCTTCA AGACAACAAC TTTGGATCTG
GAATCCGTAC ACTGTTTTCT TCTGTTTTAC ACTATATGCC CCCGACTCCA ATAAATCCAT
TCGCTTTCTA ACTTGTTCGC GCTGCTACTG GCGCGAAAGC GAACATCATC GTACAAAATA
ATGTTAATTC ACACTTCTCC GCCACACATT CTTTATATCT GTCCTCTTGT GAATCCAATG
CAGGGACAAG CCTTATGGTT CCAGGAAATA TACATCAAAG CCTAGTTACG GCACATTCCT
CCGGACGACT GAGCTTTTCT TTGAACCCTG TAGTCTATAG AAATGATTTC GCTTGAGCCA
GAAAATGGGA CAAGTTTTGT GAACGCAACA CGCGTTCGTT TGGAGGTACT CAATGCCGCG
CATAGCAACC TGAAATCCTT TTCACTCGTC TTTCTTGCCT TCAATGGATG TCACCGTCGT
TCTCTATAGT CGGAAAGTTG TAGGACTTAT GGCAACAATT TTATTGCAAG CACAGTTAGC
GTCAGCTAGC GATGAAGAAG GCATCCCAAG AAGTCTTTTC CTTTGGACAA CAACTATCAA
GAAAACAATG AACAATGCAG ATCAGAATCA TTAACTGTAA TAGCGATAAA CTTGAATTAG
GCAACGTCGT CATTTGCTGT CCCTGGAATA AAAAAAAGGA ATTTGGATCG CCCAAATCAC
AAATCGTA
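
The gene sequence above is printed in space-separated blocks of 10 bases, so the grouping whitespace has to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first three blocks of the record, not the full sequence:

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence, used as a small example.
example = "CGAAAATTTC CTACGGCACC\nCAAATCCATC"
seq = clean_sequence(example)
print(len(seq), round(gc_fraction(seq), 3))
```

Applied to the full block of text above, `len(clean_sequence(...))` should agree with the 3188 bp Gene Length field.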
 
Protein sequence
MTKQSSTLAR LLLLLALWCD PTYGQNDCAL SFWLFPSQQD CRLRSGDPSA IETFVADDVC 
RVMSNPSPFL LGRYTASCVS SDTVRISQSG CTRSDCSSTS GGSVCDRDLT SVSSFYSLLS
TPEYNVQDPA TQSGTYQCFT LRGDSEAVTF AIFGDCGACL GDGGEPMESS PSLAPVVMPV
VMPTNPPTPT GQASVAPVGM PTPRPTAVPV ANPVSAPINT PSSSPVGTMP DPTMAPFGMT
PRPTASPVAS PTRAPTFAET EEPTEDDTPV GMSIPPTGTT LPPTGSTPPP SVGGMNTSAP
PTSSVSIPDT PTEIPNMTVS TPPVATTPTP LPTPQGTSSD LVGLVMLLTN TNGVLGESST
IAWESATAAH ITNTLAREAP LSKATVEVDV VSQTRMSPNS SLNGVRRVQE VDELRPLRVA
FDVVVRFLAT TSAPNDAEAA TLLGEAFNSQ EDRELYINRL RATDNTAFES LDRIQVLVDG
FTPAEEGFQG SNDSGGSSMG IIIGAAVGGI AILVIAALAA FFVFRNRDNQ NDRVNKRLSD
DDEHMQNDAR ELYSPPTESA SPLGAANEIS VDRQDDVSTL GDSILAGMAV LQDGEDERTA
SIDGGYDYAQ EQFRGDGPLS VSRGENSTML SSYPSVGQMG GPSLIEDDAS FEELYGDVDD
TSHPDRFEVD VPPGKLGMVV DTPNYGIPQV HAIKEDSVLV GRVKVGDRLM HVDLIDVTRM
SAIEVSNLIH KKSKSARVLG FARKASPSND PYALS
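
The protein sequence appears to begin at the ATG near the end of the third line of the gene sequence (`...AGGATCATGA CGAAACAGTC...`). As a hedged check, the sketch below translates the first ten codons from that ATG. The record leaves the Translation table field blank, so using the standard genetic code here is an assumption, and the function name `translate` is illustrative:

```python
from itertools import product

# Standard genetic code (assumed; the record's Translation table field is blank).
# Codons are enumerated with bases in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Thirty bases starting at the ATG in the third line of the gene sequence.
fragment = "ATGA CGAAACAGTC TTCAACACTC GCTCGT"
print(translate(fragment))  # MTKQSSTLAR, the first ten residues of the protein
```

A 755 aa protein plus a stop codon accounts for 2268 bp, which is shorter than the 3188 bp gene sequence, so the remainder lies outside the coding region.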