Gene PHATRDRAFT_47839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47839 
Symbol 
ID7202995 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp201612 
End bp205660 
Gene Length4049 bp 
Protein Length922 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182176 
Protein GI219123738 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.759183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATCCATCGG GATGGTTCCA AACTGACAGT GATCGAGATT CGCGAAACAG TAAAGACAAA 
CGGCACACAT TTGCTCTTTC GTCGCGGTTT TGACCGATTC ACGACATGAA GGTCGTGACC
GCCTCCTTCC TTCTGTTCAC CTTTGCGGCG TTCTTGGTCG GCACGCACGG AAAATATGAT
AACGAGAGTC ACAAACTCAT TTTGATCCGA GGACGCCGCG GCTTTCGTCA CCGAACCGGC
TCCTGGCTCG CCGCCAGCCC CGCAAATCAA AGAAGGTGCG GTGCCAAAAA TCTGTACACC
CGAAGACCAG TACACCCGAA GACCAGTACA CCGTTCGTGT CCGGTGGAAT GCCCGAACCA
TAGTGGCTGT CCCGTTGACT CACCAACGTC GGCCCCAACA CCTCCTCCCA TTGAGACTGC
GACAAACTCT CCAGAAGGAT CTCCAACCGC GGCTCTACAA GAAACTGACT ACGGTCCGTA
TCCGGCCGGT CCTTACATTT CGGTACAACA AAACTCGGAG CGCATCACTT TGCCTGAACC
CGCTACAGTA TTGCTTTCAA TCGCCTCAAA TACAGCTTGC CCTCATACAG GTGCTGACAT
TGTCTACTGG GACGATGCCA GTACATGGGG CGCTTCCGGA ATACCCGACA CGGCGAACCA
AGACGTTGCA GTACCCAGCG GAAGTCGTGT GGTGATTCGT TCAACAATTC CTGTGGTACT
TGGTGTAGTT ACAGTACCTG CTGGCAGCAA TCTCATTATT GGTTCCGATG TGAATGGTAT
TGACATACAC GTCGCTGGCA TGGAGGTTGC GGGACGTCTC CTTGTCGGGT CCGAGACCTG
CCGCCTTGCC AACCCCGTTA CCATCACTCT TCATGGTAGC CGACCGCGGG ACGCCGTCAC
CAACGTACCA TCGGATTCCT ACAAGGGTAT CCACGTTACA GGTGTGCTAA GCCTGCACGG
GAAGCGCTAC TTCCATACTT GGTCTCGATT GGCCAAGACT GCGGAAGCAG GATTGTCTGT
ATTGATGCTA CAGAATCCCG TCAACTGGGA AGCTGGTCAA GAAGTTGTTA TTGTGACGTC
CGCCATCAAG GATTCCATCG AATATCACCA GAATGAAGTC CGCACGGTGC GAGCCGTGCA
CACCAGCCCG CCCAGTGGTG TGGGAGCAAT TGTGTATTTG ACCGAGCCCG TGGACTACAG
CCACATTGCG AACAGCAACT ACCAGGTGGA AGTCGGTTTG TTGACTCGCA CAGTCAAAGT
CCAAGGCTCC GAATCTGATT CCGAGCCGAC GGATCCCAAT CCCCTTTCTT GCACGTAAGT
ATGGGTAGTA TGCAAACATG ATCTGGTGGC ACCATAAAAG TATCAGTGCT GCTGTGCATA
AGTCTTGATT GGTCAAGTGG GATCTTACTC TCGCTATGGC TGTGCGCTTC CTTCTGTTCT
TTTATGTGAC GACAAATTCA GAACCCCACT CGACAATTGG TGGTGGATCC AGTCATTTAC
TGGGCAGCCC TGTGAGAACA AGGAGTTGAC GGGCTTTGGC GGCCACGTCA TAGTTCGCGG
AGGTAGACGA GGACGTCGAA GGCGTGGAGC TCTATTGCAT GGGACAGACC AACTTACTGG
GCCGCTACCC TATCCACTTT CATATGCTAG GAGACTTCCC AGACTGTTAC GTCAAGGACT
CGTCAATTCA TCGGTCCTAC TACCGTTGCG TCTCTCTTCA TGGCACGCAC TATACAACCA
CAACGGAGAA TGTTGCCTAC GATGTTTGCG GATACTGCTA CTATTTAGAA GACGGCGTTG
AGCAGTTCAA CACACTGTCC TACAACCTGG CCGCGCATAT TCATAGCATT GGCCCGGTAC
CATGGGGTGG TGGCCAAACG ACCGACATCT TCCAGCAGAG CACCACACTA ACACTTCCGG
CGGACGTGAC GGCATCTGGA TTTTATATTA CCAACATTCA CAATCATATC ATTGGTAACG
CCGCATCTGG GGGCTGGGCC GGCTTTGCAT TCCCGAATCT CGCCGAGGCT ATTGGCGCTC
ACCAAGGAAA CGAAGCCTTT CGGCCTGCTA CGGTGACAGG CTTGACTCTA GACGGCAACA
CAGTGCATTC TACTGGCTGG TGGTGGAACC ATGCTGGCGC TTTCTACTTT GGTGGCTCTC
TTTATTACAA CGGTGATAAG TTGGAGTACA ATCCTGGTCG AAGTTTCTCT TTTGAGCGTG
ACGATTGTCA TACGTGCAAG GTGAACAACT GCGCGCCACC GTATAACGAT TGTGTATACG
GATGTCCTCA AGACAAAAAG GACTGGCTTT GAATCACAAA TAGTAAGGCT TTCTTGACCG
CCAGAGTCGG ACTGGTATGT AATGATATGC TGTACACAAG TCTCCACAAT ACTTCCAGAC
CGGTCTGTCT CACGAAATAT GGTTACTGTG TTGCTCTGTT CTTTGTCCAA CATTACAATC
CTAGAACTTG TGGTCCGGAC AAATGGAAGT GATCGGTTTT GAGTCTCATG ATAACAGTCT
AGCCATAGAG GCGCTCTCTA GCGGCTTGTG GATCAACCAT TTGCTGGCTG TGTGCCGCAC
AGGCAAATCA CTTGGACTAC CGGAAGGTGC CACAAACAAC CGACCCCTCG AAGGGAGCGG
CTTCTTTTGG TACGATACGG GCCAGGAGCA CATCATCACA CAGTCTACTT TTCGCAATTG
TGGCTTTCGC TCGGACAATT ACAATCAGTA CAACTCTAGC GCCACTCGAG GATGCGACGA
CAGCGACATG TCTAAAGCAT GCTACAGCGA GTCGTCGATG TTTGGCTTTC TGACCCATTC
GGACGAGTTC ACGCCAAAAA TTATGCAGGG GACTCGCGAT ATTACATTCG ATAACTGTGG
CCGTCGCTTC AAGTTTACAA TAAACAAGCT GGAGACCGTA TCTGGACAAG GTCAGAACTG
GTTGGATATG GATGGGAGCG TTTCGGGCCT GAATGACCCA ACTATCATTG CATCCGGTCT
TGAGCTAGCG AAGGACTGGT GGGGCGTTGA CAATCAAGGT CTGTATTGGA TAATGTATAA
TTTTTATGCA CTCCTTTGTT TGCGAATTGC ATTCATTCTC ATGTGCGCAC TGTATCTTGC
TTTACCTACA GTTGTATACG AGCCACAGGC ACCTCTCAGG TTCATAAAGA AGAAGAACGG
CCCAATATGA TCCATGGGTC ACGTGCAGAT GAGCTGGGAT GAGAGTCTAC ACAACCAAGT
CGGCAGCACA TACTGCCGCA ATGATGGATC CGGTCTCAAT TGTAGCCCTG TTGGTTACAT
GCGGCATCTC GGTCATAAGT TTAGTCCAAC CCTGAACGCT GCGGTGAACA ATGGTCTCCC
GGTTACAGCC AACCCTGATG TGGTTAGTAT GATTGGTGGC TTCGGCTGGC TTCTGACCTT
GAATCGTGGC GCGCCTCGAC AGCTTGTCTT GTGGGACTTG GAAGTTGATC CTGGAAGTGT
GCTTCTACTG AGTATCCCGT ATCCAGCCGG CACAACATTC AACATTTGAG CTAGTGCGCC
AAGTTGGTGC GGAGATTCCG ATGGTTTTGT ATGCAACACT GACTTTGTTG CTGTGAGCTC
AGTCCAGGCA GTACGCAACA GTGCTGGTAA CATTTACCAC GTTGGCACAA ACGGTGTGCT
GACGCTGCAT ATTGTTCAGT TTTCCGGGCA GTTTACTGGC AACCCAAACT GGATCCTTCC
CAATTACAAC ACTGGGATAA AAAGGCAATT TCCAAGTTTG AGCGAGACGT GGTAGTGCTT
CCAGTACAGG AGTGGGCAAA TACACTAACA AGCTCGGCCA ATTGTGGTGG GTCGGGTGTA
TACTGCAGTG GATCGGTTGC GGCGTATGAC CCTGATGTGT GCAATTCAGG TTTTGTGCAA
GTCGCATACG ATACATGTTG CCAGTGCTCA AACCTGAGTT GATGCATGTT TGCCAACGGC
AGTCGCAACT TTTAAAAGTT TTTTAGCTGT CTGTTGCACG TGATTCTGAA TCATCCTATG
TTAAACAGTT AGGCAGCGAA ACACATTTT
 
Protein sequence
MKVVTASFLL FTFAAFLVGT HGKYDNESHK LILIRGRRGF RHRTGSWLAA SPANQRRCGA 
KNLYTRRPVH PKTSTPGCPV DSPTSAPTPP PIETATNSPE GSPTAALQET DYGPYPAGPY
ISVQQNSERI TLPEPATVLL SIASNTACPH TGADIVYWDD ASTWGASGIP DTANQDVAVP
SGSRVVIRST IPVVLGVVTV PAGSNLIIGS DVNGIDIHVA GMEVAGRLLV GSETCRLANP
VTITLHGSRP RDAVTNVPSD SYKGIHVTGV LSLHGKRYFH TWSRLAKTAE AGLSVLMLQN
PVNWEAGQEV VIVTSAIKDS IEYHQNEVRT VRAVHTSPPS GVGAIVYLTE PVDYSHIANS
NYQVEVGLLT RTVKVQGSES DSEPTDPNPL SCTHLLGSPF AEVDEDVEGV ELYCMGQTNL
LGRYPIHFHM LGDFPDCYVK DSSIHRSYYR CVSLHGTHYT TTTENVAYDV CGYCYYLEDG
VEQFNTLSYN LAAHIHSIGP VPWGGGQTTD IFQQSTTLTL PADVTASGFY ITNIHNHIIG
NAASGGWAGF AFPNLAEAIG AHQGNEAFRP ATVTGLTLDG NTVHSTGWWW NHAGAFYFGG
SLYYNGDKLE YNPGRSFSFE RDDCHTCKNL WSGQMEVIGF ESHDNSLAIE ALSSGLWINH
LLAVCRTGKS LGLPEGATNN RPLEGSGFFW YDTGQEHIIT QSTFRNCGFR SDNYNQYNSS
ATRGCDDSDM SKACYSESSM FGFLTHSDEF TPKIMQGTRD ITFDNCGRRF KFTINKLETV
SGQGQNWLDM DGSVSGLNDP TIIASGLELA KDWWGVDNQV VYEPQAPLST YCRNDGSGLN
CSPVGYMRHL GHKFSPTLNA AVNNGLPVTA NPDVVSMIGG FGWLLTLNRG APRQLVLWDL
EVDPGSVLLL SIPYPAGTTF NI