Gene PHATR_18473 details


Gene Information

Locus tag: PHATR_18473
Symbol:
ID: 7204277
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 217885
End bp: 220232
Gene Length: 2348 bp
Protein Length: 645 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002186302
Protein GI: 219113437
COG category:
COG ID:
TIGRFAM ID:
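
The start, end, and gene length values in this record can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but the reported numbers are consistent with it). The function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, ASSUMING 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as bases, hence the +1.
    return end_bp - start_bp + 1

# Coordinates copied from the record above.
length = gene_length(217885, 220232)
print(length)  # 2348
```

Because the gene is marked as spliced, this span covers more than the coding sequence alone, so the length is not expected to equal three times the protein length.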


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACCGCTATAA AGCACATCAC CTTCTCTTAT CGTACCAGAC AATTTAGCTA CTACATAGTG 
TCGATTAAGC AATACAGTCA CCAATAGTAT GATGATTGAC AGTGAGAAAG CTTCGGACGA
GACCTCTAAA GAAGGCTTTC CGACAGAGGA TGTTCCTCGT CCAAATGCCG ACGACGCACC
GGACGTACCT TCTTTTGGAC GGCTTCTCAC CTTGGCGAAA CCCGAATGGA CGATGCTTGC
CGTAGCATTC ATTTTAATGG TAGGCGCTGA GGGACTGGGT CTCTACAATC CGGTGCTTCT
TGCCGACGCG TACGACTACT TGATAAATCC ACTGCTGACG ACGTCCGAGC GTATGACCGA
GATCAATGGA GTAATGGCGT TGGTTTTGAT ACTACACGGA GCTGGTGTAG TTGGAGGCTT
CTTTCGCGTT GCCATCATGC AATCAGCAGG GGAGCGAATT GTTGCGAGAC TTCGTTATGA
TTTATACTCG TCTATCTTGA GCCAGGATAT TGCATTCTTT GACAAGACCA AGTCTGGCGA
GCTGGTTTCT CGTCTGTCCT CAGATACCAC TTTACTGCAG AAAGCGACCA GCCAAGCTGT
TCCAGAAGTG TGTGTTGGAT TCGTGAAACT AGTTGCTTCG ATTGCCATTA TGTTCTGGAT
TTCTGCTCCT CTCGCCGGCG TAACACTCGC TTGCGTTTTC CTCATCTTTG TCGTTGTGAT
TCCTTTTGGG AAATGGATTG GAGCCTTGTC GAAGAGGTAC CAGGACGCCC TGGGAAAGGC
ACAAACAAGG TCAACGGAAG CTCTCGGTGC CATGCGAACG GTGCAGTCGT TTGCTGCTGA
AGACCGTGAA AGGGCACGCT ACAGAGAAGT TATAGGTGAT CCAATGCAGT TCCCTTTTTG
GTATCCCACT GATCATAAGA AACACGAGAC GACGTACAGC GTGGGATTTT TCAAATCGAT
CGTCAACTCT GGTTTCTATT CGATCATTTT TGGTGTCGGC TTTGGTTGCT TGTACATTTC
TTTGTGGTAA GTTGGACCCT TCCAGTCTCA CGTGAATCTT ATTGTCAAGG ATCTAAAACA
CTTTATTTTC TGCGATCTGC ACAGGTTTGG GTTCAAGCTT GTCAATGATG GAGATATTTC
CTTGGGCGAT TTGACAGCGT TTCAGTCGTA TGTCTTTCAA ATTGGGGTAA GTAACTTGGA
ACAGCATTGC GATCGCCATC GATATATATC CTTCGTTCAG ATGCTAAAAT TGTTATCATC
GTAGGCATCC CTGGGGCAAA CCAGCGCTGC GATCACCCAG CTAGTCGAAG CCAAAGGTGC
TTCTGGTCGA GTATTCTACT TGCTGGACCG AGTCCCATCT ATTCCTACTC CTTTGCTTGG
GGACGACAAA AACGATGAAG AAGTACCCCC CACTCCACTC AAACCTGAGT CCATGATGGG
TGCCGTTGCG TTCAACAATG TCTCTTTTTC TTATCCCTCG CGCCCTGATC TACCAGTGCT
TCGCAACTTT TCATTATCTA TTCTTCCAAA TACAACCGCT GCATTGGTTG GGTCTTCTGG
CGCTGGAAAA AGTACCGTTG TGGCGTTGAT TCAGCGATTT TACGATGTGA CGGATGGATC
CGTCACAATT GACGGTAATG ATATACGCGA TCTGGATGTG AAATGGCTGC GTCGCCGTGT
GGGATACGTC CAGCAGGAAC CTTCCGTATT TGGTTTGTCC GTGCGTGAGA ATATTACGTA
CGGTGTCGAC CGCATGGTGT CACAAGAAGA ATTGGAGGCG TGTTGTGAAA AGGCCAACGC
GCACGACTTT ATTGCGCAGT GGCCAAACGG TTACGAAACT TTGGTGGGCG AACGAGGCAT
ACAGCTCAGC GGAGGACAGA AACAGCGATT AGCTATTGCT CGTTCACTGC TAGTCGATCC
ACGAATTCTT CTGTTGGACG AAGCCACATC TGCGTTAGAC GCGGAGTCGG AGCATTTAGT
TCAAGAAGCA ATTGACAAGG CAGCCGAAGG TCGAACAACA ATAATTGTAG CACATCGGCT
GTCCACGATT CGTCGTGCCA GCCAGATTGT CGTCGTCGAT GACCATCAAA TTATTGATGT
CGGGAGCCAT GATGCACTGT TAGAGCGATG TCCCAAGTAT CAAGATTTGA TTAGGCGTCA
ATCAGTGTTC AGTACGAAAT AAGAATTTCT GACAAGATTC GGTCCTGCAT CAATCAAACG
CGTCAGGCAA ATGGCCTTTT CTTAGAATCA GCACAATTTC CCATTTACAA CGACACTAAC
GGGGGGAGTT GGAAGCGTGC GTCATGAATA GGTTTATTCT TTAATAGTAT CCAAAACTTG
CCGCGTCC
 
Protein sequence
MMIDSEKASD ETSKEGFPTE DVPRPNADDA PDVPSFGRLL TLAKPEWTML AVAFILMVGA 
EGLGLYNPVL LADAYDYLIN PLLTTSERMT EINGVMALVL ILHGAGVVGG FFRVAIMQSA
GERIVARLRY DLYSSILSQD IAFFDKTKSG ELVSRLSSDT TLLQKATSQA VPEVCVGFVK
LVASIAIMFW ISAPLAGVTL ACVFLIFVVV IPFGKWIGAL SKRYQDALGK AQTRSTEALG
AMRTVQSFAA EDRERARYRE VIGDPMQFPF WYPTDHKKHE TTYSVGFFKS IVNSGFYSII
FGVGFGCLYI SLWFGFKLVN DGDISLGDLT AFQSYVFQIG ASLGQTSAAI TQLVEAKGAS
GRVFYLLDRV PSIPTPLLGD DKNDEEVPPT PLKPESMMGA VAFNNVSFSY PSRPDLPVLR
NFSLSILPNT TAALVGSSGA GKSTVVALIQ RFYDVTDGSV TIDGNDIRDL DVKWLRRRVG
YVQQEPSVFG LSVRENITYG VDRMVSQEEL EACCEKANAH DFIAQWPNGY ETLVGERGIQ
LSGGQKQRLA IARSLLVDPR ILLLDEATSA LDAESEHLVQ EAIDKAAEGR TTIIVAHRLS
TIRRASQIVV VDDHQIIDVG SHDALLERCP KYQDLIRRQS VFSTK