Gene PHATR_46782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_46782 
Symbol 
ID7204538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp441122 
End bp442302 
Gene Length1181 bp 
Protein Length352 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185703 
Protein GI219120943 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.211133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAAAATTCG AAGTTACAGT TACACAGTTA GTTACATTAT TTAGTCAAGT CAACCCAGAA 
AGCAGCAAAC CTGACAATCA CAGAACAGGT TTATTGCGGA ATGTGAGATC TCCATTGTTG
GTATGATTGA TGACAACACT TCAGTTGGCT CAAAGAACTG GTCCGTATTC AAACAAACTC
AAGTTCGATA CAAACGCCAG GTTGACGCTC CTGATCGGCT GGACGATGTA GATGACTTTG
TGGATTTCGG CCGAGCCGAT GGGCGGATTC AGCGCATTGT CGTACCAAAA AGCGAAGATT
TTGCTTTTTA CAAAGGGCCC GTCTATGGAG TAAATGAGTT CCCTGGATTC TTATACGCAC
CGCAGGCGCT TTCTGAACTG CTACAAGCAG AGCTTTCCTT TTTGGCGGTT TCTTCGTTTT
GTGAACGCCC TCACTCAACC AATATTGACA AGGTCCCTTG CAAAAATTGG GAAATAGACG
ATGGACAACG ATGCATGTGG GAAGAATGGA AATTAGAGCA AATGGAAACT TATACAGAGG
CATCCCAAAT GACTTCCAAA AGTTCGTCCA GACCAAAGTA TAGAAGTTTC AGAAAGCTAT
CCTGGGCTAC GATGGGCTAT CATTACGACT GGAATACTCG ATCGTACAAT GAAAAGGCAA
AATCACCGAT GCCAAAATTG TTGGAACGGA TTGCGGAAAT ATTCGCTGCA ACGTCTCTTC
TTGTCGACGG ACAGGATCCA TGTTTCACGG CTTCAGCCAG CATCGTCAAC TTCTACACGC
CCAAGTCCAT GATGGGTGGA CACCGGGATG ATTTAGAGCA TGCTCTGGAC AAACCAATTG
TTTCTATTAG CTTAGGACGA CCGGCCGTAT TTCTGTTGGG TGGAAACACC AAGGATGATC
AACCAGTAGT AGCGATACTA GTTCGACCGG GAGATGTTAT GATGATGGGA GGGGCATCCC
GGTTGCGCTA TCACGGAATG GCCCGACTAC TGCCTACGAC CGGTCTACCC TCAGTCGAGA
AAGACCGTGT GCCAGACTGG GATTTGCAGC TTTCTGCAAA ATCGTTAGGA AAGGAAGCGG
AACTTTCGCA GTTTGAAGAG GACGACCGAA GGGCTTTGGC ATCTTTTCTG GAACAACATA
GAATCAATAT CAACGTTCGC CAAGTATACT CCGGAACGTA G
 
Protein sequence
MIDDNTSVGS KNWSVFKQTQ VRYKRQVDAP DRLDDVDDFV DFGRADGRIQ RIVVPKSEDF 
AFYKGPVYGV NEFPGFLYAP QALSELLQAE LSFLAVSSFC ERPHSTNIDK VPCKNWEIDD
GQRCMWEEWK LEQMETYTEA SQMTSKSSSR PKYRSFRKLS WATMGYHYDW NTRSYNEKAK
SPMPKLLERI AEIFAATSLL VDGQDPCFTA SASIVNFYTP KSMMGGHRDD LEHALDKPIV
SISLGRPAVF LLGGNTKDDQ PVVAILVRPG DVMMMGGASR LRYHGMARLL PTTGLPSVEK
DRVPDWDLQL SAKSLGKEAE LSQFEEDDRR ALASFLEQHR ININVRQVYS GT