Gene PHATR_33624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_33624 
Symbol 
ID7204069 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1398768 
End bp1400952 
Gene Length2185 bp 
Protein Length648 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186245 
Protein GI219113323 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0268673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCTTCC AACGAGACCA CTCCGCGAGA CCTTTCGGCG TGAAGAAGTT TATAATTTGC 
ACTTCTTCCC GTACCTGTTT GTTCCCCGAC CTCCAGCAGT GTGGTATGTT GTTTCTAGTC
GTTCTGATTA CACAGCCGAT TGTTGGCCTA ACGTATCTCA TTTTTTTTTG TTCTGCTCAC
ACATAAGGTA GCACTCTTAC ATAAAATTCC GAGTTACATT TAGTTTTATG AACGTTACGC
CGCTGTTCCT CCTCGTCTGC CACTTGCTGT CGGCGAAGGT TCGGGGAGAA TCGTCTATTC
GAGGAACGCA GAAAGAGGAG CAACACACTG TCGGAAAGCT ACAAGAGCAG TCTGTTGTTT
GTATGGTGGA GCAGGTGGAT GTCAAATTTT CCAATTATGA ACCCTCACCC AGTAACGATG
CGGCGAGTAC GGCGAACGAC AGCGACTCGG CATTCCTTTG TGTGACGGAC CAGGACAACG
CGGCTGATCA ATCGTTCGTC ATTGACTTGC CACCTTTGGT CTTGGAGAAC ATGACGCATC
ACGAGCATCC CGTTCTATCC ATTTCCGACG CGGTTCTGGA TAATGAAGCG GTTCAGCTTT
CGGTCATCAA GGACGCCTCT ATTGATATTG ATGATTCACC ATCCCAACAC CGTTCTCTGG
CGTCGGCAAC GGGTACTGGC CAAGTCCTGG TCTTACGGAT CACTTATCGT GGCATATCCC
CCTCTCTGTC GGCCGATCAA CTGGCCTCAC GTGTTTTTGG CTTGGGTAAC AATCCGGAAC
GTCACAGTCT TTCCAGCCAA ATCGATGCTT GTTCATTTAG GCAGTTACAA CTGAGACCAG
CAGAGGGCGA TGACATTGTA AATGGCATTG CTGAAATTTC CATTGACAAG CGAGTAGCTG
GCTCGAACTC GGTGCTGGCT TTGGATAATC TAGTGGTCGC TAACGCCACG CGACGGTTTG
GTCGATTGAG TACCCAGTTC GCTCATGTCT TATTTGTCTT CCCAAGCGCT GGGCTTTTGT
TTGGTGGACG TGGATGGTTG GCCTATGCCT ATTTCAACGG GTGGCGCTCC GTCTACAACG
ACAAATGGGG TGGTAGCTTG TCCGCCTTGA TGCATGAAGT AGGGCACAAT CTTAACTTGA
ACCACGCCGG ACGAGGTAGC CAAAACTATG GCGATGTCAC AGGTAGGTGC CTCAAGAAGG
CTTTCCATGT ACTATTTCAT CCCGATTAAC TTATCGCGAT GTGTTTTGAA CGCATGGCGA
CGGTAACTAG GCAGAGCTCA TTGATTTTGT TTCCTTCTTA TGCGGATTAG GTTACATGGG
ATGGGGAACT TCTAAGATTG GTGCTCCCAC AAGCTGCTTT AATGCACAAA AGAGCTGGGC
ACTGGGATGG TACAAAGATC GTTCCCTTTC ACTCTCTCTC CCAGATTTCC CTTGGGGAGG
ACAGGTTGCC TTCTTCGGGG AGTACGACAA GACAACGCCG GATCAACCTG TTATTCTAAG
TCTTGAAGAC GGCAGCAAGC GATTTTTTTT ACAGTACAAC CGCGCCAAAG GTATGAACGA
GCAAACCCGA GAATTTCCCA ACCAGGTTGT CCTTGTCAGT GACGAAGGCC CGGGAGACGG
AAGGTGGGGT CCGCAATCCC GGCTGGAAGG AGCTATCGGC TTGAACGAGC AAAGCACTCG
ACGAAACTTT CGAATCCAAA ACTTTGAAGC ATCGGGCTTT CCTCTCTTTA TCCGAGTTTG
TGATGAGGTG GAGGGCCCTC CCGACTTGGT ACGTCTCAGC ATTCACTTGG AAGATGGAAC
GCAGAGTGAC ACTTGCAATT TAAATATAGA GGCTTCAGCG CAGACCACGC CTTGCGATGA
CGACTTCTCG GCAGTGTTCT TCGTAGACCC CAATCGTGGG TACAAAGACT GCGGCTGGCT
GGCTCGAGTG ATGGCAAAAA GTGATTTTTG GTCTGAGGCT TTGTGTCAAG AGGGCCACGA
AGCCTACAAT GCATGCGCGG AGACATGTGG CAAATGTACA GACAAATGCG AAGACACCAC
CGGAGCTTTT TTCTATGTCA ATGCTCGTCA TGGAAACAAG GATTGTGAGT GGCTTTCGAC
GAGAACTCCT TGGCGTGAAA AATTATGTCA TGAAGGCAAC GCCGCATACG CTTACTGCCA
AGAATCCTGC AACGTCTGCG ACTAA
 
Protein sequence
MLFQRDHSAR PFGVKKFIIC TSSRTCLFPD LQQCVTFSFM NVTPLFLLVC HLLSAKVRGE 
SSIRGTQKEE QHTVGKLQEQ SVVCMVEQVD VKFSNYEPSP SNDAASTAND SDSAFLCVTD
QDNAADQSFV IDLPPLVLEN MTHHEHPVLS ISDAVLDNEA VQLSVIKDAS IDIDDSPSQH
RSLASATGTG QVLVLRITYR GISPSLSADQ LASRVFGLGN NPERHSLSSQ IDACSFRQLQ
LRPAEGDDIV NGIAEISIDK RVAGSNSVLA LDNLVVANAT RRFGRLSTQF AHVLFVFPSA
GLLFGGRGWL AYAYFNGWRS VYNDKWGGSL SALMHEVGHN LNLNHAGRGS QNYGDVTGYM
GWGTSKIGAP TSCFNAQKSW ALGWYKDRSL SLSLPDFPWG GQVAFFGEYD KTTPDQPVIL
SLEDGSKRFF LQYNRAKGMN EQTREFPNQV VLVSDEGPGD GRWGPQSRLE GAIGLNEQST
RRNFRIQNFE ASGFPLFIRV CDEVEGPPDL VRLSIHLEDG TQSDTCNLNI EASAQTTPCD
DDFSAVFFVD PNRGYKDCGW LARVMAKSDF WSEALCQEGH EAYNACAETC GKCTDKCEDT
TGAFFYVNAR HGNKDCEWLS TRTPWREKLC HEGNAAYAYC QESCNVCD