Gene PHATR_44029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44029 
Symbol 
ID7204220 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp739332 
End bp741131 
Gene Length1800 bp 
Protein Length460 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186120 
Protein GI219113073 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.492048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACTTTGTCA ACGAAAGCAT GCAAATCAGA AATTGCCTGC TACATCATCT ACCCATATTT 
TCAAAATTTT CATTTTCCTA TTACCAAATT GCCGTGCACT TTACACAGGA TTCTGACAAT
AAAACCATGT CTTTTCCTTC AATCGAACTT GAGACCCGAC AAAGCAAACA GCGGGTCACG
AGGGCGGCAA AGACGAATGA AGTCAATTCC AACGCCCCTA TATTCTTGCG GGTAAGCGAA
TGAAGGGAGT AACCATGTCC ACAACCTTCC AATCACTCAA TGATTCTCAA TTCGCTGATA
CAGAAAACGT TCACTATGAT CGATAGTTGC GATCCCTACG TTGCGGCATG GTAAGTCTGG
TAACAGCAAA TATTGAAGAC TTCATTATTT GAATCTGCCA TGGGTGTGGA TTCGGCAAAA
GGAGTTTCTA CCAACCCGCG ACCGCTTGTC TGTAGATTGC AAGCTACGAT TTGCTCGATA
TATGTGTGTG ATGAAAACCA GTGTTAGCGC AGACGCCAAC AATATTTCTG CCTAACTGGA
ATTCCAAAAC GCAGCGTGAT CTGGGACGCA ATTTTCTTCC AAAGAAATCT CAGAAAATGT
TTGTGGAGAA AGTTTTCACC GACAAGATCA CGTACAGTCA GTCCCTAAAC CTTACTCTTT
CATCTTTTCA AAAATAGGAG CGACGATGGA TACACTTTTG TGGTCAAAGA CACTGAGAAG
TTTGCTTCGG AGGTCATCCC GGAGTTTTTC AAGCACAATA ACTTCTCATC ATTTGTTCGC
CAGCTCAATT TCTACGGTTT CCGAAAAATC AAGTCTGATC CCCTTCGCAT AAAGGATGCC
GAAACAAATG AGGAATCGAG GTTCTGGAAG TTCCGTCACG AAAAGTTCCA GCGAGGGCGT
CCCGACCTTC TCGGAGAAAT TCGGAAGTCG AATCATAATG AATCTGCGGA TAAACGTGAA
GTGGAACACC TCAAGAACGA GGTTGACCAT CTTAGGAGCA AACTTGCCAC AATGTCCAGT
GACCTGGAGC AGCTCACGGG TGTTGTGGGT ACACTTATGA AAAACTGTCA ACTACATGAT
ATTGACTCGA AGAAGCGCAA GATTACGCAA GGCCCTGATC CAGTCCTTAG TTGGCATAAA
ATGGAACACG GCACTCCCGA CCTTTCTTCC TTGGAACCAA TGCCCGTGGG TTCCCTGTCA
TATGAAGCTG CACTTTTCGA GGATCTCGCA AAGGATCCCA CGATCGATCC GTTTGCCAGT
GCCATACATA ATTCTGAACA ATGCGAATAC TTTCCTCGTT CGGTTTCCTT GGAGGGACAC
GAGTCACAGG ACGATGAGGC GATGGCTTCT CTTCTTGCTC TCGACCCAGT TGACGAAATT
AAAATCTTAC AGAATCCTGA TAGTTCGGGT ATCGGGGTGG AGTTATCTGA AGCTATCAAG
CCGGCTGCAA CAGGAACTGA CCCACATCTT ATTGAGAAGC TTCAAATATC CCTGGGAAAT
CTACCGAAGG ATATGCAAGA GCTGTTTGTC GACCGCGTGG TTTCCTTTGC AGCAAATCCG
GAATGCTTCC AGCGGCAGAT CGACGCAATG ACCTCGTTGG CTACTTCTGC GGCCGACGAA
GCCCAACGGC GCTTAATTGC CGCAGGGAAG AGCCCGAGCG ACCCCAAGTG TGTTCCTTTG
GCTTCAGCAG TATTGGGGGC TTACCTTACT CGATTTGCGA CGCTTCCTGC GTCCGATCTG
CAGTCTATGG AAGCCTCCAG CCATGTGCCG TGCTCGGGTT CTCCTTTCAC ACACATTTGA
 
Protein sequence
MQIRNCLLHH LPIFSKFSFS YYQIAVHFTQ DSDNKTMSFP SIELETRQSK QRVTRAAKTN 
EVNSNAPIFL RKTFTMIDSC DPYVAAWSDD GYTFVVKDTE KFASEVIPEF FKHNNFSSFV
RQLNFYGFRK IKSDPLRIKD AETNEESRFW KFRHEKFQRG RPDLLGEIRK SNHNESADKR
EVEHLKNEVD HLRSKLATMS SDLEQLTGVV GTLMKNCQLH DIDSKKRKIT QGPDPVLSWH
KMEHGTPDLS SLEPMPVGSL SYEAALFEDL AKDPTIDPFA SAIHNSEQCE YFPRSVSLEG
HESQDDEAMA SLLALDPVDE IKILQNPDSS GIGVELSEAI KPAATGTDPH LIEKLQISLG
NLPKDMQELF VDRVVSFAAN PECFQRQIDA MTSLATSAAD EAQRRLIAAG KSPSDPKCVP
LASAVLGAYL TRFATLPASD LQSMEASSHV PCSGSPFTHI