Gene PHATRDRAFT_54101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54101 
Symbol 
ID7197201 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp680008 
End bp681616 
Gene Length1609 bp 
Protein Length471 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177983 
Protein GI219112463 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.644826 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGAAA CTGACAAGCC AACTATCGTA GAAGCCGGTG AACCTGTAAG TATTGAAGAG 
GCCTTTGCCG CACCAGAACT GTGCGTTTGG CAACAATATC TGACTCGCTT GTTTGGACAG
GTTGAATGGA AGCAGTACAG TACGTACAGT ATTAAGACCG ACCCCGACCA AGATGATAAG
GCTACAGAAA TCAAACTGTG CAGCTTTGCC CGGCCCCACA TGCGAGCTTT CCACTGTTCT
TGGTGGTGTT TTTTCATTGC CTTCTTCATT TGGTTTGCCA TCGCCCCCCT TCTCTCCGAA
ATCAGAGACG ATATTGGCAT CACCAAACAG GATGTTTGGA CTTCGTCGAT TGTCGGAGTC
GGCGGAACAA TTTTGATGCG CTTTATTATG GGACCCATGT GTGATAAATA CGGTGCTCGT
ATTTCTCTTG ATTCTGTCGT TCGCTTCTAT TCCTACGGCA TGTACTGGAT GCATGTACTG
GATTCGTGAA CAGCGCCACC GGACTCGCGG TCTTGCGTCT GTTCATTGGT GTTGCTGGTT
CTACCTTTGT TCCTTGCCAG TATTGGTCGA GCCGTATGTT TTCGAAAGAA GTCGTTGGAA
CAGCTAATGC TTTGTGCGGT GGCTGGGGAA ATCTGGGTGG TGGAGTCACA CAGCTTGTCA
TGGGATCTGC CCTCTTCCCG CTGTTCAAAA TTTTCTTTGA CGGCGACTCA GAAATGGCCT
GGCGAACAGT TTGTGTTATC CCAGCCATTA TTGCCATGGC ATCTGGTATT ATTGTGTATC
GTATCAGTGA CGATGCTCCG AAGGGAAACT ACGTTGATAT GAAGAAGCAT GGTACCATGC
CTGAAGTCTC AGCTGCTGCC TCATTCCGTT CAGGAGCATT GAACCTCAAT ACATGGGTCT
TGTTTGTACA GTATGCGTGC TGTTTTGGAG TGGAGCTGAC TATGAACAAT GCCGCGGCTC
TGTATTTTAA GGACGAGTTT GGTCAATCGA CAGAATCTGC TGCTGCAATT GCTTCCATTT
TTGGATGGAT GAATCTTTTT GCTCGCGGTC TCGGAGGCTT TACAAGTGAT AAGGCCAACG
CCAAGATGGG AATGCGCGGA CGTCTTTGGG TACAAACTAT TTTTCTTGCG CTCGAAGGTG
CCCTTGTTCT GGTATTTGCT CAGACTGGAT CGCTGGTTGG AGCCATTGTT GTCATGATTT
TCTTCTCCTT GAACGTCCAA GCCGCTGAAG GCGCTACTTA TGGAATAGTT CCCTATGTCG
ACCCCGCCTC TACTGGATCC ATTTCCGGTA TCGTGGGAGC TGGAGGTAAC ACTGGTGCCG
TCTGCTTCGG ACTCGGATTC CGTCAGCTCA GCTACGAAAA AGCATTTAAC ATTATGGGGT
ATTCCATCCT TGCGTCAGCC TTCATGTCAG CTTTAATCAA CATAAAGGGG CATGCAAGTA
TGTTCTGGGG TAAGGATGAA ATTATCGAAA AGGGAATACT TGCTGTTCCT ATGCCAGAGG
CTGAAGAAGA GATCGAAGCC TAGAGTCTCC TGGTTGATTT TGTCCATTTC CCCCGACTAT
TTCCTTCAAC ATCTTATTCT TAAAGTTACT TATTTTCTTT TCTACACTA
 
Protein sequence
MSETDKPTIV EAGEPVEWKQ YSTYSIKTDP DQDDKATEIK LCSFARPHMR AFHCSWWCFF 
IAFFIWFAIA PLLSEIRDDI GITKQDVWTS SIVGVGGTIL MRFIMGPMCD KYGARISLDS
VVRFYSYGIA TGLAVLRLFI GVAGSTFVPC QYWSSRMFSK EVVGTANALC GGWGNLGGGV
TQLVMGSALF PLFKIFFDGD SEMAWRTVCV IPAIIAMASG IIVYRISDDA PKGNYVDMKK
HGTMPEVSAA ASFRSGALNL NTWVLFVQYA CCFGVELTMN NAAALYFKDE FGQSTESAAA
IASIFGWMNL FARGLGGFTS DKANAKMGMR GRLWVQTIFL ALEGALVLVF AQTGSLVGAI
VVMIFFSLNV QAAEGATYGI VPYVDPASTG SISGIVGAGG NTGAVCFGLG FRQLSYEKAF
NIMGYSILAS AFMSALINIK GHASMFWGKD EIIEKGILAV PMPEAEEEIE A