Gene PHATRDRAFT_13578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_13578 
Symbol 
ID7202155 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp325123 
End bp326247 
Gene Length1125 bp 
Protein Length345 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181354 
Protein GI219122023 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGATC CAACTGGAAT TGCCTTCACG CCGGCTCTAC ACGTTGACGA AATTGGTCTA 
ACATCCGAAA AGTATATTCC GATCAATGAA ACCTTGACGA GCTTGCCTTT GCGCATTTCT
TTCGATCGCA GCGACATGCA TCATCAGGCA ACAACGAGCT CGACGGCAAC GGCAGGTGGA
TTGAGCCCAG CACGTTGGCG CTTGCTAATG CACCTTTCGC AAGCGATTGA ACAGCAGACG
CAGTTGGGTT TTGAGCAGTC CGATATTGAC GATGTTCGGC GACTTATTGC GGAGACAAAC
GTGACCCTTT TGGCAATAAC GATGCTAGCG AGTGCCCTAC ATTTGTTGTT CGAGTTTTTG
ACATTTAAGA GCGAGGTCAA CTTCTGGAGC AAAAATGAAG ATTTGACAGG TTTGAGTGTT
CGTTCCTTAT TCCTTGATAT GATCGGACAA ACGGTGATAC TTTTCTTTCT AATCGACAAA
GACAGTTCAC TTCTAATGAC AATTCCAAGT GCCTGTGGAT GCCTTATTGC ATTGTGGAAA
TGCCAGCGAG CGGCCGGTCT TCGTTTCGTC CGCACAACAC CCGACCGCAA TATTGCGTGG
TGGAACTGGT TGCCGAGTAT GGTTGGCTTT GAACTCCGTG CGACTCGGTT AGAATCGCAA
CTAGCGTCTA TGGCCAGGAA AGAGAAGGAA CATTCGGCGG CGGCACGCAA GCAGGACCTC
ACAGCTTTGA CAATTGAGTC TGACCGAATC GCTACACGGA CGCTTGGAAC GGTTCTCTTA
CCATTCGTCG TCGGATACAC CCTTTATTCG CTCGTCTTTG AGGAACACTT AGGCTGGTAC
TCCTGGCTGA TCACGTCGGC TTCATCGGCT GTGTACGCCT TGGGCTTCGT GCTGATGACG
CCGCAACTTT TTCTGAATTG GAAACTTAAG AGCGTTGCTC ACCTGCCTTG GCGTGTGCTG
GTCTACAAAT CATTGAATAC TTTCATCGAC GATCTCTTTT CCTTCATTAT ACGAATGCCG
ACTATGGCAA GAATTAGCTG CTTTCGAGAC GACGTTGTCT TTTTTATCTA CCTTTACCAA
CGTTGGCTTT ATCCTGTCGA CGCATCTCGG CCAGCGGAAG GTGGT
 
Protein sequence
MDDPTGIAFT PALHVDEIGL TSEKYIPINE TLTSLPLRIS FDRSDMHHQA TTSSTATAGG 
LSPARWRLLM HLSQAIEQQT QLGFEQSDID DVRRLIAETN VTLLAITMLA SALHLLFEFL
TFKSEVNFWS KNEDLTGLSV RSLFLDMIGQ TVILFFLIDK DSSLLMTIPS ACGCLIALWK
CQRAAGLQSQ LASMARKEKE HSAAARKQDL TALTIESDRI ATRTLGTVLL PFVVGYTLYS
LVFEEHLGWY SWLITSASSA VYALGFVLMT PQLFLNWKLK SVAHLPWRVL VYKSLNTFID
DLFSFIIRMP TMARISCFRD DVVFFIYLYQ RWLYPVDASR PAEGG