Gene PHATRDRAFT_39727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39727 
Symbol 
ID7195445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp554516 
End bp555751 
Gene Length1236 bp 
Protein Length411 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183631 
Protein GI219126788 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGGAC CCCATTTCGA ACCATGGTTT ATCCTGGCTA GTCGTCCCTC GACAGCGACG 
GACGAGCAAA GTGCGGGACG AAAGGCTCGC TATGCGCTTG GTCTCACTTT CATCATGATG
CAGTGCATCG TATGGATCTG TGCATCCGTC ACAACTCAGT ATTTATACGG AGGACAAGGC
TTTCATTCGC CGTTTCTCAT GACCTTTGCT GGTGTTGGTA TGTTGGCCAT ATTTTTGCCG
TTGCGACTTC TAGCTGTTCG AATTGGGATA GCTCCGAAGC TCCTCAAAAG TACAGAGGAT
GCAGATCCTG CGGTCAACAA TGGTGTTGGC AATAGTCACG ATGAAAAACT CGCGCAAGCC
ACATCATACC ACCAAGTTTT TGATGCTGTT GCCTCCGAAC GGCGTGAGCT GTCCCATCCT
ACAACGTTCT GGAATCATCG CAAACATGCT TTAGCTGCGC TTCACATTGC ACCCGCCATG
TTCTTTGCCG ACTGGTGTTT CAATCACGGG CTAGCATACA CATCTGTCGC TTCAAGTACG
GTTCTAGTTT CCACTTCCTG CGTCTTCGTT TTCTTGTTCG CTGTTCTGGT GCGAGTCGAG
GCCTTTCACT CTGTGAAACT TGCTGGCGTA CTGCTCGCAG TGGCGGGTAC CGTTTTAACA
ACGATGGGCG ATATTTCCGT CAGCGAGGAA TCTAGCGGTG TGGATGCCGA AAGACATGTT
TTGACAGGCG ATCTCTTCTC CCTCATGGCA GCCATTGGCT ACGCATTTTA CACTGTACAA
GTCCGTGTTT TGTGTCCTCA AAACGAGGAT CTTTACAGCA TGCAGCTATT GCTCGGCTAT
GTTGGTGTAG TTGCCACCAT ACCGCTTCTA CCCGTTGCGT GTTACGCTTT GACGCAAGTC
ACATTCACGC CAAAAATAGC CGCCGTTTTG GTAGTCAAGG GACTGTTGGA TTTTGTTATT
ACGGACTATC TGTTATTTCG CTCCGTAATT TTGACCAACG CAACGACGGC TTCCGTCGGC
TTGGGATTGA CGATCCCCTT GGCTTTTTTG GTCGACTGGG TCTTGGGCAA GGGCAACGCA
ACGACCATTC AATCCTTGCT TGGACCAGTA GCCATCGCTA TCGCCTTTTT GATAGTGAAC
CTTACTGGCA ACTCGATAGA CGAGCGGGAA CAGAATATTC ACGACACAAA TACACCATCG
ACTGAGAATC CGCAATCGGC AGGAGTTTTT GCATAG
 
Protein sequence
MQGPHFEPWF ILASRPSTAT DEQSAGRKAR YALGLTFIMM QCIVWICASV TTQYLYGGQG 
FHSPFLMTFA GVGMLAIFLP LRLLAVRIGI APKLLKSTED ADPAVNNGVG NSHDEKLAQA
TSYHQVFDAV ASERRELSHP TTFWNHRKHA LAALHIAPAM FFADWCFNHG LAYTSVASST
VLVSTSCVFV FLFAVLVRVE AFHSVKLAGV LLAVAGTVLT TMGDISVSEE SSGVDAERHV
LTGDLFSLMA AIGYAFYTVQ VRVLCPQNED LYSMQLLLGY VGVVATIPLL PVACYALTQV
TFTPKIAAVL VVKGLLDFVI TDYLLFRSVI LTNATTASVG LGLTIPLAFL VDWVLGKGNA
TTIQSLLGPV AIAIAFLIVN LTGNSIDERE QNIHDTNTPS TENPQSAGVF A