Gene PHATRDRAFT_31662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31662 
Symbol 
ID7196301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp645249 
End bp646427 
Gene Length1179 bp 
Protein Length392 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177131 
Protein GI219110759 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGAGC GAAGCTGCGC CAACGCTTCT GACGATGACA GCATTCACAA GCAGTCGCCA 
GAATTGGAGG CTGAGTTTCT TCATACCAGC AAGTTGACCT TAGCCGACAT GCGACGATTG
GCGCACGATC CGAAGGATCG GAGGTTGGCA ACAAAACCTG CGGCGCAAGC TACGAAAGAA
GACGTCTTGA CGGTACAACC CATGAGTTTC GTAGAACACA CTGCTTGCTG TCTGTTTCTC
GCGTTTGGAG TGCCCAATGG CGCTCTGACG ATTCCCATAG CAACGTGGCT GATCGGAAAA
TTCGTGGTAC GCAACGTTTT CTTGGCGTTT CTGTTAGCAG GCTGTATACT TCTACCGCTT
GCGATACTGC CGCAAGAATA TGTGCCCGCC CGATTGCAAT CGTGGCTTGC TTTGCAGATA
CTGAAATATT TTTCTTTCTC TTTGGTCATG GAGGAACGCC CTCCGACAAT GTGTACTGGC
AAGCAGCTGA TCGAGCAGCC CGCTCGGCCA CGAATCGTCA CAGCCTATCC GCACGGAGTT
TTCCCATACG GAAACGCGTT GACTGTAGTC ACATGGCCGT TGTTGACGGG ACACCATATT
GTGGGTTTGG CAGCAAATGC CGCTTTGCGG ACACCGATCT TTAAACAAAT CTTGCGGAGC
ATTGGCGTCA AGGACGCCTC TCGAGCGTCG GTACGGAATG CGCTGGAAAC ATGGCCTTTC
ACCGTCGGGA TTTCGCCAGG TGGCGTGGCG GAAGTTTTTG AAACAAACCA CTTCAATGAG
CACATTCTGT TGAAAGAACG TATTGGTGTC ATCAAGATGG CCATTCGCAC CGGTGCGGAT
CTTGTACCAG GCTATATGTA TGGTAATACT AATCTGTACT GGTGCTGGAC AGGGGAAGGT
ATTCCTGGAG CTCGGTGGCT ATTGGAGTAT GTTTCGCGTA AAATCCTAGG TTTTGCCCTC
GTGCCTATAG CGGGTAGATG GGGACTACCA ATACCGTACA GGACTCCGAT ATTGTGTGTC
GTGGGCAAGC CAATACCAAC CATTCATTTG CAAACCGAAG AACCATCAAT GGAGCAAATC
GTGGACATTC AGGAACAATT GTCAACAGAA TTGAAATCAA TGTTCGACCG CTATAAGCAC
CTGTACGGAT GGGAAGATCG AATGCTAGTG ATCACATAA
 
Protein sequence
MRERSCANAS DDDSIHKQSP ELEAEFLHTS KLTLADMRRL AHDPKDRRLA TKPAAQATKE 
DVLTVQPMSF VEHTACCLFL AFGVPNGALT IPIATWLIGK FVVRNVFLAF LLAGCILLPL
AILPQEYVPA RLQSWLALQI LKYFSFSLVM EERPPTMCTG KQLIEQPARP RIVTAYPHGV
FPYGNALTVV TWPLLTGHHI VGLAANAALR TPIFKQILRS IGVKDASRAS VRNALETWPF
TVGISPGGVA EVFETNHFNE HILLKERIGV IKMAIRTGAD LVPGYMYGNT NLYWCWTGEG
IPGARWLLEY VSRKILGFAL VPIAGRWGLP IPYRTPILCV VGKPIPTIHL QTEEPSMEQI
VDIQEQLSTE LKSMFDRYKH LYGWEDRMLV IT