Gene PHATRDRAFT_45966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45966 
Symbol 
ID7200840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp792594 
End bp794111 
Gene Length1518 bp 
Protein Length492 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180316 
Protein GI219119099 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCATCATGA GGTCCTTGGT CTCACAGCCG CATGCAGTAG TGGTTGGCCT GTTCTTGTTA 
GTCCACACAC AGCATAGTCT CTCGTTCAAT CTCGTGAAAA CGAGGCTATC TGTCGGACGC
AATACCGATG GGGTTATTGA TCATGACTTC GCGGACGATC ATCCCAACGT GATGTCGATT
CCTGCTTGTC ACGGTCCGTT CAGCCCCGAC TTACTGACCA ATTACTGGGG ACGCTCTCCG
CTATTGATTC GATCAGCTTT TCATGCCGAA GCTTTGACGG AAGTTTGGCC CAGCCAAGCC
GATCTGTTGG AACTCGCGCT CGACGATGAC GAAATCAGTA GCGATTCGGC CCGGATTATA
ACGCATACCT CTGGGCGCCT TGATTCCTTT GCCTCACAAT TAGGACCGTT CTCCACTTCC
ACTATTCAAG GACTTGAACA TGGAGACAAG ATGTGGACTC TAATCGTCAA CGATGTGGAT
CGATACGTGT CGACACTCGC CGATTGGATG GACGACGAAT TTGGATTCTT GCCCCGTTGG
CGTCGCGACG ATGCGCAGAT TAGTATGGCA CGCACGGGTG GGGGCATTGG TCCCCACGTC
GATAGCTACG ATGTCTTTTT GACTCAAACA TCCGGTCAAC GAACCTGGCT CGTGGGAAAC
ACCATGACGG TCCAAGAAGA AATGAACACG CTCATACCCG ATTTGTCGGT CCGCATTTTG
CGAGACGTCA GCAACCACAA CGAGAGCTCC CACGCGTATA CCCGACTGGA ACTGCAACCT
GGCGATGTGC TGTACCTTCC ACCCCGATAC GTACACTGGG GTACAGCTCT CACGGATGAT
TGCGTGACGT TGAGTGTTGG AGCACGGTCG CCGTCAAGTG CCGAGCTGGT AGCGCGAATC
GCCGAAACCA TGCTGGGGTC CGTCTCGGTT CACGCCGTGC AACGGTACAC GGATCCAGAC
TTGCTACAAG AAGTAAACGG TGCACCGTTA CATTCAATGA CCAATCACGC CAAAGATAGT
ATGAAGACCA TGGTTCTCGA TGCGGTGCAC GAAATCACGG ATGATCCGAT GCGTTGGGAT
GAACTTGTCG CCAAGCTAGC CACCGAGCCC AAACGGATGT CAGAGAACGC TCTGGTTCCC
TATAACGAAA TAAAAGACTC CGAGTATCTG GCAATTTGGG GAGGAACGCC GCGGGATGCG
CTTGCACGGA TACGCGAAGG TCGGGGCGCC CTCTATCGAA TAGAAGGCGT ATCGTTTGCC
ACTTCACGTG TCGAATACGA TGGGGTAATA ACTGAACGAT TATTTGCGCA CGGATCAATG
TGGGAAATTT GTGACGACGA GCTAGCTACG GCAGTTCTTT GCAGAATAGA GAAAGGCAAG
CCAATCACAA TCAGTCACAT AGAAGGTCTT TCTGCCCCTC TGGCGGAGCT ACTGACAAAC
TTGATATCCG AAGGGATTCT ATATGCCTCC GAGGATCTCT CCTAAGGTAA ATAAGGAATA
ATATAATTGT GCTAGTGT
 
Protein sequence
MRSLVSQPHA VVVGLFLLVH TQHSLSFNLV KTRLSVGRNT DGVIDHDFAD DHPNVMSIPA 
CHGPFSPDLL TNYWGRSPLL IRSAFHAEAL TEVWPSQADL LELALDDDEI SSDSARIITH
TSGRLDSFAS QLGPFSTSTI QGLEHGDKMW TLIVNDVDRY VSTLADWMDD EFGFLPRWRR
DDAQISMART GGGIGPHVDS YDVFLTQTSG QRTWLVGNTM TVQEEMNTLI PDLSVRILRD
VSNHNESSHA YTRLELQPGD VLYLPPRYVH WGTALTDDCV TLSVGARSPS SAELVARIAE
TMLGSVSVHA VQRYTDPDLL QEVNGAPLHS MTNHAKDSMK TMVLDAVHEI TDDPMRWDEL
VAKLATEPKR MSENALVPYN EIKDSEYLAI WGGTPRDALA RIREGRGALY RIEGVSFATS
RVEYDGVITE RLFAHGSMWE ICDDELATAV LCRIEKGKPI TISHIEGLSA PLAELLTNLI
SEGILYASED LS