Gene PHATRDRAFT_42151 details


Gene Information

Locus tag: PHATRDRAFT_42151
Symbol:
ID: 7202833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 347752
End bp: 348960
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002181890
Protein GI: 219123143
COG category:
COG ID:
TIGRFAM ID:
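
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 347752
end_bp = 348960

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced: No")
# and ends in one stop codon that is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1209 402
```

Under these assumptions the computed values agree with the listed Gene Length (1209 bp) and Protein Length (402 aa).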


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACGGGC GCGGTGGTGG TGGTGGACGT TCGTCACGTC GTTCGGCCGC CTCGGCCAAC 
GTCGACACCA CCAAACTCTA CGAAACGCTA GGTGTGGACA AGAGCGCGAC GGCACAAGAA
ATCAAAAAGG CCTACCGCAA ACTTGCCGTC AAACACCATC CCGACAAGGG CGGGGACGAA
CATTATTTCA AAGAAATCAA CGCCGCGTAC GAAATCCTGA GCGATTCCGA GATGCGGACC
AAATACGACA AGTATGGTCT GGAAGGTCTC GAAGAAGGCG GCGGGAGCGG CGGGGCAGCC
TCCGAAGATC TGTTTAGTAT GTTCTTTGGG GGAAGAGGAG GTCGTCGAAG TGCCGGACCC
CGACGTGGCG AGGATGTCAA TCATCCGGTC AAGGTATCGT TGGAGGACCT GTACAACGGC
AAAACAGTCA AGCTAGCCGT CAATCGTCAA GTTCTGGTTG GAGAAGCCCG CGTATGTACC
TCCTGTGACG GCCACGGGAT GGTAATGGAA CTGCGACAGA TTGCTCTAGG CATGGTGCAA
CAGATTCAGC GCGCGTGTCC AGACTGCGAA GGCGAAGGCT ACCAGTGCCA GAAGAAAAAG
GAACGAAAAG TTTTGGAAGT GTTGATTGAA AAAGGAATGC AAAACAAACA AAAGGTTGTA
TTCCAGGGAA TGGCCGACGA GAAACCAAAC ATGGAAGCAG GCAATGTCAA CTTTATTGTA
CAAGAAAAAG ATCACGAGCT CTTCAAGAGA AAGGGTGCTG ATTTGCTCAT TTCCAAGACC
CTGTCGCTCA AGGAGGCACT GTGTGGATTT GCATGGAAGG TAATGCACTT GGACGGCCGT
GAAGTCATCA TCAAGTCAAA GCCAGGAGAA GTCATTCAAG CTGAAGCCGC TGGAGGTCGT
CCGTTTGTCA AATGCGTCCC CAACGAGGGC ATGCCGAGTC ACGGGAATCC CTTTGTGAAA
GGGAATCTGT ACGTGTTATT CACGGTACAA TTTCCGAAAG ATGGAGAGAT CCAACCTGCG
GATGTAAAGC AGCTCAGACG GTTTTTGCCG GGATCGGCCA TGGAATGTGA CTACGACGAA
GACACTGCCG AAGTTGTCCA TCTGGAAAAC GCCGACGTGC GTAGCTTCGG TAAAGGAGGG
GTGCAAAATC AAGACGCAGC TTACGATTCT GACGGGGAAC AAGCTAGTCC GCAATGCCAA
CAGTCTTAA
 
Protein sequence
MHGRGGGGGR SSRRSAASAN VDTTKLYETL GVDKSATAQE IKKAYRKLAV KHHPDKGGDE 
HYFKEINAAY EILSDSEMRT KYDKYGLEGL EEGGGSGGAA SEDLFSMFFG GRGGRRSAGP
RRGEDVNHPV KVSLEDLYNG KTVKLAVNRQ VLVGEARVCT SCDGHGMVME LRQIALGMVQ
QIQRACPDCE GEGYQCQKKK ERKVLEVLIE KGMQNKQKVV FQGMADEKPN MEAGNVNFIV
QEKDHELFKR KGADLLISKT LSLKEALCGF AWKVMHLDGR EVIIKSKPGE VIQAEAAGGR
PFVKCVPNEG MPSHGNPFVK GNLYVLFTVQ FPKDGEIQPA DVKQLRRFLP GSAMECDYDE
DTAEVVHLEN ADVRSFGKGG VQNQDAAYDS DGEQASPQCQ QS
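
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotides codon by codon. The sketch below is a minimal, hedged example: it assumes the standard nuclear genetic code (the record leaves the translation table field empty), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tooling. Only the first three and the last 10-base blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


first_blocks = "ATGCACGGGC GCGGTGGTGG TGGTGGACGT"  # start of the gene sequence
last_block = "CAGTCTTAA"                           # end of the gene sequence

print(translate(first_blocks))  # MHGRGGGGGR
print(translate(last_block))    # QS
```

The two outputs match the first ten and last two residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.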