Gene PHATRDRAFT_31781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31781 
Symbol 
ID7196121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp909383 
End bp910522 
Gene Length1140 bp 
Protein Length379 aa 
Translation table 
GC content56% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176681 
Protein GI219109856 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACTG GCACTTCGCC GAAGGCTCCA ATCTACAATC AATACAAGCG CAAGGCCCCG 
TTTTCGTCCT GTTCGAAAGG ATCGGCTTCT GCGGACACCT TTCCAATACG GCAAGAACAA
TCCGCTTCCC GAATTCCTCT CACAGAGATA CGGAACCAGT ATGCTTCTAC CTCCACTAAC
CGTAGACAGA GACGATTGTC AATCCCCAAT GCCATTTCGG TCCAATCTCC CCTGCTCCCC
TCTCCCACAA CTGCCTTGCA GCTGCTACAA CGGCATCAGC AACGAACGCA CACCCTCAGA
TTAATACCGA ACGGTTTGGA ATCCTGGTTG TCCTTGGCCC GGCCCGGCAT TTACGAACTC
GTGGGGGAGG CCGGAACCGG CAAATCACAA ACCGCCCTTA GCGTGTGCGT CCAAGCAGCT
TCCTCGACGA CAGGCCCCTC CCGAGAACCA CCGTTGATTG CCGCACCCAC CACCGGCACT
GACGTCTCTT TGCATCTGAT CCCGTGTCGT GCAATATACA TTTCTCTCAA GGCTCAAAAC
AATGTCGTAC AAATCGTCAA ACGACTGGAA CAAATGGTCC TCTCCCGGCA GGAACAACGA
CCGGCGTCCA CACCTGACCA AATGAAGGCT CCTACCAGAT CTCCCCCTTG CACTATTTTG
CAGCGCATTC TGACACGCGC AGCGTGGAAC GCAGAACAAC TCACGCAGGT ATTGGACGAA
TTGCCAGTCT TGCTAAAATC CGGCACCGTC CGTGTGCTCG TCTTGGATTC CATTGCCGAC
ATGTTCCGCA CCAGTGAGGA CACGGACGGC ACTCGTTCGC AACAATCGTC ACACCACCAT
GCGGCTCGCT CGGCCATTTT GTTTGGACTC GCGGCGCGTC TCAAAAAACT GTCGGACGTC
TTTGATGTCC CCGTACTCGT CATCAATCAA GTGGCCCTGT CCGGAGTCTG GACCAAGCCC
GCCCTGGGTT TATCTTGGGC GCACTGTATC GACGTACGGT ACATTCTAAC CCGACAGGAA
CGCGGCGGAG ACGCGGGTGT CGTCTTTGGA CGCCGCGTCA CGTTGGACGC GTCGTCCAGC
CATGCAACCG GACAGCACAA GGCCTTTTTT ATACGTGCTG ATGGAGTTGT TGCCGGGTAA
 
Protein sequence
MSTGTSPKAP IYNQYKRKAP FSSCSKGSAS ADTFPIRQEQ SASRIPLTEI RNQYASTSTN 
RRQRRLSIPN AISVQSPLLP SPTTALQLLQ RHQQRTHTLR LIPNGLESWL SLARPGIYEL
VGEAGTGKSQ TALSVCVQAA SSTTGPSREP PLIAAPTTGT DVSLHLIPCR AIYISLKAQN
NVVQIVKRLE QMVLSRQEQR PASTPDQMKA PTRSPPCTIL QRILTRAAWN AEQLTQVLDE
LPVLLKSGTV RVLVLDSIAD MFRTSEDTDG TRSQQSSHHH AARSAILFGL AARLKKLSDV
FDVPVLVINQ VALSGVWTKP ALGLSWAHCI DVRYILTRQE RGGDAGVVFG RRVTLDASSS
HATGQHKAFF IRADGVVAG