Gene PHATRDRAFT_40254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40254 
Symbol 
ID7195860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp526792 
End bp529053 
Gene Length2262 bp 
Protein Length753 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184151 
Protein GI219127874 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0361313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAG GGGATTCCCG CCGAAAGATT GGTGCTCAGG TGACAGCGAA GGCCTGTCAT 
GTTGTCCATT TGAGTGAGTG TGCTCGGCGA TATGGTGCTT TGAGGACCAC CAAGGTCGTT
GTGGGGACTT TTGTGGAGGT CAACAATACC AGAAAGGCGC CAAACAACCG TGTATCAACC
TTCATTACTG CTGACTTTGA TATTGGTGGA GGATCAGTCA AGCGGAGCAC TCTGAACATC
CGTAGCGTCA AACTCTTCAA ACCGGACCAG TCGACAGTAC CATCCAGTCC CGCAGCACCA
ATACCGGCAG TAGACAACGC AGACACAGAT TTGGCCGTTC CAGAGCAAGA GGAAGGAGAA
GCGGTCTTGC AGGAGACTTC TCCTGATGAA GAATTGGAAT TTCCAGCACA ACCGATGATG
GAAATTGGAA TAGCTGCGGG GGAACAGGTA GCAGGACCTA CCGCACAAGT AGCCACGCAG
GTTTGGGGTG TTGAAGACGC TTCCTTTGTA ATGGCTCATG AAACAAAGTG GTATGCTGAC
GAGCAAGCTA CATTGATTGA TATAAATGGC AGTGTCCAAA GTAAGCAGTT TGGCATCAAT
ACACCAATTG GTGACCTTCT TGGTCCAGAC TCTGACATTG ATGGAAAATA TTCGCGGCTG
CAATATTTTC TTCTCATGTT TCCACCCGAC CAACTGAGCG CCATGTGTCA GCTAACAAAT
GTGCAGCTTG CCCAACAGAA CAAGCACTGC ATGTCAACAG GAGAGCTGCT TCGATTCTTT
GGCATTCTAA TTCTTGCGAC AAAATTTGAA TTTAGCAGTC GATCGCAATT GTGGTCCACA
ACCGCGCCGT CAAAATACAT TCCTGCCCCT GCATTCGGAA AAACAGGAAT GTCGCGGCAG
CGCTTTGATG ATCTTTGGCG AAATATCCGA TGGAGCAACC AGTGTCCTGA ACGGCCGGAA
GGTATGAGCT CCCATACGTT TCGGTGGCAA CTTGTCGATG ATTTTGTTGA AAGATACAAC
AATCATCGAG CCAATACTTT CAAACCATCT CATCTTATTT GTGTGGATGA ATCAATGTCG
CGATGGTATG GACAAGGGGG GGAATGGATA AATCATGGAC TCCCCAATTA TGTGGCTATT
GACCGAAAGC CCGAAAACGG TTGTGAGATT CAAAATGCCG CATGCGGCTG TTCGGGTATC
ATGCTTCGAT TGAAGGTTGT AAAGGGTAAG ACAGCAACAG AAGATGATGG GGACTACAAT
GAACAGTTGC TGCATGGAAC AAAGATCCTC AAAGAGCTTG TCCTTCCTTG GTGGTGGACG
GATCGGATTG TTTGCGCTGA CTCGTATTTT TCATCTGTCG GTACAGCTAT GGAGTTGCAG
CGACATGGTT TGAGATTTAT TGGAGTTGTA AAAACAGCAA CAAAACAATA TCCGATGAGA
TACCTTTCGA CTTTAGAGTT GAACCAGAGA GGCGAACGGA GAGGGCTTGT GATGCGAGAT
GTTGATACAA ATTATAGCAC TCTGTTGGCT TTTGTGTGGA TGGACAGGGA CCGCCGATAT
TTTGTGTCGA GTGCTTCCAG TCTGGATGCA GGCAAGCCCT ACATACGCTA TCGTTGGAGA
CAGATTGACC AATCTCCGGA TGCAGATCCA GAGAGGCTGG AAATTATCAT TCCACAGCCC
AAAGCAGCGG AATTATACTA TTCTGCATGT GGGATGATTG ACAGGCACAA TCGAAGTCGT
CAGGATACAC TGATGCTTGA ACGAAAGTTG GGTACAACAA ATTGGTCGAC AAGGGTTAAC
CTCTCAATAT TTGGAATGAT TGTTGTTGAC ACTTGGTTGG CCTACAGTCT GTGTACAGGA
ATAGGAAGAG CTAACGGGAG AGAAGAAAAG CAGAAAGACT TCTACACTGC CTTGGCTGAG
GAGCTAGTGG ATAACCAATA CGACAATGTT GGAAGTCGCA GAGTTTTCGT GGAGGCAAAT
TTGGACAATG ACAGCCCAGC ACTTTCAAGG ACTACGGGAG AACCAAGAAG TGGCCTGTAC
GCACATCTAA CACCAACCAA AAAAAGAAGA AAGAACAAAG ACGGTAGTTT TAGCAGCAAT
AGACTACAAG GACGATGCTT GGTGTGTTCC AAGAAGACAA CATATGTTTG CTCAGTGTGC
AAAGATGAAG AAACACCTCA CTCCAGAGAA CCGTGGGTTT GTTATACCAC CAAGGGGAAG
CTGTGCTATG CAAACCACAT GGCTACCTGT CACGGCGCCT AA
 
Protein sequence
MSEGDSRRKI GAQVTAKACH VVHLSECARR YGALRTTKVV VGTFVEVNNT RKAPNNRVST 
FITADFDIGG GSVKRSTLNI RSVKLFKPDQ STVPSSPAAP IPAVDNADTD LAVPEQEEGE
AVLQETSPDE ELEFPAQPMM EIGIAAGEQV AGPTAQVATQ VWGVEDASFV MAHETKWYAD
EQATLIDING SVQSKQFGIN TPIGDLLGPD SDIDGKYSRL QYFLLMFPPD QLSAMCQLTN
VQLAQQNKHC MSTGELLRFF GILILATKFE FSSRSQLWST TAPSKYIPAP AFGKTGMSRQ
RFDDLWRNIR WSNQCPERPE GMSSHTFRWQ LVDDFVERYN NHRANTFKPS HLICVDESMS
RWYGQGGEWI NHGLPNYVAI DRKPENGCEI QNAACGCSGI MLRLKVVKGK TATEDDGDYN
EQLLHGTKIL KELVLPWWWT DRIVCADSYF SSVGTAMELQ RHGLRFIGVV KTATKQYPMR
YLSTLELNQR GERRGLVMRD VDTNYSTLLA FVWMDRDRRY FVSSASSLDA GKPYIRYRWR
QIDQSPDADP ERLEIIIPQP KAAELYYSAC GMIDRHNRSR QDTLMLERKL GTTNWSTRVN
LSIFGMIVVD TWLAYSLCTG IGRANGREEK QKDFYTALAE ELVDNQYDNV GSRRVFVEAN
LDNDSPALSR TTGEPRSGLY AHLTPTKKRR KNKDGSFSSN RLQGRCLVCS KKTTYVCSVC
KDEETPHSRE PWVCYTTKGK LCYANHMATC HGA