Gene PHATRDRAFT_39777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39777 
Symbol 
ID7195636 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp4276 
End bp5483 
Gene Length1208 bp 
Protein Length392 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183791 
Protein GI219127123 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.837678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAG GGGATTCCCG CCGAAAGATT GGTGCTCAGG TCACAGCGAA GGCCTGTCAT 
GTCGTCCATT TGAGTGAGTG TGCTCGGCAA TATGGTGCTT TGAGGACCAC CAAGGTGGTT
GTAGGGACTG TTGTGGACGT CAGCAATACC AAAAAGCCGC CAAACAACCG TGTATCAACC
TTCAATACTG CTGACTTTGA TATTAGTGGA GGATCAGTCA AGCGGAGCAC TCTGAACATC
CGTAGTGTCA AACTTTTCAA ACCGGAGCAG TCGACAGTAC CAGCCAGTCC CGCAGCAAAA
TACCGGCAGT AGATAACATA GACACAGATT TGGCCGTTCC AGAGCAAGAG GAAGGAGAAG
CGGTCTTGCA GGAGACTTCT CCTGATGAAG AATTGGAATT TCCAGCACAA CCGATGGTGA
AAATTGGAAT AGCTGCGGGG GAACAGGTAG CAGGACCTAC TGCACAAGTA GCCACGCAGG
TTTGGGATAT TGAAGACGCT TCCTTTGTCA TGGCTCATGA AACAAAGTGC TATGCTGACA
AGCAAGCTAC ATTGATTGAT ATAAATGGCA GTATCCAAAG TAAGCAGTTT GGCATCAATA
CACCAATTGG CGACCTTCTT GGTCCAGACT CTGACATTGA TGGAAGATAT TCGCAGCTGC
AATATTTTCT TCTCATGTTT CCACCCGACC AACTGACCGC CATGTGTCAG CTAACAAATG
TGCAGCTTGC CCAACAGAAC AAGCACCGCA TGTCAACAGG AGAGCTGCTT CAATTCTTTG
GCATTCTAAT TCTTGCGACA AAATTTGAAT TTAGCAGTCG ATCGCAATTG TGGTCCACAA
CCACGCCGTC AAAATACATT CCTGCCCCTG TATTCGGAAA AACAGGAATG TCGCGGCAGC
GCTTTGATGA TCTTTGGCAA AATATCCGAT GGAGCAACCA GTGTCCTGAA CAGCCGGAAG
GTATGAGCTC CCATATGTTT TGGTGGCAAC TTGTTGATGA TTTTGTTGAA AGATACAACA
ATCATTGTGC CAACACTTTC AAACCATCTC ATCTTATTTG TGTGGATGAA TCAATGTCGC
AATGGTATGG ACAAGGGGAG GAATGGATAA ATCATGGACT CCCCAATTAC GTGGCTATTG
ACCAAAAGTC CGAAAATGGT TGTGAGATTC AAAATGCCGC ATGCGGCTGT TTGGGTATCA
TGCTTTGA
 
Protein sequence
MSEGDSRRKI GAQVTAKACH VVHLSECARQ YGALRTTKVV VGTVVDVSNT KKPPNNRVST 
FNTADFDISG GSCQTFQTGA VDSTSQSRSK IPAVDNIDTD LAVPEQEEGE AVLQETSPDE
ELEFPAQPMV KIGIAAGEQV AGPTAQVATQ VWDIEDASFV MAHETKCYAD KQATLIDING
SIQSKQFGIN TPIGDLLGPD SDIDGRYSQL QYFLLMFPPD QLTAMCQLTN VQLAQQNKHR
MSTGELLQFF GILILATKFE FSSRSQLWST TTPSKYIPAP VFGKTGMSRQ RFDDLWQNIR
WSNQCPEQPE GMSSHMFWWQ LVDDFVERYN NHCANTFKPS HLICVDESMS QWYGQGEEWI
NHGLPNYVAI DQKSENGCEI QNAACGCLGI ML