Gene PHATRDRAFT_47350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47350 
Symbol 
ID7202405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp367919 
End bp369337 
Gene Length1419 bp 
Protein Length415 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181540 
Protein GI219122414 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATAATGGCA CTACCGTAAT TGTCTGATGC ACACCGTAGG GCAAACCAAA TCCACGTATC 
CACGAAACCA ACTGCTGAAT TTTTGGTCTT GTACAAGTAT TTTGCGACAA CCGTAACTCA
CTTCTTGTAT TTCTAAAAGC TAGTCCGTAC AGAGGTTTCT GTTAATCTAA AATGGACTGG
GGGGCGCTCG ACGAAGAAAG CTGCAACTGC GACGAAAGCC CGAGCCACAA GAACCGTGAT
ACTAAAGACT ACTTTTCAAC CAGCAGTGAA AGAAGAGTTT CCGGTCCAGG GAACAGTCAA
TTGTCGAAAC AGACGTCTGG CCAAGCTTGC GCCGATTCGA AGCTGAAAAA GCAGCCCAAG
CAGACAGGGA AATGGAAGAA GCCCCCCGGG ATGCCCAAGC GGCCGCTTTC TGCGTATAAT
TTGTTTTTTG CACACGAAAG GAAGCAGCTT ATTGCCTGTG GCGTCCTTGC CAGTGGCAGA
CAGAAGAAGC ATTATACAAA GCAATCGACG ACTTTACGTA GACCTCCCCG AAAAATGGGA
TTTGCAGGGC TAGCGCGAGC GGTTGCAGCC AAATGGAAGA TGATTGATGA TAGTACCCGG
CATACCTTCA ATCAACAGGC AGAATGTGAG CAAGCAAAAT ACAAAATAGC AATCAAACAG
TGGAACAGCC AGCATCTAGA TTTGTCAGAA ACGAACGTAG TCAATCATGT TGGGCAAGAT
AGAGCCACGT GCAGCGAATA TGACTTCAGC AGCAATCCGC TCCTCCATAC TCCGAAAGAA
GGACAAGCAA CAAAAGGAAA GAGTCTTCCA ATTCCCACTG TGTATGTCGC AAGGCCAGGA
AATGGTACTG TTGGGTGTAA TGTTCATCAA AGTGAAGACA CCCTTCCGCA TGGTTTACAA
GAAAGTCGCG TAGTGGATCG GGCTCGTTCC ACAGAGAGAT GGTACAGCGG GGCACAACCT
AGTGGATTTG TTAGCCTTCC TTTCACTCAT GAGCAGTCCT CTACAAACCG AGTCTGCAAT
AAGCGAGCCA CGTCCCTAGG ACAAGCTGCC GGCGTTGTTC CACCGCGTAT TATGCCGCCA
ATTCTTCAAC ATGACGACTT CAAAATGGTG CCGCAATCCC GCAAGAATCG CTCGTCGACA
CAACGAGAAC AACACATAGC AATGGGCCGT TATGGGTCTG CTCATGCATC AGGGAGTCAA
AGCCGTAGTA AAATGGAGCC CGCAAAAGGC TTCTGTAGAC ATGATGAAAA GGTGGAGTCA
AAGCATCTCA TAAAAGTACT GTCATCTTCC GGAAGTGAAT TTCCGACGAA AGGAGCAAGA
TCTATATTGT CAGCAAAAGG CAGGTCGTTT CGCCAATTAA TATTTGATTT GGATGATGAT
GAAGTAGATC TGCTGCGGGA ACTGGCTAAA AATCCCTAA
 
Protein sequence
MDWGALDEES CNCDESPSHK NRDTKDYFST SSERRVSGPG NSQLSKQTSG QACADSKLKK 
QPKQTGKWKK PPGMPKRPLS AYNLFFAHER KQLIACGVLA SGRQKKHYTK QSTTLRRPPR
KMGFAGLARA VAAKWKMIDD STRHTFNQQA ECEQAKYKIA IKQWNSQHLD LSETNVVNHV
GQDRATCSEY DFSSNPLLHT PKEGQATKGK SLPIPTVYVA RPGNGTVGCN VHQSEDTLPH
GLQESRVVDR ARSTERWYSG AQPSGFVSLP FTHEQSSTNR VCNKRATSLG QAAGVVPPRI
MPPILQHDDF KMVPQSRKNR SSTQREQHIA MGRYGSAHAS GSQSRSKMEP AKGFCRHDEK
VESKHLIKVL SSSGSEFPTK GARSILSAKG RSFRQLIFDL DDDEVDLLRE LAKNP