Gene PHATRDRAFT_49942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49942 
Symbol 
ID7198543 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp366871 
End bp368549 
Gene Length1679 bp 
Protein Length446 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184790 
Protein GI219129215 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGATTCGGGT CGGACAAGGG GAGAAGATGA CCCCCGTTGG CTTGGGTATT TTGTACATAT 
ATACACACAC TATACACACA CACACACGTG TATACACACT ACACACACGC CATGCGTGAG
GATCCTTTAG AGTCGAGCAC ACGGTCGGTG GTTTCGGTAC GTTCGCGGGA ACGACGGAAA
CCGGGGTTGA CCAACGCTCG GAATCGGACG TCCCGTTGCA TGACGGGTCT CCTCGTCGGA
CTCGCCGGAC TAATGCTTCT TTCTCTGACA CAAATGGTAC GCTTACACCG GGAACCGTTG
GTAGGCATGG GGGGCCTTTC CTCTCTGCCT TGGAGCAACG GTGAATGGAA GCTAGGGCAA
CAACAACAAC AACAACAACA ACAACAACAA CGCCATCGAA TCGGCGACGA CAATCACACG
GCACACAACA ACGACCAAAC GGCACTGCGC CTCCCGCGTC TGGCCTCTCC CAACACGACC
ACGAATCACA GCGATACGAT GATCGTTCTG GATCAGCAGC TATCCTTTCC GGAAGACAAG
CGTGTCATTC TGGAGGTACT CCGCGCGGCT GGCGTCACCT CGATCCAGGC CCCGTCCTGG
GAACTCCTCC CCGATACCCG CACACTCACC ACACTCTACG GTCCCATTGA TCAACCCATT
GTCCTAGGTC TTGAAACATG TCAAGCCTTC CGCGACGCCA TACCCCAGGC GGAACGGTAC
GTCGCACCCG CCGGAATGTT CAACACCGGA ACCAACGCCT TGGAACGACA CTTGACCAAC
AACGTTTTCG GTGTACAAAA AGCCTGGCAA GTTCCGTGGG GCAAACACCG GACCGAAAGC
AAACGATTGC ATCACGCCGC CGTGTCCCTG GAAGGAATCA ATCAAACGGC AGTCTTGCCT
ATTGTCGTTA TACGGGATCC ATTCTTTTGG ATGCAAAGGT GCGTACAAGT GGTCACGGGG
AATCGGCCCG GGTGCGAGGC GACTACCTGT GCACCGCCGA CATGCAAAGC GCGTTCCTCA
CACACAGCCA TTATGTTCAA ACATTCGGAA TGGACATACT ACTTGACAAA TTAGCATGGT
ACGTGGTTAA TGTCGTCTTG GCCATGGTCT GCTCCATTCC CATACCATAC CGCCCACACG
AATCCTTCTG GTTTACTCTC CCGGTACAGT GCAAACATCC CTACGCCGCC AAATGGGCCC
GCGGCGCGCA ACGCTGTCCG GGTTTAAAAA CGCTGCCTCG AGATTTCAGA CGCTTTAGCG
CCGATAAGCT GCCCCGCAAT CAAACCAGCT TTCGCGTCAA GGTCATTTTT AGCGCCACCG
ATGTACAATT TTGGGATTCG CTCGTACATT TATGGAGTCG ATGGTACCGC GATTACTGGG
AGGCACCCTA CCCACGTTTG ATGATCCGTT TCGAAGACTT GCTGCTGCAT TCCGACGACA
TTGTCCAAAG CATTGCGGAA TGCGTGGGCG GTACCGCCAA CCGCAACCAC GTGGTGGAAA
CGGGGACGAG CAAGAACCAC GGTAGCGGCG CGGACTTTGT CAAGGCCGTG ATCAAAACGG
GAGACTTGGG AATGCGGCTC AAACACTTGA CCCAACCCGA TCTTCACTAC GCGACCGAAC
ACTTGGATGC CGAACTGATG CAAGCATTTC GATACAATCT TTCCACAGTC AATAGATAG
 
Protein sequence
MREDPLESST RSVVSVRSRE RRKPGLTNAR NRTSRCMTGL LVGLAGLMLL SLTQMVRLHR 
EPLVGMGGLS SLPWSNGEWK LGQQQQQQQQ QQQRHRIGDD NHTAHNNDQT ALRLPRLASP
NTTTNHSDTM IVLDQQLSFP EDKRVILEVL RAAGVTSIQA PSWELLPDTR TLTTLYGPID
QPIVLGLETC QAFRDAIPQA ERYVAPAGMF NTGTNALERH LTNNVFGVQK AWQVPWGKHR
TESKRLHHAA VSLEGINQTA VLPIVVIRDP FFWMQSMCKH PYAAKWARGA QRCPGLKTLP
RDFRRFSADK LPRNQTSFRV KVIFSATDVQ FWDSLVHLWS RWYRDYWEAP YPRLMIRFED
LLLHSDDIVQ SIAECVGGTA NRNHVVETGT SKNHGSGADF VKAVIKTGDL GMRLKHLTQP
DLHYATEHLD AELMQAFRYN LSTVNR