Gene PHATRDRAFT_45621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45621 
Symbol 
ID7200392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp742342 
End bp743622 
Gene Length1281 bp 
Protein Length426 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179706 
Protein GI219117837 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.690367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCCA TTATTAGTGA CGAGTCCTTC CAACTTGTTC GCGCGACGGC TCCTGTCGTC 
GCCGAGCATA TTGAGGAGAT TACGGGTACG TTTTATCCCA AAATGCTTGG TCGCCATCCG
GAGTTGTACC AATTTTTCAA CGAATCCAAC CAACGCGCGG TCCCCGGTCT CTGCCCCGCC
GCTAGCGGGG TAGTAACCAC CCGCCAGTCC AAGACTCTAG GAGATGCCGT AGTGCAGTAT
GCTCTCAACA TTGATAAGTT GGAAAACTTG AACGAGGCGG TACTTCGAAT TGCCCACAAG
CACTGCGCAT TGGGCGTGAA GGCCGAGCAC TATCAGATTG TCCATGACAA CCTCATGGAA
GCGATTGGCG AAGTTTTGGG TAGTGCGGTG ACACCGGAAG TCGCAGCCGC GTGGAGTGAA
GCTGTCATGG CTTTAGGGAA GATATTTATC GAGCAAGAGC AGAAATTGTA CAACGAAGCC
GAAAAAGTAC AGTGGTCGGG ACCGAAAGAA TTCATTATCA CGGATATTAT TGATGAGACC
CCCGTTGTGA AGTCATTCCG TATGAAGAGC AAGGATGGGC AGAAGGTCTG CCCCTTCAAA
CCGGGACAGT ACCTTAGCAT TTACGAGCAA CCCAACAACA AGAAATATTT TGCTCCTCGT
CACTATACGA TTACTAGCCA GCCAGAAGAT GATTTCTACC AGATTACCAT CAAGAAACTC
ATTGACCCAG CTGTTCCGGA TGACCGCACT CACGACGGTA TCCTCAGCCA CTACTTGCAT
TCCAAGAACG TCAACGATGT CATCAAGCTT GGTCCCATCT TTGGTCCGGA GGTTTTACTG
CAGGGGGAAA AATCCCGCGT TGCTGCTTTC ATCAGTGTGG GCATTGGCAT CACACCAACA
ATGGGAATAC TCCCGACTGC CGTCAAGGAA CGTCCTCGTA CTGCCGTCTT CCATGGTGAC
GTTAACGGCT CAAATCACGT TTCTCGCGAA GCTTTGGAAG AGTTTGGCAA CGAGCAAAGC
CTGTTTTCAT ACTCTTACTT CAATCCCGAT GAAGCTGATA CAAAGCTGCA GCACTATTCG
GAAGGTCTCT TAACGGGAAG CAAAATTGTC GATAAGTTGA AGGATGCTGG TGTTAATTTT
GCGACAGGGA CAGACTATTT CATCTGTGCT GGCCCCACAG TTGCACCAAT TCTGGTCAAC
GAGTTACGTG AATTGGGTGT AGACAAGAAG CTTCTACATT TGGAGTTTTT TGGCCCGTTT
GTCTCTCTGA TTGAGGAATA G
 
Protein sequence
MSSIISDESF QLVRATAPVV AEHIEEITGT FYPKMLGRHP ELYQFFNESN QRAVPGLCPA 
ASGVVTTRQS KTLGDAVVQY ALNIDKLENL NEAVLRIAHK HCALGVKAEH YQIVHDNLME
AIGEVLGSAV TPEVAAAWSE AVMALGKIFI EQEQKLYNEA EKVQWSGPKE FIITDIIDET
PVVKSFRMKS KDGQKVCPFK PGQYLSIYEQ PNNKKYFAPR HYTITSQPED DFYQITIKKL
IDPAVPDDRT HDGILSHYLH SKNVNDVIKL GPIFGPEVLL QGEKSRVAAF ISVGIGITPT
MGILPTAVKE RPRTAVFHGD VNGSNHVSRE ALEEFGNEQS LFSYSYFNPD EADTKLQHYS
EGLLTGSKIV DKLKDAGVNF ATGTDYFICA GPTVAPILVN ELRELGVDKK LLHLEFFGPF
VSLIEE