Gene PHATRDRAFT_40022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40022 
Symbol 
ID7195496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp618369 
End bp619814 
Gene Length1446 bp 
Protein Length481 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184032 
Protein GI219127624 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGGTA CTGAACACGT TGTGATTAAG CAGTCGGTGA AGATCGAAAA CATCACACGC 
AGTGGACTTC AGTATGGCAC AGCGACCGGA CAAAATCATT CGAGACACAG CAAAACAGCA
GAATCGCCTT TTCAGTCTTT GCAAACGTCA CTAGAATCTC AGCGACTTAC ACCTATCGCT
ACCAGCTCAA CCGTTAGGGG CAATCGCTCC GTCACTGATC TTCCATACGC AGATGCTCGC
GACGAAGATG GTTCCTGGGG GTACATCGCC GACGCAACCC AGGTGAGGAG TCGAGTCCTG
GCGCTTCTAC CCTCAAACCA TACACTGCAC AACAATGTCA CCAGTTTCAT ACCCATGACG
GAATCTGAAC AAGAAGAAAT ATGCCAAAAG CCACCCGGAA GCGGACCGGA GCAAGAATTG
GGCTGGAAAC TGATGCAGCG TGTTGTCGTC AATGCGCCCG AGCCGAGGTA CGCCAACGAG
TCTGCAGTCA TTGTCGCCAA CGAATCTGCA GTCATCGTCA CCAACGAGTC TCCAGTCATC
GTCACCAACG AGTCTCCAGT CATCGTCACC AACTCGTCCT CCATCTCAGT AAGCCACCAC
ACAGAAACAG TAGCACCCAA AATTCTTTGT GTCGTCTACA CGTATGATGC TCATCACGAT
CGAGTTGCGG CGATTGGTGA TACCTGGGGT TGGCGCTGTG ACGGCTTTTT GGCCGCCTCC
AACCGAACTA TTCCGGAGCT TGGGGCTGTA GATTTGCCCC ACGTTGGACC CGAAGCTTAC
GGCAATATGT GGCAAAAGAC GCGTTCTATA TTGGCGTACG TGCACGAACA CTATATTGCG
GAGTACGATT ATGTGCATGT GGCAGGAGAC GACACGTACG TGATTGTGGA AAATTTGAGA
AATTACTTGG AGTTTACGGT AGAGGCAAAA CATGGTCGAG ACAAAATACC ATTATACTTG
GGTCAGAGAA TTGTTGCGGG GGCTGGTTTC GCATTTGTTT GTGGAGGAGG GGGTCACATT
TTGAACCGAC TGGCTTTGGA CCGTTTCGTC AAAGAAGCAC TGCCAACGTG TGAGGCCGAC
AGAGAAGACC CTGCCGAAGA TCGCTGGCTA GGATATTGCT TGAGAGAATT GGGTATTCAT
CACACGGACA CAGTTGATGG TTTCAATCGA CAACGATTTC ACAGTTTCGA TCCATACGAT
TTGGCTTCAA GGAATCCGCA GAGAGGCTTC TGGAAACGGC AGTACAAATT GTGGGGAGAG
ATGTACGGCC TCAAGTGGGG CATTGACTTA GTTTCGACAC AAACCATAAC GTTTCATATC
ATAAGGGGGG CGACTTGGAT GAAGCGGGTG CACGCTCTAC TTTATTGTAC ATGTCCTGTG
GGTACAGTAA TGGGCAATAT CCTCTCGCAG GTAATGGATG CAAGTGTAAG CAATAGACGT
ATATAA
 
Protein sequence
MDGTEHVVIK QSVKIENITR SGLQYGTATG QNHSRHSKTA ESPFQSLQTS LESQRLTPIA 
TSSTVRGNRS VTDLPYADAR DEDGSWGYIA DATQVRSRVL ALLPSNHTLH NNVTSFIPMT
ESEQEEICQK PPGSGPEQEL GWKLMQRVVV NAPEPRYANE SAVIVANESA VIVTNESPVI
VTNESPVIVT NSSSISVSHH TETVAPKILC VVYTYDAHHD RVAAIGDTWG WRCDGFLAAS
NRTIPELGAV DLPHVGPEAY GNMWQKTRSI LAYVHEHYIA EYDYVHVAGD DTYVIVENLR
NYLEFTVEAK HGRDKIPLYL GQRIVAGAGF AFVCGGGGHI LNRLALDRFV KEALPTCEAD
REDPAEDRWL GYCLRELGIH HTDTVDGFNR QRFHSFDPYD LASRNPQRGF WKRQYKLWGE
MYGLKWGIDL VSTQTITFHI IRGATWMKRV HALLYCTCPV GTVMGNILSQ VMDASVSNRR
I