Gene PHATRDRAFT_40362 details

Gene Information

Locus tag: PHATRDRAFT_40362
Symbol:
ID: 7198279
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 211951
End bp: 213636
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002184322
Protein GI: 219128233
COG category:
COG ID:
TIGRFAM ID:
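
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. Both assumptions are inferred from the numbers in this record, not stated by it. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(211951, 213636)
print(length)                     # 1686
print(protein_length_aa(length))  # 561
```

Under these assumptions the results match the reported Gene Length (1686 bp) and Protein Length (561 aa).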


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0281204
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTACTT ATTTCGTTGC CTTTGCGTTC ATCATCTGGA CTTTGGCGTT ATGGCGACAC 
GTCTCGCGTC GGGCGGCATT CATCAGCGAC CCAGACCGGC GTCTCCCGTC CCCTCTCTGG
CACGTCCCCG TTGCTTCGTC CCACGGGGAC CACAACGACC CAATGACTAC ACCCACCAAC
GTCAGTCGTA CGGTCGAGTC GTTGCTCTCG TTGCGCGTCG GAGACTTGCC TGGCTACACC
GGATGGGCCC GGCCCGTCGG AACACTGTCA CCGTACTTTC GACAAGTATC CGACGCCCAC
GGCCGTCCCA CTCGAACCAT CGTTACGGTG GGACGCGTCT GGACCGTACG AGTGGCGTGT
ACGGGACACG TACGGTGTTT CCGTCCCCAA CGGGCCATTC TGGACGTACG TGCGTACGGG
CCCAGTGTCG TGGCCGGAAT GGTCGTGCCG TCCCGTGTCC GCCGGGCCTA CGACCCATCC
AACACCACCG CCAGTCACGA CACGATTCCG GGATATTACG ATGTGCAAAT TGTCTTTCCC
GATGCCGGAG TCTACCACGT CGAAGTAGTC CTGGCCTTTT CCAACGCACC GGCGTGGAAC
GAATTTCCAC TGGCTGGACC CGAACCCGAC TACGAAGGCT ACCTTCTACC AGATTTCCCC
TTAACCGTGG TTGTCGATCC CGTAGAACTA GTCAGCGAAG ATCCTACGAC GCACGCGAGT
CAAGACAATC GACCAGTATG TACAAACGCA GACTTGCTCG AAACGTCGCC AACCAGTGCA
ATAATCAAAG GTCGTTGGAG AGTGTCCGAC AAGGTTCAGG ACCGACGCTT GGTCGAAGAT
GCGTACGAGA TTGATCACAG TCCATCCAAC ATTAGCCGAA CAGGATATGA GCAGGGCGTC
AATTCGCTGG GTATTACTAT GGCCTTTGAA TACCAAAAGT GTAAACTCGC GACTGTTCAC
CAAAGTAAGC GGATCTTGAT GGTGCCGAAA ACGAAGGATT GGTACATTCT GTTCGTTGGT
GACTCCAACA TGCGGGCACA GCATTTGACC TTTCAATCTT GGCACGGTCG CGATCGGAAT
CAATATCCCC AAAGTGGCTA CATTTCGACT GCCAAGGGTC TCGCAAAGCA GCTACCCCAA
ATTAGAGAAA AACTGAGTGC TTTGAAGAAG CACGCCACTA ATCGGACCGA GTTTTACGTC
CTGTTCAATG CCGGCTTGCA CGACATTGCG CGCCTCTGTA GTCGAAAGTG GTCCCATGAG
AGATTTAACA ACGGTGACAC CCGACCTTGC GTGGAACAGT ATCGGCAGCA TCTAGGCGAG
CTGATCAATG GCATCAAGGA CCTCTCACCC AAGCTTGCCG TTCTGCAAAC AACCATTGCT
GGTTGGCCCA AATGGGGCAA TTTTGGTTTT GCGTGGCCGC CATCTCAAGG ACAACCATTG
CCCTTCCACT CGTCCACATG CCAATCCTTT AATGAGATTG CCTGGGAAGA GGCAACGAAA
GCGGATATCT CTGTGATGGA TGCGTATTGG TTGACCGTGC CTCGGCCAGA TCATCGACAA
GTCGATAACG AAAATGCGAT TGGAAAGCAC ATGGTACACG TTGGACCAGA GATACACAGC
GTTATGCAGC GCAAGTGGAT TTCCTTGATT GAGATTGCCT TGGGAAACGA CAACCCAGTC
ATGTGA
 
Protein sequence
MSTYFVAFAF IIWTLALWRH VSRRAAFISD PDRRLPSPLW HVPVASSHGD HNDPMTTPTN 
VSRTVESLLS LRVGDLPGYT GWARPVGTLS PYFRQVSDAH GRPTRTIVTV GRVWTVRVAC
TGHVRCFRPQ RAILDVRAYG PSVVAGMVVP SRVRRAYDPS NTTASHDTIP GYYDVQIVFP
DAGVYHVEVV LAFSNAPAWN EFPLAGPEPD YEGYLLPDFP LTVVVDPVEL VSEDPTTHAS
QDNRPVCTNA DLLETSPTSA IIKGRWRVSD KVQDRRLVED AYEIDHSPSN ISRTGYEQGV
NSLGITMAFE YQKCKLATVH QSKRILMVPK TKDWYILFVG DSNMRAQHLT FQSWHGRDRN
QYPQSGYIST AKGLAKQLPQ IREKLSALKK HATNRTEFYV LFNAGLHDIA RLCSRKWSHE
RFNNGDTRPC VEQYRQHLGE LINGIKDLSP KLAVLQTTIA GWPKWGNFGF AWPPSQGQPL
PFHSSTCQSF NEIAWEEATK ADISVMDAYW LTVPRPDHRQ VDNENAIGKH MVHVGPEIHS
VMQRKWISLI EIALGNDNPV M