Gene PHATRDRAFT_22332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_22332 
Symbol 
ID7203383 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp740982 
End bp742816 
Gene Length1835 bp 
Protein Length579 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182753 
Protein GI219124945 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0126456 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATTGTGGCTC CGTCATTTTT CTTTCTCATA CTGGGAGCGC TCATGACAAG TGTTATCCCG 
CATTACTACG CCATATGCAT CCAGCTTGTT GCAACATTAG ACGGCACAAG GGCAAAGGTA
TTCCAAGCTT TGACCGGGCT TGTCGTTTCG AGTACACTGG GTGCGTTGTT CACTGGTTGT
CGAGGAAGCT TATTTTGGAT CGCGGGAAGT CGAGCGAACT ACAATATTCG GGTCAAGCTA
CATCGTAGTC TTCTATTGCA GGAAGCTGCT TTCTTTGACT CCAATGAGAC CGGATATTTA
CTCAGCCGTC TGAATAGCGA TGTAAACAAG ATTGGAATGG TGATATCTTA TCATGTCAAC
GTCGTCTGTC GTCAGTTGGC ACAATTTATC TTTGGGAGTG CGTACCTCAT CAGAATTTCT
CCAAAGTTGT CTCTTTGGAC TTTTGCTGGA ATTGGTCTCG TTGCCTGGCT CTCGACTATC
TATGGTGCTT TCAATCGTGT GTTGGCGCAG CGTGTTCAGG ACACCTTTGC GGACGCGACT
GCCGTGGCGG AGACTTCGTT CTCCATGTCG GAAACGATTC GTGCCTTTGA CGGAGTGGCC
GTTGAGTCAA ATAAGTACGA GACGGCGCAG AGCAAAGCGT TAGATCTTGA AGAAACGCAA
GCATGGGGAT ACGGAACGCA CAAATTTGTC TCAGACACTT TGCAAGGAAT TTTGCAAGTA
CTCCTACTTT TTGCTTGTTG GAGTATTGGT CGAACAGGAG GCTTACCCGC TGCTCAATTG
ACGACCTTTA TGTTCTACAC CAATTTTGTG CTCGAGTCAT CCAATGAAGT CGGTGATCAA
TGGGCAAAAA TTCAAGGCGC GATCGGGGCG AGTACCTCTG TATTTGATTT GATTCGAAGA
GTACCTTCTG TACGAGATCC GCCAATGAAG CTGGCGTTGC CCTCCGTCGA TCATTCTGTA
AAACATTTAA ATGGCTCAGA ATTGACGCCA ATTATCAACA TTCACAACAT GACACTAAAG
TACAGTGCAA TGGACTTACC CGCGCTGGAC TGTATCGACC TAAAAATCGA CGAAGGCGAC
CGAGTTGCAA TCGTTGGTCG AAGTGGTAGT GGAAAGTCTT CCATGCTCCG CGCTATACTC
AGGTTCTACG ATCCAACTTT TGGATCGATT CAACTGGAAA GAACTCTTCT GACGGAAATG
TCAAGGAAGG ATATTGCCTC CAAGGTCTCC ATCGTTTCAC AAGAGCCGAG TTTGTTTCCT
ATGAGTCTAA TGGAGAATAT CTTGTACGGT ATTGAAAAGG ACGCCGTTGA TCCGAAGACT
GGTGAGCAAT GCTACAGTGA TGCTTATCGG GAGAGGACAT CCAAATCGTT GGAGCTTGCT
GGTCTCCCAG TGCAGCCGGG GAATGATTTA AACCTTTCAC TTGATACAAG GGTGGGAGAT
GGTGGACGTT CCCTTTCCGG CGGTCAGCGT CAACGGGTTG CTATCGCACG TGCCTTGATT
CGGCATCCCG AAGTACTTTT GTTGGACGAA CCGACAGCCG CCTTAGATTC CCAATCTGAA
AGGGCGGTGG TTGAAGCCTT GCTGCGAGCG ATGGAGCGCT CAAAGAGTAT GGTGATGGTG
ACACACCGAC TAGGTGTGGT TCGCTCATTG AATGTGAATC GAGTAGTGGT TATGGAAAAA
GGGAGGATTG TGGAAACGGG TCATCCCGAG GAATTGCTGT GCAAAGAGAA TGGCTGGTTC
GCTAATCTGG CCCGAGAACA AGGGATTGTG TCCGCTCATA AGTCGACCAC AGCAGTAGAA
GAATACCCTT AAAAAACTTC TTCTCTACAA ACTCT
 
Protein sequence
MTSVIPHYYA ICIQLVATLD GTRAKVFQAL TGLVVSSTLG ALFTGCRGSL FWIAGSRANY 
NIRVKLHRSL LLQEAAFFDS NETGYLLSRL NSDVNKIGMV ISYHVNVVCR QLAQFIFGSA
YLIRISPKLS LWTFAGIGLV AWLSTIYGAF NRVLAQRVQD TFADATAVAE TSFSMSETIR
AFDGVAVESN KYETAQSKAL DLEETQAWGY GTHKFVSDTL QGILQVLLLF ACWSIGRTGG
LPAAQLTTFM FYTNFVLESS NEVGDQWAKI QGAIGASTSV FDLIRRLALP SVDHSVKHLN
GSELTPIINI HNMTLKYSAM DLPALDCIDL KIDEGDRVAI VGRSGSGKSS MLRAILRFYD
PTFGSIQLER TLLTEMSRKD IASKVSIVSQ EPSLFPMSLM ENILYGIEKD AVDPKTGEQC
YSDAYRERTS KSLELAGLPV QPGNDLNLSL DTRVGDGGRS LSGGQRQRVA IARALIRHPE
VLLLDEPTAA LDSQSERAVV EALLRAMERS KSMVMVTHRL GVVRSLNVNR VVVMEKGRIV
ETGHPEELLC KENGWFANLA REQGIVSAHK STTAVEEYP