Gene PHATRDRAFT_44398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44398 
Symbol 
ID7198058 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp428798 
End bp431776 
Gene Length2979 bp 
Protein Length760 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178507 
Protein GI219115423 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.160536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGATCGCCAC CGTCTACTTT ACTCGCTTAT CACTCACAGT AAAATACCAT TGGCGGCCTA 
CAAAAGACCT TCGATCACCA TGACACTTAC TGTTTGCGTC GGAAGCTCCG GTTCCGGAAA
AACAACCTTC TTGGAAGACG TTTATAAGAG TCACAAATGT ATCTATATTC GTCAGTACCA
TATCATGCGT CCGTACATAA CGGTTTCCAA GATCCCCAAC TTCGATGCTA CCCGACTTCC
CTATTGGGAC ATTTACGTCA AGGAAGAGAA GGCTGAAAAA ATCCAAGTCG GCGGTACTAT
GGCCGGAGAA TTCACGGCTG GACTTTCGGG CGGGCAGCGC AAGTTGCTTC TCTTTGAATT
AATTTGCCAG CGTACGGCCT CACAGTCTGA GCTTTTGGTT GTCCTTGACG AACCCTTTGC
GGGAGTCACA GACGATTTTG TTCCGTTTAT TGTTGAGCGC CTGAATGAGC TTCGTCAAAA
GCACAACGTG CTGCTGGTAA CCAATGATCA CGTGACGACT CTCACCACTA TGGCCGACAA
CAAGATCACA GTCTCTGCGA TTGATCGTTC CACCGTTCGC ATCAACGATC GCGAAAAGGT
TGACCGCGAA AAAGCCATCA TGGCACTTTC CGTCGGAGAC GCGTACTCAT ACCAGGCTAC
CAACGCTGAT CTGAAGTTCT TTTACGATGT AGAAATACAT TCCAGCAGCG CCTTGATTGG
TATCGCCTGC TTTACCATTT TTTGCTACAG TCTCTTTATA GCTACATTCT GGGATTCCGA
AGAAAGCAGT CAAGCGCTAG TGCTGGTTGC CGGAGGTATC ATTTCGTACT TCTGCGTCAA
TCCATATTTG CTAAGTCTCG TTGACTGGCG GAATGCCCAA AATGAAGAAG CGTATGTATA
CTTTGGATGC CGGCAATATC AACAAGTGTT CCTCTGCAGA TCAGTCCCTC GCTCACTTCT
CATTCTTATC ACAGTGAGGC TCTAGTCCAT GCTTCGAAAA CAATGAACAA GACTCTCAAA
ACACTCTTAA CGTTTTCGCT TATACTTATC ATTTCCTTGA TTGAGTTCGG AGTCGTCAAT
GCTACTATCG ATGGGCTCTC TGAGATTAAA TTCTGGGTCG CAATGCTTTT CGATAGTGCT
TCCTTAACGT TTACTTTGAT TTGCTTGGGG CTCTACACCA ATATGCCATT TCAAGCGGTT
CAAGTTGTCG GAAGCTTGCC ATTTTTGCTG ATGATCTTTC TTTCCACAAC GTGAGTGTGC
TATGAACATA TACATATTCC TTTGTTCTTT TGCTGACACA CACCGTGATC GGTTTGAATT
GCGCAGGTTT TCTCCAGGGT CAGGTGTTCC TGTCCTGAAG GAACTTCGCT ACTTGTATGC
GCGATTCTAT TTCTGGTGTA TGGTTCCTGC TGTACAAGAC ACAATGGAAA ATTGTCCGTC
CGACAATGTG ATTCTTGTGT ACGTGATCTT GAGCGGATGC TTGGGTGTTT TTATTTTCCT
TGTAGTTATG GCAATCCTTA AAATCAAGAG GGGAATCCAG AAGGATAAGG CTGAGACGAA
GCGTGCAGGA CTCCGTGATG ATGAGTTTAC GGAGCTTCAA GTCGAATTAT ACGGGACAAA
GGCTTTACAT CGTCTGATGC ACATGAACAG CAGCCTTTCG CTCAAGAAAC CTGCTTCGAA
TGGGACCATC AAAGAAGCGG TATAGCTGAA GTGAGGTGCT AGCTGGACGG GAGTGACAGT
GAGATGCTGT TTTCGCACAT TCGTTTATAT TAGAATTTCA TAAGGGTAAC ATCGACATAG
CTAAGAATAA GTCAAGACTT TTATTGATAA CAGCTGTTTA TTTCTTCAAG TAGATCGGAA
AGTCAGTAGG TAGGGCTTTC ATTTTGATCC TTTTATGGTG CAAACGGCTG ACTGTGAGTA
GTGCGTTCAA GCACGCAGCT CCGTCTCGTG TGACAGGTGA CGCTCAAGGA GCTGATAAAG
TGTTTTCTCT TGATGTCACC CTTGTATGTA ACTAGAAACA CACGAATCGG GCTTCGGTGA
GCCCGCAAAA TCCATAATCC AGACGAAACA GAGATCCTCG TTTCAGGCAA TAATATTCCC
AAGAAATCAA ACCGAGTTTC TGTGTTCTTC GTGAAATAAC AGCGCTCTGT GCTCTTAAGC
AAACGATCGG CGAAGATGCG ATGCTTGATG GTTGGCAGCA GCCTCATCGC CTTGCTGGTG
GATAATGCTG CTGCCCTCAA CATCGTTCTA CCCGGAGGAA CAGGGTCTAT CGGTAGTAGG
CTCTCGGCAA AGCTGATGGA TCACACGGTT ACGATTCTAA CACGGAATGC ATTTTTGGCA
GCCGCTCCCA ATCGAGTGAC AGAACAGTTT GGGTGGGTCG GATCCAGCTT CTTACGGAAA
AATCCGCATG TTAATCTGCG CGACTGGGAC GGTGGCGACC TGCTCGATAT TGTCGGCCAA
GACTGGATTG GATGGCAGGA AGAAGCATTA TTAGATGCAG ATGTGGTCGT ACACTTTGTG
GGAGGGTTCA CGGAACAACG TACAATGGCT TGTGAAAGAC TGGTACGAGA ATCGATGAGA
GTGAACAAAG ACGCTTTGCA GATTACAGTC AATCCTCTAG ACGAAGAGAT TGGCGTTATT
TCCGTCGGCG CTGTAACGCA GAAAAAAGAA CGCATCCGTG CCTGCGAAGA AATGGTCAAA
ATGAATTGTG TTCACTCAAT GTGCTTACGC ATCGAGTGCT ATCGCGAAGA TGAAGGATGT
GAAAAGATTA AATCTACTAT TGTCGACTGG GCGAAACGTC AGGGGAACAA GTAATTCTTG
TATTCGTGAT CTTACACTTA GATTGACCAA TTTATTTTCC AATTACTGGA AAATTTCGGG
CTAATACTTG TGACCGTGCG TTGAGGGCTG CCGAACACTG CATTGACTGT GGATCAGAAT
AGTAAACGAG GTCAAATCCA TCAAACATCA ATTCTTTGC
 
Protein sequence
MTLTVCVGSS GSGKTTFLED VYKSHKCIYI RQYHIMRPYI TVSKIPNFDA TRLPYWDIYV 
KEEKAEKIQV GGTMAGEFTA GLSGGQRKLL LFELICQRTA SQSELLVVLD EPFAGVTDDF
VPFIVERLNE LRQKHNVLLV TNDHVTTLTT MADNKITVSA IDRSTVRIND REKVDREKAI
MALSVGDAYS YQATNADLKF FYDVEIHSSS ALIGIACFTI FCYSLFIATF WDSEESSQAL
VLVAGGIISY FCVNPYLLSL VDWRNAQNEE AEALVHASKT MNKTLKTLLT FSLILIISLI
EFGVVNATID GLSEIKFWVA MLFDSASLTF TLICLGLYTN MPFQAVQVVG SLPFLLMIFL
STTFSPGSGV PVLKELRYLY ARFYFWCMVP AVQDTMENCP SDNVILVYVI LSGCLGVFIF
LVVMAILKIK RGIQKDKAET KRAGLRDDEF TELQVELYGT KALHRLMHMN SSLSLKKPAS
NGTIKEAIGK SVGRAFILIL LWCKRLTVSS AFKHAAPSRV TGDAQGADKV FSLDVTLRSV
LLSKRSAKMR CLMVGSSLIA LLVDNAAALN IVLPGGTGSI GSRLSAKLMD HTVTILTRNA
FLAAAPNRVT EQFGWVGSSF LRKNPHVNLR DWDGGDLLDI VGQDWIGWQE EALLDADVVV
HFVGGFTEQR TMACERLVRE SMRVNKDALQ ITVNPLDEEI GVISVGAVTQ KKERIRACEE
MVKMNCVHSM CLRIECYRED EGCEKIKSTI VDWAKRQGNK