Gene PHATRDRAFT_44310 details


Gene Information

Locus tag: PHATRDRAFT_44310
Symbol:
ID: 7197971
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 197430
End bp: 200838
Gene Length: 3409 bp
Protein Length: 974 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002178179
Protein GI: 219114767
COG category:
COG ID:
TIGRFAM ID:
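
The start and end coordinates in the Gene Information table agree with the listed gene length when both endpoints are counted. The short Python sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive, which this record does not state explicitly, and `span_length` is a hypothetical helper name chosen for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
gene_length = span_length(197430, 200838)
print(gene_length)  # 3409
```

The result matches the "Gene Length" entry, so the inclusive-coordinate assumption is at least consistent with this record.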


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGTTTACATT ACTTCCAAAA ATATACCTGA CAGTGACTGA AAGTTTTATC CAAACGAAGG 
GTACAAGTTT ATCTTACGTT GTATCTAACC TTCAAAATCC TCATAAGTTT GAGGTGCAAG
CTCTCCTCGA CTTACGTTGT ATCTAATCTT CAAAATCCTC ATAAGTTTGA GGTGCAAACT
CTCCTCGACT TACATCACAC TACACGATAA GCACAAACAT GAAGGACGGC AGCAACAGTC
GAAATGATTC AAGCACGGAG GGTCGTTCCG TTTTTCGGAA TCTAGTTCCA GATCGCCATG
ATGAACATGA TCCGTCTCCT TATGTACACG AACAACGCAC CACCGCTCCA CCATTGGTGT
ACGATGCGCC CTCCTTGTTG AATGAATCCC AAGCCGAAAC GTCCAACTTT GCAGCGATTC
ACTCGAACTA CAACGACAAC AGCACAAGCA GCACCCGCAC ACCCGCTGTC GGATCTCTTC
GGGATGCCGC GCGGCGTGTT TCCATGCTCA ATCGGACTGG ATTGTTATAC AAACATGTAC
TTTCCTCTTC TCACCAGGCC CATCAACGCG TCTCGTCCCG TGCACAAGAT CTGTTGTCAG
CTATTCAAGA AGGTGGACAT CTCAGCAATG ACGAGCATGG CGATGACAAT GTTCTTGCCG
ACGATATGTC AAGGTCCGAT TTGACGGACG AAGAAGAGCA ATTGCTCGGC ACCGCCAGCG
ATGCGGCCAA CAATACGAAC GACACACTTC CGACTCGTCG TTCGGAGTAC GGCTCCGTTC
TCATGCAGCG ACAAGAATCA ACATCCCAAA AGTCTGTAGC ATTCCGGCGC ATGCGGTCCT
GGCGTCGCAA GGTGGCAGTA CTTTTTCACC CGACCCGCAT GCTGCGAATT GCTTGGGAAA
GCTTTCTTTT CCGTCTAGGC TTGCCCTTTT GCCTAGCAGC GTGGTTGCTG TATTATCCAC
TCGGAAACCC CGAAGTAGAC GCCTTACCTG GCAAGACCCG GCTCTCTTGG TGGTTCAATT
TTGCCGGTCG ACAAGTGATT ACTCTAGAAT TGGCCCGACT CACGCAATGG CTTCTATTGG
ACCTATGGCT GTGTAGAAGT GGTAGTAGTG CTAGGGAACG ACCGTGGTGG AAAGTTGATC
CATTGGTGAC ACTCTGGAGC ATTCACAGCC GCGGATGGCC GTTCCTCATT TCGACTTGGG
CGCTTTGGGA TTTACTCATT TTGCATGGCG ATAATCCCTT CAATACGCAC TGGTTGTATT
GGACAGGCTG GGCTTTGTAC AGCAACGAAA ACGCCAATTC TGGGGCCTTC GTAATCAGCA
GTGACCTGTA TTTGCGGATC TTACTGGCCA TGTTGGTTGC AGGCATTGCC ACCGCAGCCA
AATGCGTTTA TGTTGTCTTG CAATTCAGTC GTCGTCAGGT ACAAACATTT CAAATTCGGC
TTAAGGCAAT TTTGGGCGAG CTCGTCACGG TCTCGCAGAT TGCTGCGTTG GCAACGCAGG
CTGATCTTGT GGCGGCACAA TTGGAGATGC AAATGGATGA AGACTACAAC GCTCCGGTAG
GCACATCCAT TCGTTCCATC GGCTCGTCTC TCAACAAGTC GACCAGACCT CAATCTTCCC
TGCCACGCGG GGCTGCGCCT GTTACAGGGA ACACCAATGT TCGTTGGAGT AACCTACATT
TTGAAGAACA CGATTCCAAA GCTCCAGAAC ACGATGATGA TGATGACGAT TCATCCAGAA
GCGAGAATAC ACCGGGATCC TCGTCTGGTC TCAAACGGTC GTTGTCCCAA GACTCGTCCG
GTAGTTTGGC AGTTCTAAGC CTACTTGATC GCTGGGAGCC ACCAGTCAAC AAGGCGAACA
AGAGCGATGT TGCAATTTCG GACGTGCTTA AATTTCAGCA AGCCTTGCGT TACATGGACG
ATGACGCTGT CTTTGGTGAA GACTTCGGAC CCGCTCGAGA TCGTAACGAG TGCGTGGGAT
CAGCTGTCGC AATGTACCAC AAACTTGTCA AATGGACGCC TGATTCCTAC GTACTTAAGT
TTGACACGTT GGAGATTTTG GCCATGGACG AAGACGGTGT TGTGGATCCA CTAAAGCGAA
AAATGTTACG CAAGCTGTTC CGACCAGATA GATCCGGTCG AATCCCATTG GTTGCATTCA
TCCAATCCAT TGATGCTGTC TATAAGAGAT TGCGTTACTT TCGTGCGTCC GTGACGAACG
CGACAGTGAT TGATGACGTC TTGGAACATA TTGTGGATGG GTTGTTCTAC TTTGTATTGA
GTTTGGTTGT ATTGAGTTTG TTAAATTTCA ATCCCTGGAC CTTTTTGGTT CCCATCACGT
CCCTCATGGT GTCGCTTTCG TTTGCCTTTG GTGGGAGTCT CAGCAAATAC GTCGAGGTAT
GTTATTCGCG GGTCGTCTTG GGAAATTTGC GAATTCTGTT GAGCGCTCAC AAAATTTGCC
GGTTTGCTCA GGGTGTGCTC CTCATTGCGG TCCGACGCCC TTACGACTTG GGCGATCGCA
TTTTCATTGG CAGCGCGGAA GCTCAGGCCG AAAGCGATAT GTCGATCCAA ACTTGGTTTG
TTGAAGGTAA GTGCTTTTGT TGTTGAACAC ACCAATTTTG CGCCGTATGC GTAGGGCTCT
TCCATTAACC TCTTATGTCT ACTATTTCTT TTAGATATCA ATTTGACCAC GACGACTTTG
CGATTCGCTC GTACCAACGA GGTCTCCACT GTTAACAACT GGGCCATTTC CGGCTCTCGT
ATTATCAACT GCAATCGCTC ACCCAATGCT CTCATCTTCT ACGAATGGAA GCTTCATATT
AGCATATTCG ACGGCAAGAA CTTGGATAAT TTCAAGGAAG CTTTGAACAA GTACGTCCGG
GACCATCCCC GAACTTGGAA CAGTCTGGCG TTCATCCGAC ACGACGTTAT TGACGCGGAT
ATGGAACAGG TGGGCTTCCG CATGGCCTTT CGCCACCGGA ATGGATGGCA GGACGCAGCG
CGGATCAAAC TCAACCGGGC AGACCTATTG CGCTACATTC ACGACACGGC CAAAGCCATG
GGGGTCAACT TTGAGACCTC CCCGGCCCGA CGTCTCTTGT ACTACGGTGG CGTCTTGGAA
AGCGGCCAAG TCAAGGATTA CAAGAAGAAT CTGTTGCGTC CATCAAACAT TCGTAGTCAC
AGTCACACCT TTGACGATCA TCGTTTTGAG TCGTCCTACC CCGGTACGGC GGGGATTCCG
CATTCTCCTC CTCCGCAAGC AACTCGAGTC GCTCCACCCC CGGGAGACGT TTTAATGGGT
GAATAGCCTA GCTGCAGGAG TTGTCAATGA ATTATGCAGT GAACGATTCC GAACCGATCA
GGATGGTCCA TTCTTTATAT TTAAGTCTAC TTACTTTCGT GGTTTTACC
 
Protein sequence
MKDGSNSRND SSTEGRSVFR NLVPDRHDEH DPSPYVHEQR TTAPPLVYDA PSLLNESQAE 
TSNFAAIHSN YNDNSTSSTR TPAVGSLRDA ARRVSMLNRT GLLYKHVLSS SHQAHQRVSS
RAQDLLSAIQ EGGHLSNDEH GDDNVLADDM SRSDLTDEEE QLLGTASDAA NNTNDTLPTR
RSEYGSVLMQ RQESTSQKSV AFRRMRSWRR KVAVLFHPTR MLRIAWESFL FRLGLPFCLA
AWLLYYPLGN PEVDALPGKT RLSWWFNFAG RQVITLELAR LTQWLLLDLW LCRSGSSARE
RPWWKVDPLV TLWSIHSRGW PFLISTWALW DLLILHGDNP FNTHWLYWTG WALYSNENAN
SGAFVISSDL YLRILLAMLV AGIATAAKCV YVVLQFSRRQ VQTFQIRLKA ILGELVTVSQ
IAALATQADL VAAQLEMQMD EDYNAPVGTS IRSIGSSLNK STRPQSSLPR GAAPVTGNTN
VRWSNLHFEE HDSKAPEHDD DDDDSSRSEN TPGSSSGLKR SLSQDSSGSL AVLSLLDRWE
PPVNKANKSD VAISDVLKFQ QALRYMDDDA VFGEDFGPAR DRNECVGSAV AMYHKLVKWT
PDSYVLKFDT LEILAMDEDG VVDPLKRKML RKLFRPDRSG RIPLVAFIQS IDAVYKRLRY
FRASVTNATV IDDVLEHIVD GLFYFVLSLV VLSLLNFNPW TFLVPITSLM VSLSFAFGGS
LSKYVEGVLL IAVRRPYDLG DRIFIGSAEA QAESDMSIQT WFVEDINLTT TTLRFARTNE
VSTVNNWAIS GSRIINCNRS PNALIFYEWK LHISIFDGKN LDNFKEALNK YVRDHPRTWN
SLAFIRHDVI DADMEQVGFR MAFRHRNGWQ DAARIKLNRA DLLRYIHDTA KAMGVNFETS
PARRLLYYGG VLESGQVKDY KKNLLRPSNI RSHSHTFDDH RFESSYPGTA GIPHSPPPQA
TRVAPPPGDV LMGE
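
Both sequences above are displayed in blocks of 10 characters, with up to 60 characters per line. Before any downstream analysis, such a display usually needs to be joined back into one contiguous string. A minimal Python sketch follows. The function name `join_blocks` is hypothetical, and only the first line of each sequence is used as sample input.

```python
def join_blocks(display_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(display_text.split())


# First display line of the gene and protein sequences, copied from this record.
first_dna_line = "CGTTTACATT ACTTCCAAAA ATATACCTGA CAGTGACTGA AAGTTTTATC CAAACGAAGG "
first_protein_line = "MKDGSNSRND SSTEGRSVFR NLVPDRHDEH DPSPYVHEQR TTAPPLVYDA PSLLNESQAE "

dna = join_blocks(first_dna_line)
protein = join_blocks(first_protein_line)
print(len(dna), len(protein))  # 60 60
```

Applied to the full listings, the joined lengths can be compared with the "Gene Length" and "Protein Length" entries as a quick check that nothing was lost when copying.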