Gene PHATR_33231 details


Gene Information

Locus tag: PHATR_33231
Symbol:
ID: 7204308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 372217
End bp: 374973
Gene Length: 2757 bp
Protein Length: 918 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002186050
Protein GI: 219112933
COG category:
COG ID:
TIGRFAM ID:
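
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Illustrative consistency check for the record's coordinate fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS includes one terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive [start_bp, end_bp] interval."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(372217, 374973)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2757 918
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.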


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCGCA CAAAACGTCA AGCAACCCCT TCGGTTTCCA AAAATTGGGC CATGATCAAA 
GCGCAGCTTT TGAATAAAGC CCATTCCCCG GTGGGTTCTA TTTCACGACC CCAGTGGAAC
CAACTTGTGG AGGCGCTTCA CAACCTAAAA GCCCCACTGG GCAAAAATCT TTATAGGGCT
CTCGTGGAAG CCTTTGCTCT CCTGGATCGA ATGCATGAGG AGGTAACGGA AAAAAAATCG
GCAATGCCTC GTCTGCCTTT ACCTCTTCCC ACGGTTCTCC TTCATCTGAC GATCGATTCG
TGGCGCAGGC AGGCGGCTGA ATTTGCTACC AAAGGTAAAG TCTTTCCAAT CTCTGCCAAG
TCTCTCCTAC AGAAAGTCGA GCACTTTACG GAAACAGGAC TATTTGACCC TAGCGCCAAA
ACTTATGGAA TGCTTGTCGA GGGCATGGCA TCGACCGAAG ACAAGCATAA TGCGCCTATC
TTGGCCGAAG AGCTTCTGGA GCGCATGACC AAGCAGTACG GCCAGGATCC TGCCGGGCGA
AAACTTGAAT GTGCACCCAA TACTGTAATA GTCAACAGCA TCATTAATTT GTGGATAAAA
AGTGGTCGGA GTGATTCCAT GGATGCCATT GAGCGGAAAT TTCAAAAGTT GAAAGACTGG
TACAGGTTTT CTGGACGGGA TGATCAAAAA CCAAACGCAT ATACTTATTC GTCTCTAATC
AGTGCGTTGG GTCGTTGCGG ACACCACCAA GCTGTCGAAC GAGCTGAAAC TTTATTGAGA
GAATACGAAG CCAGCACAGG CAAGCAGCCT TGCACAGTCC TATACACTTC CTTCTGCCAG
ACTCTGGCAA CAACAAACGC CCCCGGCGCT GCTGATAGAG CACAAGCGGT GCTGAACGAA
ATGCTCTTTC GCTCACGCCA CGGGGAAGAT ATGGCGAGGC CAAATTCGTA TACCATCTTG
GCCGTTGTTT CCACTTACTT GAAGGAAGGA AGAATTGATG AGGCTGAGGT GCTAGTCCGT
AATATGGAAG ATCTATCCCG TCAAAGGGCC GACGATGGTT TGCGGCCCGG TATTTTTTGC
TACAACTCCC TCATTCACGC TTGTGCCAAG AACGGAAATG CAGAGCATGC AGAATCGATA
CTCAATCACT TGTTAGACTC GGCAGAATCC GGAAATAGTA TTCTGCAGCC CAACATTGTT
ACATGGAACA ACGTTCTTCA CGCTTGGGCA AAAAGTAAGC ACCCCGCCCG GGCTCAGCGA
GCATCAAATG TGTTTAAACG AATGCGACTG CTCGATCGGG CTGGTATATC AGGGGCGAGC
GCCGATACCC GAACCCTTAA TATTCTTATC GACTGCTATG TGAACAGCCC AAACCCCAAA
GCATTTGTTC ACCAATCCAT CGAGCTTTTC GAATCCGTTA AGAGCCAAGG TGTAGACTCG
TCGAGTGGCA TTTTTGACCC TGTTTCATAT CGCGGAATGA TGGACCTACT ATGTAAGGCC
GGCGAGTTTG ACAGAGCACT GCGACTTCAC AAGCGTTACA TAGAAAAGTC TACAGCTCCA
GAAGCGCCGC TTCAACCAGA CAGGGCATAT TTTAATGTGC TAATGTCAGG ACTAGCACGG
TCCGGATGGG AAAAAGCTGT TGAGTCTGTG GAAGCCATGT TAACCGAGAT GCATTCTCTT
GCCAAATTAG GCTACAATAC CCATCCCGAT GTGGTCTCTT ACAACTCACT TCTAAATTGT
TTTGCTACAT CGAGCCAAGT AAACGCACCG AGCAAAGCTT TCTCAGCACT TCGGAGAATG
GAGCAATTGT GGGCAAACGG AGATTTATCG GCTGCACCGG ACGCATCGTC ATATTCAGCA
GTTTGCGCTA CCTGGGCGAA TGCTGGTCTG GCTGAGGGTG CCGAGAAGGC TGAGGAAGTC
TTGCGTCATA TGCTATTGCA CCGGGACCAG AAGATCCAAA GCACGAGCCA AGTGTTTAAC
ACTGTAATGG TTGCATACGC TAGACAGGGT GATGCACCTA AAGTACAGAA GCTCTTCGAC
GAGTGGCTCG AGACTGATAA CACTTATGAC TCTCGTATAT ATGTCACTTT GCTGCAAGCG
TGGTCAAAAG CAGGCAATCC GGAATCTACA GCTAGTGTAC TCTATGAATT GATAAGACTG
TTTGACAGCG CAGCCATACG TAATCCACCA ACGACTCAAA TGTTCAATGC TGTCCTCCAG
GCTTGGCTAC GGTCAGGACG CAAGCACGCG GAAGTACAAA TAAAAGCGGG GGTCGATGAA
ATGTCGTCCC TAGCAACCTC TGGGAGATTT CCGTGCGCCC CAGATGCTCT GACATACTCA
ACTCTGTTTT CCGCTTGTGT GCGATCAGGC AGAGATGATC TTGGTGAACT AGCGCATAAT
GGGCTTTTAG AATTGAAATC GCGTTTCGTC ACAACACGAA ATCCAATGTA CCGGCCTGAT
TTGCGAATAT TTGCAGAGGC TATCATGTTG GTGGCTATGG ATGATAAATA CTCAACAAAA
GATATTTTGT CGCAGCTACT ACTTGAGCTG AACGCAGTCA ACGGGACAAT CTGGAAAAAG
CAAGGACAAA TCGCGATGAA TCGAATTCTG GCTGCTATCT CGCGGTCTAC CATCAATGAA
AAGGAAGTCT TGGCTCAACT TGGAGTAGAG ATCATGAGAG CACAAAATGT TTCACCAGAC
GAATCGACTC GAAAGTTTTT AGCTCGATGT TGCAATAAAG AGGGACAACA AGCTTGA
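
The GC content field can in principle be recomputed from the gene sequence as printed. The following is a minimal sketch, assuming the sequence is supplied as text in the layout shown above (blocks of ten bases separated by spaces, one row per line); `gc_content` is a hypothetical helper name, and the record does not say how the database itself computes or rounds the value.

```python
# Minimal sketch: fraction of G and C bases in a whitespace-formatted
# nucleotide sequence. The helper name and input layout are assumptions.

def gc_content(seq_block: str) -> float:
    """Return the G+C fraction of a sequence, ignoring spaces and newlines."""
    seq = "".join(seq_block.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage with a short made-up example (not taken from the record):
example = "GGCC AATT\nGCAT"
print(f"{gc_content(example):.0%}")  # 50%
```

Passing the full gene sequence text to the same function would give the fraction to compare against the GC content field.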
 
Protein sequence
MTRTKRQATP SVSKNWAMIK AQLLNKAHSP VGSISRPQWN QLVEALHNLK APLGKNLYRA 
LVEAFALLDR MHEEVTEKKS AMPRLPLPLP TVLLHLTIDS WRRQAAEFAT KGKVFPISAK
SLLQKVEHFT ETGLFDPSAK TYGMLVEGMA STEDKHNAPI LAEELLERMT KQYGQDPAGR
KLECAPNTVI VNSIINLWIK SGRSDSMDAI ERKFQKLKDW YRFSGRDDQK PNAYTYSSLI
SALGRCGHHQ AVERAETLLR EYEASTGKQP CTVLYTSFCQ TLATTNAPGA ADRAQAVLNE
MLFRSRHGED MARPNSYTIL AVVSTYLKEG RIDEAEVLVR NMEDLSRQRA DDGLRPGIFC
YNSLIHACAK NGNAEHAESI LNHLLDSAES GNSILQPNIV TWNNVLHAWA KSKHPARAQR
ASNVFKRMRL LDRAGISGAS ADTRTLNILI DCYVNSPNPK AFVHQSIELF ESVKSQGVDS
SSGIFDPVSY RGMMDLLCKA GEFDRALRLH KRYIEKSTAP EAPLQPDRAY FNVLMSGLAR
SGWEKAVESV EAMLTEMHSL AKLGYNTHPD VVSYNSLLNC FATSSQVNAP SKAFSALRRM
EQLWANGDLS AAPDASSYSA VCATWANAGL AEGAEKAEEV LRHMLLHRDQ KIQSTSQVFN
TVMVAYARQG DAPKVQKLFD EWLETDNTYD SRIYVTLLQA WSKAGNPEST ASVLYELIRL
FDSAAIRNPP TTQMFNAVLQ AWLRSGRKHA EVQIKAGVDE MSSLATSGRF PCAPDALTYS
TLFSACVRSG RDDLGELAHN GLLELKSRFV TTRNPMYRPD LRIFAEAIML VAMDDKYSTK
DILSQLLLEL NAVNGTIWKK QGQIAMNRIL AAISRSTINE KEVLAQLGVE IMRAQNVSPD
ESTRKFLARC CNKEGQQA