Gene PHATRDRAFT_54139 details


Gene Information

Locus tag: PHATRDRAFT_54139
Symbol: hBRM
ID: 7197022
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 1316659
End bp: 1320025
Gene Length: 3367 bp
Protein Length: 995 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002178118
Protein GI: 219112733
COG category:
COG ID:
TIGRFAM ID:
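
The gene length listed above follows from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (the record does not say so explicitly, but the listed figures are consistent with it); the function name `gene_length` is hypothetical:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(1316659, 1320025)
print(length)  # 3367, matching the listed Gene Length
```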


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCCTGGTACC GGCGATTTCG CCCAGACCCA AACCCGTAGT GCCACCGGCT CCACCGAAAC 
CAACCGTAAC AACTTCCTTT GCCAAGCCGG CTCCGGTCAA TGTGTACACG AAACCAGTAC
CGGCAACGAC ACCGGTGCCA GCAAAACCCA AGACCCCATC GTCCGCCCAG CTGCAACAGC
AGCAACAGGC CAAAGACCTA CGGCAGAAAG CCAAAGAACG TGCTCAATGG AAACGTGTTC
AACACGGAAT CTTCATGCTC CAAAAGGAAA AGTTTCTGGC CGTACCTTTT AGTGTGGGAG
CAATGGTAAG AAGCCGGGAC ACTGCCGCCA CATTGCCACC TCCGGCTCCA CTGCCGAAGC
GATCTCTCTC CGATCTTTAC CCCACAATCG CCGAGCTACA AAGGCAACTC CGGGCACAGA
AAGCTGCCGG TCAGCCCACT GCAAAACGAT CACCTACGCC CTTATTGGAC CCAGAAAAGT
TCAAGCGCAT CAAGGTAGAA CCGAAAAAGT ACGCTAAAGC GATTGACCGC GCCGCTCGCA
AATCTCGACA GACGACGGCC GACACGCTCA GCAAGCAGCT CAAAGACGTA CACAAGGTTC
TCAACGCCCA TCAAGTCGAC TTCTTCAAGT TTCACCGCCA GCGCCGAACC GAACACGCCA
AACTTCTCAA GACCATCCGG GACGTCTTTA ACAAGGAAGC CCGGAAGGTC GAAAAGGACG
CGACGCATGC GGAGAAGGCG CGAATTGCCG CATTGCGGGC GAACGACATG ACCGCTTACT
CAAAGCTTCT AGAGGAAACT CGCAACGACC GTTTGCAGTA TTTACTCGAC AAGACAGAAA
AGCACTTTAC GCAGATCTCC TCATTGCTAC ACCAGGAACG GTCGGACGAC GGGGGCGATC
AAAAAGGCAA CAATTCCTAC TACGCGTCGG CTCACTTGAA AACGGAAGAA GTCCGACAAC
CAAGTATACT GGTGGGTGGT GAGCTGAAGG AGTATCAGCT GCTAGGATTA CAGTGGCTCG
TATCTTTGTA CAATAACAAG CTGAACGGGA TTTTGGCGGA CGAAATGGGA TTGGTACGTA
CCGGCCAGTG CCGTAACTAG ACGAGTTGAA TAGGCTGGTT TCTGACTAAG ACTTTGCTGT
TTTGTGATTT CAGGGGAAGA CCATTCAGGC TATTTCGCTC ATTGCGTATT TGATGGAGTT
TAAACAAAAT CTAGGTCCTT ATCTGGTGAT TGTGCCTCTT TCCACTCTCT CCAACTGGCA
AAACGAGTTT CTCAAGTGGT GCCCGGCGGC GAGGCTAATT TGCTACAAGG GAACGCCGGG
GTTGCGAAAA GAGATTTATC GCGACCAGGT CCGCACCGGC CACTTCAATG TATTGCTAAC
TACGTATGAA TATATCATCA AGGACAAGAA ATTTTTACGC AAGATTGATT GGCAGTACGC
GATCGTCGAT GAAGGTCACC GCATGAAAAA CGCGCAGTCC AAGTTTGCCG TTACCCTCGG
TACGCAGTAT TCAACGCGAT ACCGCGTCTT GCTGACCGGA ACTCCACTGA TGAACGACTT
GAGCGAGCTT TGGTCACTTT TGAATTTCTT GTTGCCCACT ATTTTTAACT CTGTGGAGAC
GTTTGACCAA TGGTTTAGTC GACCGTTCGA ACAGTTTGGC GGTGGTTCTA ACACGGATGA
AGGTGATGAC TTGTTGTCCA ACGAGGAGCG CATTCTGGTT ATCCATCGTC TGCACGAGCT
TCTTCGACCT TTCATGTTGA GGCGTGTCAA GAGTGAAGTG CTTGATCAAC TACCCGAAAA
GGTCGAGAAG GTCTTGCGTT GCGAGTTGTC CTCTTGGCAA AAAGAACTCT ACAAACAAAT
TAGTAAAAAA GCGGTGGCCG ACACAGCTTT AATGGGTACC GACACGCAGG CTCCATCACG
TGGCCTGAAC AACATTGTGA TGCAGCTACG CAAGGTCTGT AATCATCCCT ACCTTTTTTC
TCCGGAAGGT TACCACATCA ATGACATAAT AGTTCGTTCT TCGGGAAAAA TGGCTCTTTT
AGACCAAATG CTACCCAAAC TCAGAGCCGC TGGTCATCGT GTGCTGATGT TCACGCAGAT
GACAGCAGTC ATGACAATCA TGGAGGACTA CTTTGCTCTC CGGGGATACA AATCGCTGCG
CTTGGACGGG TCAACTCCAG CGGAAGAGCG AGAAAAGCGT ATGTATAAAT TTAACGCGCC
GGACTCTCCT TACTTTGTTT TCCTATTGTC CACACGGGCA GGTGGTCTCG GACTCAATCT
TACTTCGGCG GATACCGTCA TTATCTTTGA TAGTGATTGG AATCCCATGA TGGATCTGCA
GGCACAAGAT CGGGCTCATC GCATTGGACA GCGTAGCGAC GTTAGTGTCT TTCGGCTTAT
TACTTACTCT CCAGTTGAAG AAAAGATTCT CAGTCGAGCG AATGAGAAAC TCAGTGTCTC
GGAACTGGTA GTGGAATCGG GCCAATTCAA CAAGCAAGGC GGTGAAAGTG ACAACAGTTT
AGAGCGAAAA CGGCTGATGG AAGTGTTGCT CACGGATTTT GAAAACGCAC AACCCAAAGC
TGTGTCAGAG AAATCAGCTG GCTCGGAGGA TGGGGAGGAG GACGATGACA ACAACAGTGA
GAGCAGCGAC AAGGAAGATT TAAACGAAAT GTTGAGCAAT AACGAAGCAG ACTATCAACT
CTACTCGTCA ATTGACGAGC AGTTAGAAAG AGAAGGTGGT ACTCTCGCTC CGCTGTACAT
AAGCGACGCT GATGTCCCTG ACTGGGTTCG TTATCCTCAT CAAGGAGCCA ACGATGGCGG
GTTTGAAGCA CCAAGCAACT TTTTGGGCGA CGGCTCGAGG AAACGAAAAG CGGTCATGTA
CGACGACGGT CTTACAGAGA AGCAATTTCT TCGTATGATG GAGAAACAGG CCGTCCAAGA
AGAGCAGCAA CCTCGGAAAC GTCCGAAGCT TCAGAAAATC GCCCCGAGTA CAGTCTCTGC
AGCAGCTATA CCTGATGCCG AAGAGCAAGC ACCTTTACGT AGTGATTCGC TCCTGACAGA
TTGGACATTC CGCAAACTAA TTAGCTGCTC AAAATCGGTC GTCGCTTTAA AGGATCCCAG
CACAAAGCGA CGTTTATCTG AGTTGTTTCT TGAAAAGCCG GACCCCGCAA CTTTTCCCGA
CTATTACGAG ATTGTTGAGA AGCCAATGGC AATCAACGAC ATTCTTCGCA AGTGCCGCGC
GAAGATATAT TCTAACTTGC AAGAATTCAA CGATGATTGG ATGCTCATGT TTGCGAATGC
CAAAAAATTC AACGGCGAAG ACTCGTGGGT TGTGGAGGAC GCTAAGGCTC TGGAGAAAGA
ACTCCAG
 
Protein sequence
MLQKEKFLAV PFSVGAMVRS RDTAATLPPP APLPKRSLSD LYPTIAELQR QLRAQKAAGQ 
PTAKRSPTPL LDPEKFKRIK VEPKKYAKAI DRAARKSRQT TADTLSKQLK DVHKVLNAHQ
VDFFKFHRQR RTEHAKLLKT IRDVFNKEAR KVEKDATHAE KARIAALRAN DMTAYSKLLE
ETRNDRLQYL LDKTEKHFTQ ISSLLHQERS DDGGDQKGNN SYYASAHLKT EEVRQPSILV
GGELKEYQLL GLQWLVSLYN NKLNGILADE MGLGKTIQAI SLIAYLMEFK QNLGPYLVIV
PLSTLSNWQN EFLKWCPAAR LICYKGTPGL RKEIYRDQVR TGHFNVLLTT YEYIIKDKKF
LRKIDWQYAI VDEGHRMKNA QSKFAVTLGT QYSTRYRVLL TGTPLMNDLS ELWSLLNFLL
PTIFNSVETF DQWFSRPFEQ FGGGSNTDEG DDLLSNEERI LVIHRLHELL RPFMLRRVKS
EVLDQLPEKV EKVLRCELSS WQKELYKQIS KKAVADTALM GTDTQAPSRG LNNIVMQLRK
VCNHPYLFSP EGYHINDIIV RSSGKMALLD QMLPKLRAAG HRVLMFTQMT AVMTIMEDYF
ALRGYKSLRL DGSTPAEERE KRMYKFNAPD SPYFVFLLST RAGGLGLNLT SADTVIIFDS
DWNPMMDLQA QDRAHRIGQR SDVSVFRLIT YSPVEEKILS RANEKLSVSE LVVESGQFNK
QGGESDNSLE RKRLMEKSAG SEDGEEDDDN NSESSDKEDL NEMLSNNEAD YQLYSSIDEQ
LEREGGTLAP LYISDADVPD WVRYPHQGAN DGGFEAPSNF LGDGSRKRKA VMYDDGLTEK
QFLRMMEKQA VQEEQQPRKR PKLQKIAPST VSAAAIPDAE EQAPLRSDSL LTDWTFRKLI
SCSKSVVALK DPSTKRRLSE LFLEKPDPAT FPDYYEIVEK PMAINDILRK CRAKIYSNLQ
EFNDDWMLMF ANAKKFNGED SWVVEDAKAL EKELQ
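
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input:

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record, used as sample input.
first_line = "CCCTGGTACC GGCGATTTCG CCCAGACCCA AACCCGTAGT GCCACCGGCT CCACCGAAAC"
seq = clean_sequence(first_line)
print(len(seq))                   # number of bases after removing the spaces
print(f"{gc_fraction(seq):.0%}")  # GC content of this sample line only
```

Running the same helpers over the full gene sequence block gives values that can be compared against the Gene Length and GC content fields in the record.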