Gene PHATRDRAFT_47762 details


Gene Information

Locus tag: PHATRDRAFT_47762
Symbol:
ID: 7202743
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 759177
End bp: 762486
Gene Length: 3310 bp
Protein Length: 1014 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002182129
Protein GI: 219123637
COG category:
COG ID:
TIGRFAM ID:
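
The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the reported gene length); the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_nt_for_protein(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: three per residue, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


if __name__ == "__main__":
    span = gene_length(759177, 762486)    # 3310, as listed under Gene Length
    coding = coding_nt_for_protein(1014)  # 3045 with a stop codon
    # The genomic span is longer than the coding requirement, which fits
    # the record's "Is gene spliced: Yes" flag.
    print(span, coding, span - coding)
```

Under these assumptions the genomic span exceeds the coding requirement by 265 bp, the kind of difference expected for a spliced gene.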


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGTAA TTGACAACAA TAGCGAAGAC ACTGGGGGAA TGCGTTGTTG GTTGCCACGG 
GAAAATGGTT CTGACGACCA AGAGGCGGCG AGCCTACAGA ATTTGGATTG TCTGTGCTTG
GGGACGGGAC GGTTTTTGCG TTCCGTTCTC GTGCCGGCTT TAAACTCCTT TAGTCATTCA
GTTTTGGTGC AAACGCGGGG ACGCTCCTTT CTGGAATATA TGGCGACGCA GGATGGGGAT
GACAACGGAA CGTTTCCGGT GGACACCGTC CTGCCATCGG GCGAAATAAA GACTGACCGA
TACCGATGCT ACGGGGCATT TTCGTGGGGA CGAGTCGAAG ACAAGGCAGC TTTTTATGAC
GTTTCCCGGA AGACTAGTGG ACCCTCGGTT ATCGGTGTAG GAGTAACGGA GGCGGGCTTG
GCATCATCCG AGACCCAGGC TATGAAAGAC CTGTACGATT TTTTGGAGTA CTATCAAGAC
ATGTGGGAGG AACGCAGCCT TTGGAAACCA GCTCTAACCC CACACAAAAA GCTTTGTGTC
ATCGATATGG ACAACATTCC CCAGAATGGG GATGTCTTGG CGCGTCATAT GAATAGTTTG
GCGCAGGACA ATGCAAGAAT GTTGCGTTTC TTGGCCGACA AGGTTGTATT TTTGAATACC
ATGGTCGATC GGATCACATC CCATCGTGAA GGAGACCCAA TGGTTCCCAA GGCAGAACCG
GTCCCGGCCA AGGCTTTGGT AATTCTTGAT TCTGAGGGGG ATCTTCCAGT AGCGTTTCAT
AAAATGAAAG AATCCCACGG GGTAGTAGTA CGCTCAACGC GGGCCGAACT CGAAATTGAC
TTGGCTCTAA AGTTACGGGT TGCCAATGGC ACGCACACAG CCTTAGCACA CATACTGGCT
CTGACTAAAC GAACAATGAC AGATGCACTC ACTGTTGACG GAGTTGCTGG ACCGTTGCTC
TTGGCATACT TAGATGCGCT TGTGGAAACA CAGATTCTAG CTGCTGGCGG GGCGTCGGGA
CTGGAACCCC ACGCTACAGC CGCCTTAGAA GTATGGCAAG ACTGGCGATC AAGGCTGACG
CATCCATATT TTGGTCTAAG TTCTTTTTTC ATTACTCAAA ACGGAGCAGC TAAGGGCGGA
ATCCGCCTTG GACCAACTGT GCTAGATCTG GTAACAAGAA GTCAGACTAC ACAGCCGCTC
AATGTCGCGA TGGCGTTTGC TTGGGCATGC TTGCTGCGCT GGTTGACGCC AGACCGCAGG
AGAGATAGTG AGGATGAGAA AAGTAGTCGC TATTCATTGA CGGAAGAAAT GACGTTTACA
ACCGCTAAAG GTGTCTATAC AGGTTGGTTA CAAGGGTCAG AACTCAATAA CACGGAAGAC
GCAACTACGA CATACGCTGA TGGATTGCAC TACAATCTGA GTCAAGATTG GTATGAATTT
CGGTGCTCCT GCAAAGTGCC AGTAGGCAGC AGAACTCAAC TGCAAAAACC ATTGTCAGAT
GTTTTGGGTG CTTTAGTTTG TAGTGGTCCG CGGCAGCCGG TAGCATACCA TGGAATAGTC
CGGTCGTACC TCTTGGCAAC CGACGGCGGA AATTTAAACG CGATTGCCGA CAAGCGGGCC
ATGAATGACC TCGTGGCTGG AGTGTCCACT CTATACGCTC GCATGATTGT CGGGGACGAC
ATTTTGAGTA TTCTGAAAGA AATCGGGGAC AACGACGGCG CCTTTATTGA TGGTTTCGCC
ACAGCGTGTA CATCTATGGC AGATGTGTCT TGTTTGAGTC AGGGTTGTCC TTTAGCATTT
CGACGTAGTC CTGTCCCGAA TCACAGCCGA CTACTGTTGT TGTCTATCCA CAAAGATACG
ATCGATACAG TTGTAACTTC TGAAGTAGCC TCCGCTATCG CCATTGATTT GCATACTCAC
TTGCTGCCAC CCTCACACGG CCCGCTCTGC TTGTGGGGTA TTGATGAGCT ATTGACTTAT
GTATGTGTAG CAGTGGGCGA TCCGTTACTT ATAAGAAAGT ATCCAGAAAC ATTCTTTCAC
ACTTTTTCTT TGTTCTTGCG CAGCATTATT TAGTGGCGGA GTTCTTTATA ACTGCTCCGG
CATCGATGAC ACCAGACGGC TTCTATGCTT TGCCAAAGAA ACAGCAAGCG GATACAATTT
GGCGGGCACT TTTTGTGGAG AGATCACCGC TCTCGGAGGC ATGTCGCGGA GTCATTACAG
TTTTGGTGTC TCTCGGATTA GAGAACGCAC TAGCGGACCG CGACTTAAAC TTGATTCGTA
AATTTTACAA AGGCTTCCGG GACGAAGGCC TAACCGGAGC AGAGAAGTTC AGTTCTTTAG
TTTTCAGCAA ATCGGGTGTC CGGTACAACA TTATGACAAA CATTCCCTTT GATCCCAACG
AAGAGAGGTA CTGGCGTCCT AAACCAAAAG ATTATTCGGA CAATTATCGC TCTGCTTTAC
GTGTAGATCC CCTCCTGACT GGTGATTGCC GAACGATTGA ATTGGCTTTG AAGGGCTCGG
GATACGACAA TACTATCGAG GGGGCGCGTC AGTACTTACG AGACTGGTGC GATACAATGA
GTCCGGAATA CATGATGGCG TCGACGCCGC ATGACTTTCT GCTGGAAAAA GGCACCTTGG
GTTCCTCGAC ATCTACCGGT ATTAATGAAG AGGCATTGAA ACTCCCGGGC GCTTTCGCTC
AGCTCAAGAA CCAAGAAATT AGCTGCAATA GCACAGAAGA TGACAGTCCG AGTGTTATTA
ATGAGAACAG TGATTTCTTG GGCAACGTGT TGATGAAAAT CTGCGAGGAG CGCGATTTAC
CTGTGGCCCT AAAGATTGGA GCGCATCGGA GAGTTAACCC AGCCTTAAAG CAAGCAGGCG
ATGGTATGGT TGCGTTTGCT GATGCTGGTA TGCTTGGGCG GCTGTGTTCC AGGTTTCCAA
AAGTTCGCTT TCTCGCAACC TTTTTATCTC GTAACAACCA ACACGAGGCT TGTGTCTTGG
CGTCCAAGTT TCGCAATTTG CACATTTACG GATGTTGGTG GTTCTGCAAC AACCCAAGCA
TTATTCGAGA GATTACCCAA ATGCGAATTG AAATGTTAGG TACTGCCTTT ACGGCGCAAC
ACAGCGATGC CAGGGTTCTT GATCAGCTGT TATACAAATG GCCCCACTCG CGAGCCGTAA
TTGCGGCAGT TCTGAAGGAT GAAATGGCCA AGATGGTAGC TTCGGGGTGG ACTCCTACCC
GTGCCGAAAT TAGACGAGAC GTCGCCCGTC TTTTCGGGGC TAGCTATGAG GAGTTCATGC
GCAAGTCTCT
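
The sequence above is printed in space-separated groups of 10 bases, 60 per line. A minimal sketch of how such a block could be flattened and its length and GC fraction computed follows; the helper names are hypothetical, and the sketch does not assert that it reproduces the 49% GC content reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, used as a small example.
    first_line = (
        "ATGACTGTAA TTGACAACAA TAGCGAAGAC "
        "ACTGGGGGAA TGCGTTGTTG GTTGCCACGG"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full sequence block to clean_sequence should yield a string whose length equals the Gene Length field if the block is complete.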
 
Protein sequence
MTVIDNNSED TGGMRCWLPR ENGSDDQEAA SLQNLDCLCL GTGRFLRSVL VPALNSFSHS 
VLVQTRGRSF LEYMATQDGD DNGTFPVDTV LPSGEIKTDR YRCYGAFSWG RVEDKAAFYD
VSRKTSGPSV IGVGVTEAGL ASSETQAMKD LYDFLEYYQD MWEERSLWKP ALTPHKKLCV
IDMDNIPQNG DVLARHMNSL AQDNARMLRF LADKVVFLNT MVDRITSHRE GDPMVPKAEP
VPAKALVILD SEGDLPVAFH KMKESHGVVV RSTRAELEID LALKLRVANG THTALAHILA
LTKRTMTDAL TVDGVAGPLL LAYLDALVET QILAAGGASG LEPHATAALE VWQDWRSRLT
HPYFGLSSFF ITQNGAAKGG IRLGPTVLDL VTRSQTTQPL NVAMAFAWAC LLRWLTPDRR
RDSEDEKSSR YSLTEEMTFT TAKGVYTGWL QGSELNNTED ATTTYADGLH YNLSQDWYEF
RCSCKVPVGS RTQLQKPLSD VLGALVCSGP RQPVAYHGIV RSYLLATDGG NLNAIADKRA
MNDLVAGVST LYARMIVGDD ILSILKEIGD NDGAFIDGFA TACTSMADVS CLSQGCPLAF
RRSPVPNHSR LLLLSIHKDT IDTVVTSEVA SAIAIDLHTH LLPPSHGPLC LWGIDELLTY
HYLVAEFFIT APASMTPDGF YALPKKQQAD TIWRALFVER SPLSEACRGV ITVLVSLGLE
NALADRDLNL IRKFYKGFRD EGLTGAEKFS SLVFSKSGVR YNIMTNIPFD PNEERYWRPK
PKDYSDNYRS ALRVDPLLTG DCRTIELALK GSGYDNTIEG ARQYLRDWCD TMSPEYMMAS
TPHDFLLEKG TLGSSTSTGI NEEALKLPGA FAQLKNQEIS CNSTEDDSPS VINENSDFLG
NVLMKICEER DLPVALKIGA HRRVNPALKQ AGDGMVAFAD AGMLGRLCSR FPKVRFLATF
LSRNNQHEAC VLASKFRNLH IYGCWWFCNN PSIIREITQM RIEICYTNGP TREP