Gene PHATRDRAFT_48519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48519 
Symbol 
ID7194701 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp87505 
End bp90661 
Gene Length3157 bp 
Protein Length953 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183029 
Protein GI219125527 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.598613 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGACGG TGAGACCCAC CACCACCCCT ACCTACCGCG ACAACGGGAG CTTGGGGACC 
ATTCTCGAAG TACCCGATTC TCCACCGCAA CCCAGTAAAC CAGAGTTGCG CGTCGGCGTG
GACAGTGTGA CTGTGTGAGA AAGGTCCAAG ACCGAGAGAT ACGGTACACA TAGGTAGGTA
GCTAAGTAGG TAGCTGACTA GCTAGGTATT CCCCCTGCCC CCTTTTTCAG ACACAGCCAT
CCTCTTTAAT TCGCTGCAAT CCCACTCACG GTCCATATTC GGTTCCGTTT AGCTCCATTG
CGTGGAAGGC AATTTTGAAT GCAATCGATT CGAAATCGAA TAGACTGTGA ATTAGTGGAC
TCTGTTCGAT TCAACTGACT GTCTGACTGA CTGATTGTTG GAAAAGACTG TTCCACAACA
GGTCGTACGC GTCATTGCCC GCATGTCCGG TTTCGTCGCC AAGCTTCCCC TGCGTGTGCC
GTTGGAGACA GATTTGGGGG GACAATTGGG GCGTTGGTTG GATCAAGCGC ACGTGCAAAA
GTGGGAAGGC CGACCCGGGA TGACCTCTGC CGACTGTCGT GAGGATCTGG ATCGGCTCGA
TCAGATGCGA CGCAACGTCT ACACTGCCTG TCGACACGGC GTCGCCGACG CCCTCCCCCA
CTTGCACGTC TTGCAAGAAT ACGCGGCGGC CCTCGAACTC TGTGAAGAGC AAGGCTTTCC
GTACAACAGC GGTGCCGTTG ATGATTCTGG AACAACTCGT GACGACGAAC ACGATCGGCA
ACATCGACAC AAATTGAACA AACGAGGACA GCAACATCAT TCACCGCTAT CGTCTGCATC
GCTCGGAAAC AGCAGCCTTG CCAACATGCT TGAATTCCCC TGGAAGTCTT CCGATAATCA
GGAAGAAGTC GACGGTACCT TGGCGTGGGA ACGTGCCAAC GTCCTTTGGA ATCTCGCCAT
CGTGCAAGCC CACCAGGCCT ACGCCGTCGA GAAGACACCC AACAATCCAC AGTCTCGCAC
CGCCTGGAAA CAAGCTGGTT TGCATTTGCA AACTGCCGCC TCACTTTTGC GGTATTTGCA
AACGGATCTT TTACCGGCCG CTACGGAACG CTCTTTTCCG TCACACGATT TGTCGGCCTC
CTTCTTGACA CTCTGGGAGC GCTTTTGTTT GGCCGATGCG CAGTACGCCT TTTACCAAGC
GGTTGCGGCG GCGCCCCGTC CTTTGCACGC TCTGCTGGCC AAGGTATCGG CCGCGGCAAT
TCCACTCTAC GGCGTCTGTG AAGAATTGCT TCTCGACGAC GACGATTACG GTCTCGATAC
GAGTAGCTCA GCGAGTATCA CTGCAAACGC CAGTGCTACC GGCCACGCGG CGGCTAACCA
GTTCCGCAGT AAGCGATTAC AAATTTGGGG CGACGCGGTG CGGGCCTGGG GAATGTGGAT
GAGTGCCTTG TGTGAATATC ATCAGGCGCA AACCCACGCG GACAAGGGTG AACGGGGTCC
CGCCCATGCA CGCCTCGAAG CGGCACAGAA ATTTGGATCT CTCTGTCTCG ATTTCTGTAA
CAGCGAGGAA GAATCGCTCT TGGACGATTT GGCGGAGCTA GTCTACGTAA CCTTGCAAGA
TATGGAGACG CAATTGGAAC AAGCGGAGCA AGCCAACAGA CTAGACCCGG TCGACATTCC
CGATCGGAAC GATTTGCCCG AAGTTCCACC CCAAACCATG GTCGAAGTCG AAAAGGACGT
ATCGAGTAGT TTGCCAAAGC TGGCACCACC GCTCTTTACC AGTGGTCCCG GTTCTGTGCT
GCGTCGGTAC GAGCAAACCT TTCGATACGA CATGCAGCGG CTCCTCACTA ACACCACGCT
CGCCGCCGAA GATAAAACGG ATCAAGGACG ACGAGCTTTG GCGACAGTTA ATTTGCCGCA
TTCTGTTACA GCCTACCAAC AGGAAAGTCA GGGTGGGGGC ATTCCGGACG CTCTGTGGGA
AAGGGTCCGA GTAGTGCAAG ACCAAGACAT GCTTCGAGAA TTAAAACAAT CGGTTTGGGA
ATTATGTGAT ATTGCCGAAC GGGCGCGTTC GTTGTACCAG ACTGTTCAAG AAAATTTGAA
AGAAGATCTG CGGGTGGATT CTCTATTTCG CAGTCAAAAT AGTACGTTTG AAGGACACAA
TGTATCGCAA GTTCAAAAGA GTTTTCACAC AACACTCGAG AACTACGATT CGTTGCTGAC
GTCAGCTCGG GAAGGGGACC AGCTTGTTAT GCAACGCGTC GAATTGCTCG ATACAGATCC
AAAGTATAAG TTGCTACAGT TTCGGAAATC GCAACTGGAT AGACTCTTGC CTGCGGGAGA
TCAGAATGTG GACGTGTCCA CGCTCAGTCG AATGCTAGTG GAATTGTCGG CCTTGTTTCA
GCGCCGCGAC GTTTCGCTAG AGGAGTTGCG CAACAAAATG GAGGCGTATG ATTTTACGGG
TGAATTGGTG CAGGTGGATG AGCTTGGTCT GGAGGCAGAA GCTGAATACA AAGCAGTTTT
TCAGCGGGCG AAGGATTCCT TTCAAGGAGC GTTGAACGGA ATTGAACGAA GTATGGAGGA
GCAGTCGAGG TTGGTACGTG AAATTTTGAC GGAAAACGAT ATTTTCATGC ACGAACGCGA
AAACAGTCGT GCGAAAGGGA GCACTGACCG AAGCATCACG ATGATTGAAG ATGCAGTAGA
CGAAGTGGAG CAATTGTCCA CTCATTTGAA GGAGGGGCGG GATTTTTACG ATTCGGTCCT
GCCCAAATTG GAAAAGCTTC GCAAACAAGT TGGCGATGTC AGTGCCCGTC TCACAATGGA
GCGGTGTGAA TATGAAGACA ACACCCAGCG GAACCGACAA GAAGCCGATG ACGCACGTAT
GGCCGCCAAT TTGTCTGATC ACGGTCAAGG TCAACAAACG CAAACCTCTA TACGGTATAT
CGACAATGGA AGTGGCTCGT CCCCTAGGCG TCCTATGGAC CGTGTGGCGA CCCCTGGCAT
GCATCCAGTA TCCCACGAGC TTCCTCAAGT ACGCGTAGAC GACGAAAAGG TCGCAAGTTT
GGTAGCCATG GATTTCGATC CTAACCGAGT CTTTGCAGCT TTGTTACGAT ACGACAACAA
CTTTGAGCAA GCTTTGAATG ATCTGTTGTC GGGATAG
 
Protein sequence
MVTVRPTTTP TYRDNGSLGT ILEVPDSPPQ PSKPELRVGV DSVVRVIARM SGFVAKLPLR 
VPLETDLGGQ LGRWLDQAHV QKWEGRPGMT SADCREDLDR LDQMRRNVYT ACRHGVADAL
PHLHVLQEYA AALELCEEQG FPYNSGAVDD SGTTRDDEHD RQHRHKLNKR GQQHHSPLSS
ASLGNSSLAN MLEFPWKSSD NQEEVDGTLA WERANVLWNL AIVQAHQAYA VEKTPNNPQS
RTAWKQAGLH LQTAASLLRY LQTDLLPAAT ERSFPSHDLS ASFLTLWERF CLADAQYAFY
QAVAAAPRPL HALLAKVSAA AIPLYGVCEE LLLDDDDYGL DTSSSASITA NASATGHAAA
NQFRSKRLQI WGDAVRAWGM WMSALCEYHQ AQTHADKGER GPAHARLEAA QKFGSLCLDF
CNSEEESLLD DLAELVYVTL QDMETQLEQA EQANRLDPVD IPDRNDLPEV PPQTMVEVEK
DVSSSLPKLA PPLFTSGPGS VLRRYEQTFR YDMQRLLTNT TLAAEDKTDQ GRRALATVNL
PHSVTAYQQE SQGGGIPDAL WERVRVVQDQ DMLRELKQSV WELCDIAERA RSLYQTVQEN
LKEDLRVDSL FRSQNSTFEG HNVSQVQKSF HTTLENYDSL LTSAREGDQL VMQRVELLDT
DPKYKLLQFR KSQLDRLLPA GDQNVDVSTL SRMLVELSAL FQRRDVSLEE LRNKMEAYDF
TGELVQVDEL GLEAEAEYKA VFQRAKDSFQ GALNGIERSM EEQSRLVREI LTENDIFMHE
RENSRAKGST DRSITMIEDA VDEVEQLSTH LKEGRDFYDS VLPKLEKLRK QVGDVSARLT
MERCEYEDNT QRNRQEADDA RMAANLSDHG QGQQTQTSIR YIDNGSGSSP RRPMDRVATP
GMHPVSHELP QVRVDDEKVA SLVAMDFDPN RVFAALLRYD NNFEQALNDL LSG