Gene PHATRDRAFT_47504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47504 
Symbol 
ID7202281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp832197 
End bp834751 
Gene Length2555 bp 
Protein Length812 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181634 
Protein GI219122609 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.467202 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGTTG TACGGAACAC CGCCATTCGA CAATCGCTAC TCTTCCCTAC ACAAGCCTTT 
TTCTCCGGCA CCAGGCTACC GTCGTCCATT ATGGAGCGGT GTCGGTCGTT GAGGATCATG
ATGGTGCTGC ATCTAATTGT ACTGGTAGTA TTAATGAGTG CACCAGTGAA TGCTGCCGTC
TTTCCCGGGC ACAGCAGCAG CAGCACGGCA CCGTCAACAA CAACAAAAGG GGCGACAACG
ACATATATTC GTCGAGAATT GCAAGAGGAG AAAAAACAGA GCATTGTACA ACAATACGAG
CTATGGGAGG CGGAAGAAAT CAGTTCCCAG ATACAAAAAT GGGCTTCCCA CTATCCCGAT
CTACTCCGCG TCACTACCTC GCAAGAAGCC TACGGCTTGC CTCGAGCCGG AGGGGCCGAC
GATTGTCCGT TCGACAAGGG CGGCGACGGT TGTCTGAACT ATATACTTAC ACTACAAGAC
TTTGTGAAGC ATCCGGAGGG ATCGGCGACG TCCAATCAGC TGCCGGAAGT CCTCTGGAGT
GGTGAAGTCC ACGGCAACGA ACGGGTCGGC CCGACCGCCG TACTGGAAGC CGCACAGCTT
CTCATGGAAG CGGCTTCGTG CATCGCACAT CCCCGGGTAG CACTGCGAGA TAATCCCAGT
GCCTGGAAGC TAGAATTGAC CAAAGCACGC TCGTGTCGGG AAGAAATGAA GCACATGGGC
CTGGATGAAA GCCACATCCA GTGGTTGGCC CGCCTTGTGT CGACTCGACG GATTGTGATT
ATCCCGACCG CGAACGCACT CGGGTATTTC CGTAAAGTGC GCGAAGAAGG CAATGTCGAC
CCGAATCGAG ACTTTCCCTA TGATTTGACT GATCCCACGC TCTGCATGCA ATCAGTTGCC
GCCCGGACTT TGAACGAGGT CTATCGGGAG CACTTGTTCC AACTTGCCCT TACCTTTCAC
GGTGGTATGG AAGCGATTGG GTATGAATGG GGGGCTCCAA CTTGGAAATC GAAAAAATCG
CCGGATGATT TGGCACAACA AGAGATCGCC GACGCGTACA GTCGCTACGC CGGGGGATGG
TTCGGAACGC GAAACTACCA ATACGGCCCC ATGAACGATC TGGTGTACCC CGTGCGTGGT
GGGATGGAAG ACTGGGCCTA CGCAGGCTCC TGGGATACTG ATAGAGTCAT TGCTTGCCGA
CCCACTACAT TTGGGGGCTA CCCAGAAGCC AAGACCGTCT ACGACAACTC CACTTTACGG
GTCTTTAATA TGCTGGTGGA AACCAGCAAT GACAAAACTC CGCTCAAGGA TCAACTCGGC
ACATCCTTGG ACGTTTTGAA CTCTGATACT ACCGGTAACG GCCACGTGTC CCGCAACATT
CGATTGGCCT TGTTATCGGC GGATTTGGTA CAGCCTTATG TGACACTGCA ACGAGTGAAC
GACTTGCACC TGTCCGACGA TGTAGTATCG CTCTCACGAG AGGATGGTCA CAGTTGTCAA
GGTACACGCA CAGTGACGAT CTCCTCAGAA CGCCCAACGG TCACGTTGGA GTGGACGGTT
GGAGGGGCAC TACAAACCCA TGAGACACAA GTGTTTTACG CCAAATGGAG CGATCTTCCC
TTGGAAAAAC TCGACTGTGT GAACACACCG AACATTCAAG ACGTGGAAAG TCTCATGATT
GAAGGAACCA TGACTTCAGT TACGTCCGGT ACCAACCATT TCGCCGACGC CAAAGCGACC
ACGTTCAAGT CGACCATCGA TGTGCAAAAC TTTCACGCCC ACGACAAGAT TGTGGTTATT
GTCATGGCCA CGGCTGATCA AAATTGGCAT ACACAAGATC CCAAGGACGC TGTTGGTCCG
GAGAATAGTC CACCTCAATC GCACATTGCC AACGCCCGCA TCAATGCGGA TTGGTACTTT
GCCAAAGAGA ATGGCAAAGT CGTTCAGGGC CGCCGCGAAT GGTTTTCCCA ACCTTTGACT
ATCGAGATCG GTGAATTCGC CACCAATGGC ATGGGCGCCC ACGGAAGCCT CGTCGTCGAA
ACCTTCGAAT TGTCGAATCG CCTCGGGGAA ACGACGGGCG GCGGTTTCCC CACTGGTGGT
GTGCGTCCCA ACGCGGGCGT GTCTCCCGGT ACGATACAAC CACGGTCGCT GTTCCGAAAG
GTCGCCGCCG TTGGTATGCT GGTATTTGCA GCGATCGCTG TGGCATACGG GGGGCGGCTC
TACCTACGGA ACAAAATGCG GTCCAGTCGA CGAACGCAAA TTCGTAACTA CATTCAGGAC
GAAAGTGCAC CGAGTCCGGG GTTGCGCGAT ACAGCGCGTG TCAACGGTGC CAGCAAGAGT
GGATACGTTC GCTCGGCATT CCGAGACGAT TTGGATCTCG AAGAAGAAGA TCCTCGACGA
GAGCAAAGAA GCGAAGTGGA ACTGGGACAG TACACTTAGT AGGTCCACTA CTTACCGCGG
CAAACGGCAA AACCTAATCT GATTTGTCAA ACATCATAAA TTGTAGTCTA ATCCATGAGA
CACTTTTGAT CGCAACTAGC CAGAGTAATA CATTT
 
Protein sequence
MGVVRNTAIR QSLLFPTQAF FSGTRLPSSI MERCRSLRIM MVLHLIVLVV LMSAPVNAAV 
FPGHSSSSTA PSTTTKGATT TYIRRELQEE KKQSIVQQYE LWEAEEISSQ IQKWASHYPD
LLRVTTSQEA YGLPRAGGAD DCPFDKGGDG CLNYILTLQD FVKHPEGSAT SNQLPEVLWS
GEVHGNERVG PTAVLEAAQL LMEAASCIAH PRVALRDNPS AWKLELTKAR SCREEMKHMG
LDESHIQWLA RLVSTRRIVI IPTANALGYF RKVREEGNVD PNRDFPYDLT DPTLCMQSVA
ARTLNEVYRE HLFQLALTFH GGMEAIGYEW GAPTWKSKKS PDDLAQQEIA DAYSRYAGGW
FGTRNYQYGP MNDLVYPVRG GMEDWAYAGS WDTDRVIACR PTTFGGYPEA KTVYDNSTLR
VFNMLVETSN DKTPLKDQLG TSLDVLNSDT TGNGHVSRNI RLALLSADLV QPYVTLQRVN
DLHLSDDVVS LSREDGHSCQ GTRTVTISSE RPTVTLEWTV GGALQTHETQ VFYAKWSDLP
LEKLDCVNTP NIQDVESLMI EGTMTSVTSG TNHFADAKAT TFKSTIDVQN FHAHDKIVVI
VMATADQNWH TQDPKDAVGP ENSPPQSHIA NARINADWYF AKENGKVVQG RREWFSQPLT
IEIGEFATNG MGAHGSLVVE TFELSNRLGE TTGGGFPTGG VRPNAGVSPG TIQPRSLFRK
VAAVGMLVFA AIAVAYGGRL YLRNKMRSSR RTQIRNYIQD ESAPSPGLRD TARVNGASKS
GYVRSAFRDD LDLEEEDPRR EQRSEVELGQ YT