Gene PHATRDRAFT_54505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54505 
Symbol 
ID7201038 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp815477 
End bp818897 
Gene Length3421 bp 
Protein Length1056 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180126 
Protein GI219118718 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGACGGAATT TGCTGAGTCG CAGAGGATTC GCCACAAGCA ACAGCGCTTG CTGCTGCTTC 
GTCATGCATC CCGCTGTCAA CATGAAGCGG GGAAATGCCC AGCTACCCCT CACTGCGCTA
GTATGAAGCG ACTTTGGAGA CATATTGCAA ATTGTAAGGA TCAGGACTGC TCTGTTCAAC
ATTGTCTTAG TAGTCGCGGC GTTCTCAGCC ATTATCGACG GTGTAAAGAT GCGCTCTGTC
CTGCATGTGG GCCTGTCCGA GAAACTATAC GGAAAAGTCA TGAGATGGAA AGTCAAAGCA
ATCCACAAGG GGTACCGTCC GACAATCGGT TTATGGGTCG AGATGATTCG TTCGGTCGGT
CAAGTTCCGT TACATCGCCA ACCGAACAGG AACCGAAGCG TATGAGAACA GAACATCGCC
CTAGCGCGGC GTCTATAAAA TCAGCGCGCT CTACGCCTGT GAGCGCGCCT CCTTTGAAGC
AAGAACCCCC TCGAAGCATA GGCAAAGGTG AGAAAGTAGC TCCATCTGCT GAAAAAGATT
CGAAAGGAAG TGTCGACCGA TCACTCCTCG AGAGTTTCTC GGTGAAGGAG CTCGAAACTC
ATTTGCGATC GCTGGAACGA GAGACCCAAC TTCCTCCGGC GAAGCTCAAG TCTAAATGTC
TGGATGTATT AAAGGGTTTA ATGGCTCACC AACACGGTTG GGTTTTCAAT GGTCCAGTCG
ATCCAGTTGA GCTCGGTCTT GTTGATTATT TTGAAATTAT CAAGAAGCCC ATGGACCTCG
GCACCATTCA AAAGCGTTTG GAAAGTAGTG CATACCACTC CATCGATGAC TTTAAAACGG
ATATCTTCTT AACTTTTGAG AATGCAATGG TGTATAATGA GGATGGTTCC GTTGTCTACG
ACATGGCGAA GCAGCTGAAG GTTAAAGCCG AATCTGACAT GAAGAGACTT GTGGCACAAC
TGGAAACAGA AGACCTTGAA AGACGCCAGA ATGAACGCGC GTGCACCTTG TGTGGTTCAG
AGAAACTGTT GTTTGAACCT CCTGTTTATT TTTGTAACGG AATTAATTGT CAATCGCAGC
GGATCCGACG AAACAGTCAC TTCTATATCG GAGGAAACAA CCAATACTTT TGGTGTAGCC
CTTGCTTTAA TGAACTTGAT GATAAAATTC CGATTGAGCT TGCCGACTTG ACAGTCATGA
AAAACAATCT GAAGAAGAAA AAGAATGACG AGATTCACGA GGAGAGCTGG GTACAGTGTG
ACACTTGCGA ACGGTGGGTT CACCAGATAT GTGGACTTTT TAACACCCGT CAGAATAAAG
AGCACCACAG CGAGTACTGT TGTCCTAAAT GTTTGCTTGA AAAACGCAAA ACTGTTTCAA
TAACTCCAGC GCCGAAGCCA TTGCTGGCTG CGGACTTGCC GCGGACTACT TTATCGGAGT
GGCTAGAACG CAGTGTCACT AAGAAAGTGG AAAAAAGGAA GAGAGAACTG GCCGAAGAGC
GTTCGCAGAA TGAGGTACGT GTCTCTACAT TTTGTTCATC GAGCAAGTCG TTTATGTATC
GATTTAACTA AGGAAGTCTT TTCTCTTTCA GGGGATATCT CTTGAAGAAG CTTTGCGACA
GGTAGAAAGT GGCGGCCCAA TAATAATTCG TCAAGTTACC GCGATGGATA GAAAGCTTGA
GGTTCGCGAG CTGATGAAAA AGCGATATGC ACACAAGAAT TATCCTGACG AATTTCCCTT
TCGGTGCAAA TCGATTGTCG TTTTTCAGCA TCTTGACGGA GTTGATGTCA TTCTGTTTGC
GTTGTATCTC TACGAACACG GTGAAGACAA TCCTCCGCCC AACCAACGAA CCGTGTACAT
CTCATATCTG GACAGTGTTC ACTTTATGAG GCCTCGCAAA CTCCGGACCT TTGTGTACCA
TGAGATTCTG ATTGCCTATT TGGACTACGC TAGGCGACGG GGATTTGCAA CTGCTCATAT
TTGGGCATGC CCACCTTTGA AGGGTGACGA TTACATTTTC TACGCTAAAC CAGAAGACCA
GAAGACTCCG AGAGATTCAC GACTGCGCCT TTGGTACATT GACATGCTCG TAGAATGTCA
AAAAAGGAGT ATCGTCGGCA AAGTAACGAA TATGTACGAT ATTTATTTCG CAGACCCGAA
TTTGGACGCC ACTGCTGTTC CCTATTTGGA GGGCGACTAT TTTCCTGGTG AAGCGGAGAA
TATTATAAAA ATGCTCGAAG AAGGTGGAGG CAAGAAACTT GGGTCAGTGG GGAAAAAGAA
GAAAAGCAAA TCGTCGAAAG CGCAGAAGAA TAAGGGAGGA AATACGGGTA CTAGATCCAC
TGGAGTCGAC GAAGAAGCGC TTATTGCGAG TGGTATTCTG GATGGAACCA AGAGTTTAAA
GGACCTTGAT CGTGATCAGG TCATGGTGAA GCTGGGTGAA ACGATTCAGC CTATGAAGGA
AAGTTTTATA GTAGCGTTCT TAAATTGGAA AGATGCTCGC GAAGAAGATA TGATAGTCCC
AGAAGAAATC GAAATGGCTA GGATTGAATA CGCAGCGAAA GGTGATCCAG AGCTTGTTGG
AAGCAAACGT GATGCTGCTG GAAACATGAG AGACGCTACG TCGAAGACGG GCGCGAATGG
AGAGCCTGTA AAGGTTATTG ATGACGACGC TGAAGATCTA GATTGCGAGT TTTTGAACAA
TCGCCAAGCA TTCTTGAATC TTTGTCGAGG AAACCATTAT CAATTTGACG AGCTCCGGCG
AGCAAAGCAT ACTTCATTGA TGCTCCTTTG GCATCTACAT AACAGAGATG CACCAAAATT
TGTGCAGCAG TGCGTTTCTT GCAGTCGCGA AATCCTCAGT GGCAAACGTT TTCACTGCGA
CACGTGCCCT GACTATGATC TCTGTCAAGA TTGCTACAAA GACCCTAAGG CAAACAGAGG
TAACTGTACG CACGCTCTTA AACCACTCGC CGTTGAAGCT GATTCCGGAC AGGATCGCAG
TGGGCTATCA GAGCAAGAAC GCATGCAACG CCAGCGAAAC CTGTTGTTAC ACATTCAACT
TATCGAACAC GCTTCAAGGT GTTCCTCTCA GACATGTTCT TCATTAAATT GCGCAAAAAT
GAAAAAATAT CTGCAGCATG CTCGTGTCTG CAAGGTTAAA GTATTAGGAG GGTGCAAGAT
TTGCAAAAAG ATCTGGACCT TACTCCGAAT TCATGCGCAG AAATGTAAGG ATACAAATTG
CCCCATTCCA CAATGCAATG CGATTCGTGA GAAGATGAGG CAACTGCAAA AGCAGCAGCA
GGCTATGGAC GACCGGCGCC GTCTGGAAAT GAATCGTCAC ATGCGTTTCT CCACCGCAGG
AGGCTCTTGA GAAACCGAAA ATATTTTTCG TAATATAATA GAGCTTTGTT TTCATATTTT
A
 
Protein sequence
MKRLWRHIAN CKDQDCSVQH CLSSRGVLSH YRRCKDALCP ACGPVRETIR KSHEMESQSN 
PQGVPSDNRF MGRDDSFGRS SSVTSPTEQE PKRMRTEHRP SAASIKSARS TPVSAPPLKQ
EPPRSIGKGE KVAPSAEKDS KGSVDRSLLE SFSVKELETH LRSLERETQL PPAKLKSKCL
DVLKGLMAHQ HGWVFNGPVD PVELGLVDYF EIIKKPMDLG TIQKRLESSA YHSIDDFKTD
IFLTFENAMV YNEDGSVVYD MAKQLKVKAE SDMKRLVAQL ETEDLERRQN ERACTLCGSE
KLLFEPPVYF CNGINCQSQR IRRNSHFYIG GNNQYFWCSP CFNELDDKIP IELADLTVMK
NNLKKKKNDE IHEESWVQCD TCERWVHQIC GLFNTRQNKE HHSEYCCPKC LLEKRKTVSI
TPAPKPLLAA DLPRTTLSEW LERSVTKKVE KRKRELAEER SQNEGISLEE ALRQVESGGP
IIIRQVTAMD RKLEVRELMK KRYAHKNYPD EFPFRCKSIV VFQHLDGVDV ILFALYLYEH
GEDNPPPNQR TVYISYLDSV HFMRPRKLRT FVYHEILIAY LDYARRRGFA TAHIWACPPL
KGDDYIFYAK PEDQKTPRDS RLRLWYIDML VECQKRSIVG KVTNMYDIYF ADPNLDATAV
PYLEGDYFPG EAENIIKMLE EGGGKKLGSV GKKKKSKSSK AQKNKGGNTG TRSTGVDEEA
LIASGILDGT KSLKDLDRDQ VMVKLGETIQ PMKESFIVAF LNWKDAREED MIVPEEIEMA
RIEYAAKGDP ELVGSKRDAA GNMRDATSKT GANGEPVKVI DDDAEDLDCE FLNNRQAFLN
LCRGNHYQFD ELRRAKHTSL MLLWHLHNRD APKFVQQCVS CSREILSGKR FHCDTCPDYD
LCQDCYKDPK ANRGNCTHAL KPLAVEADSG QDRSGLSEQE RMQRQRNLLL HIQLIEHASR
CSSQTCSSLN CAKMKKYLQH ARVCKVKVLG GCKICKKIWT LLRIHAQKCK DTNCPIPQCN
AIREKMRQLQ KQQQAMDDRR RLEMNRHMRF STAGGS