Gene PHATRDRAFT_42944 details

Gene Information

Locus tag: PHATRDRAFT_42944
Symbol:
ID: 7196191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1592525
End bp: 1597584
Gene Length: 5060 bp
Protein Length: 1422 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002176813
Protein GI: 219110123
COG category:
COG ID:
TIGRFAM ID:
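
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that translation starts at the first base of the listed gene sequence; the function names are illustrative and not part of the record. Under those assumptions the span works out to 5060 bp, and a 1422 aa protein plus a stop codon accounts for 4269 bp of it.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of bases needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(1592525, 1597584)
cds_len = coding_length(1422)

# Bases in the listed span that are not accounted for by the protein plus stop codon.
remainder = gene_len - cds_len
print(gene_len, cds_len, remainder)
```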


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.237933
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGCCC CGCCGCTTGA CTTTGAAGCC CTTCTTTCCA GAGAATACCG GCCGGCACGA 
ATCGCTCGCT TAATTCCGCG AGAGGCCTAC CATCCAGAAA ATAATCAGAA GAAAAAGTCT
GTTTCGATCA AGCATTCTAC GGCTGCGCTC GAGGACGCAA CGCGACGACG AGCCGAGGAG
GAAACCTCAC GATTGGTATT GAGCCTTGTT ACGCAGACTT CTGCTTTTAT TCAGCATCTT
CCTGAATCAG AGCGACGGGA TGTGTCGAGT CTGCTGACGA AGAGGCCTCC GGTGGTCTCC
TCTTGGCCGG CGTACGAGAC CATCCCAGCA CGATTCCACG AGATTGAACT CGGCGACTGG
GAATCGAAGG TTCGTTGGAA AATTGAAGAA GAACCAGAGG GCGAGTCCAA ATATTCCAAT
CGAGACCCTA CGGATTTACT GAGACGGCCC CGAAATCCCT ACTTGGATAA CCTTGTATTA
GACGAATCCA CAATTTGTTG GGACGGATCA CTCGAGAAAC TGCAGGAAAA GGCTCGAAAT
ACTCCTCTGA TTTTAGAACT TGGCGTAGCT GGACAATCTG TAGCCCGTCA TGTGTATCAA
AATACGGTTC TTTCTGCACA ACGCCCTACA CCTGCACTTA AGTCGGATGC CTATCAAATG
CGGCGAGAAC GGGAATGGGC CAATCCAATT ACTTCCACAG CAGAAGTCTC TAAAGCTGGT
TCGCTGCACG CTGATAAGGA CAAAATGGCG GCGTTGATCG AGGCTCGTCA AAAGCAACGA
GCACAAATGG CCGAAGACAA GACGAATCGC GTGACAGAAG CCATGGGAAC GTTGGCTTTG
GGTGGTGGAA AAGGACGGAC TATCACTTCC TCGCTGATGG GTCCAGGAGG AACTGAACGC
AGCGGGCGGC CGTCCCGAGA TGTGGGTTCA TCGGGATTGC ATGAGGCCGA ATATATTGAG
CAGCTCGATA TGATTAATAG CCATAGTCTC GTGCGCGATC TCTCCAAGGT TCTGTTACGA
CAATATCATA GACCCAAGTT GCCCCTCAGT GTCGTTCGTC AAGACCTGTC CTGGCAATTC
CAGATCCGCT TTGCTCCCAC CAGTAAAAAG ACGGAAGTTA CCGGCGCTTC AGGATCATAC
CAGGCAATTA TGACAGGAGC TCACGCAGGT GCGATATCAA AGGCCAAGCT GCGCAGCGAA
GCGGATCTGA GTCCAACAGA AGGCAAGCTT ATTTTGCTTG AGTACTGCGA AGAGCGCCCT
CCAATCCAAC TAACAAAAGG CATGGCGAGC AAGATTGTCA ATTATTACCG TGGAGACAAG
GCACATTGCC CTGTTTCGGC TGGTGGGGGT GACCGGCCCG CACGCAGAAA AAGAGCCGAG
CCCATTGCTG GAGAGGCAGA CGCTCGATCG AGCCGTGCAG AACGACTTCC TCGGTTGGAA
GGGCCAAGTC GGGAAACGTC GGTTTTGGAA TGGGTTGGCA AAGTTCCTAA GAAATCACAG
AAGGAGCGTG CGGAACAAGA TGCAATAGAT ATCCTTCCGG AGGGTGTAAC TGAAAACTTA
CATCCCAAGG TCCACGGGCC ATTTCTTGGC GAAGTCGAAG ACAGCACAAC AGTGACCGGA
ATTATTACAA ACTTGTTCGT CGCGCCGATG TTTCTTCACG AACCTGAAAC AACTGACTTT
TTGATGGTCC TCACACCACC TAGTGGAGCG GCAAGGCCCG GCCAGCGTGA GTCAATGAGC
GTGATCCTTC GAGATTTGCC TACGAGCACT TTCACTGTTG GTCAAACAGA GCCTCGTGTG
CGGGTCTTCG CGCCAAACAC CCAGGGTGAA AAGAACTTTG TAGGGCCTTT TGTTTCATAT
CAAATTGCTA GAGCTCTCGC TCGTTCTCAA GGTCGAGAAG GACACGGTTT ACGATTTGAC
GAGATCCAAG ATCGCGTACT GCCTAACCTT GAGTTACCGT CGAATGCGTT ACGGCCCCGC
CTCAAACAAG TCGCTCTGTA CGACAAGAAT ACTCAGATCT GGACGACGAA ACAAATAGGG
TTTGAAGAGT ACCCCGGAGT TGACGCCCTC GGCAGGACTA TTGCACCCGA AGGTGTTGCA
GCTTTTGAGA GTGCTTGCGC AGCCAGTCGC CGACTGTCAG ACCTTGGAAT CCACCAACTT
CTAGCTGGCT CACATACTGT TCTAAGCGTG GGCGTCATTA TGGTCTATAT TTCCGGACAG
CTGAATGCAG CCAAAGACTT GTCAAGAAAA ATGAAGAAGC TAGCGGAATT GCGTCGCTCG
AACAAGAGCA TCTCGGCTGT CCAAGTCGCT TTTTATGAAC AGGCGGCCGC GATCATTGAA
TCTCATTTTA AGATCTTGCG GCAAAAACAT GAAATTGCAC AGTTTATATA TGAGCAGCTT
CAACTTGCTC CATGGCATTT GACCGGTGAG TTTATCGATG TTCATAAAAA AGGCGACGGG
ACTGGCATGA TGAAGCTAAC TGGTCTCGGC GACCCAAGCG GTCAAGGAGA AGGTTTTAGC
TTTATCCGTG AAGCGGACTC GAAACCAAGC AAGTCCGTCG GGAATGCAGC TCTGAGTGCA
GAGGTCAAAA AGATTACTGG GACAGAAGAT GATCTTCGAA AACTTACGAT GAAGCAAATG
GCGAGCTTGC TTCGCTCATA TGGTATGACA CAGGAAAAGA TTGATACACT AAAGAGGTGG
GATCGAGTGC ACGTCATCCG GGATCTTTCC ACGAAAGCCG CGAGCGATGG AATAGGTGAC
GGCCTTGAAC GCTTTGCCCG CGGTGAAAAA ATGAAACTTT CGGAGCAGAA GCAGATGTAT
CGGGATCGTG TCCGAGTCAT ATGGAGGCGA CAGATTGCTG CATTATCGAT GGATGACAAG
GTCGCTGGAA GCACAGAAGG AGCGGCTATC GCTGACGGCG AGAACGAAGT TTCTGGAATG
GCACAGCAAT CACAATCCAA TAAGCCGGAC AGCACCAGTA AGCTAGGCTC CGATTCAGAT
TCATCAGATG ACGATGACGA TCTTGCAGCG GCGTTGGAAG ACGAAATGCT GGATCGATCG
GAAGCGAATC AGCTCGTTGC GGAGCATACT GGCGGAGGAG AAGCTGACGG CGGTCTGGGA
CAGCTGCGGG CGGCTGCACA GGACCACGAG ATGAATAAGG ATGCCCGCGA GCTTGCAGCA
TTAAAGCGGC AGCGTGAGGA GGAAAGGGCA GTCCGCGAAG GTTTGCAGTC GAACAAGCCG
AAGGTAGAAT CCTTCGATAC ACAAATGCGT TCAAACAGAA AGGTCATAAG AAAGAAGGTA
GTAAAAACAC ATCCCGACGG CCACCAGACA ACAACCTTCA AATTCGTCCT CAGACCCGAC
GAAGTCGGAA AGATCATGGC CCGGCTTCAG CAAGACAACA GCGAAGATCA CCGTCGAAAA
AAAGAGTTTC AGTACGAAGC AAACTCAGAC GAAAAGCCTC CAGGTCAAGC TTTGTTCGAA
GACGAAGACG ATTTTGAGTA TTCTTCCCGC GGGCGCTTTG CCGACAAACG CGGGGGGAAT
CGTAAGCGGC GAGCCGGAGG TCGAGCAACT CCCCGGGGTA CGCTCCAGTT TGGAAAACTG
AAAAGCAAAA TATCCAAAGA GGAACGGATG CGGAAGAGAA AACGGGAAGA GGAAGAATTA
GAAGTTTATA CTGCGTCAGC GAAGCACAAA GGAACTAACA ACCGCAAGGA GCGCGGTTCT
ATTCGAAACC GCCGACCACA TGTTATTTTC TCCGAAAAGT TGGAAGCAAT TCGGTCAGCC
GTAGAAGCCC GTCCAGGAGC TTTACCGTTT GTGAAACCTG TTAATCGACG TCTTCTACCC
AAGTACTACG AAGTTATCAG CGATCCCATC GACCTGCAGA CAATTCGAGA TAAGATCAAG
CGGTACGAAT ACAGATCTGC CGACAACCTT GTTCGCGATT TTGACCTCAT GAAAAGCAAC
GCCGTCAAAT TCAATGGTCA AACCAGCCCC ATCGCTCAGG AAGCGATCGC AATTCACGAG
TTTGTTTCGA ATCAAATTGA ATCACACCGG TCCGAACTTA GCGCTCTCGA GACAGCGGTA
CAGGATCAGA TGAATGGAAA GCCGAAAAAG AAAGTCAAGA AAGGCCTAAT GAAATCTAGT
GGATCTGGAA ACACTGCAAG AATAGGAGGC ATATCAGTCA ACCTTGGAGA TTTTCAGGGA
ATGCAATTCG AAGGGAACGA CTCAGATAGT GGGGATGAAG TTTCGTTTAC AGGGCTTTTG
GATTTTTAGA GGGAGATTAA TGATCTTCTT AACATACTTT TCTGATGTTA AGGGTTCTAT
TGCCATGTAC CTCGATACAT GACGACTTCC TCTCCTTCTT CAATATCCTC CGTTGCCACC
ATGAACACGT CGAAACTGAC CCGATTCCGT TCGTAGACAG GCGAATACTT AGGGCGAATT
CCCGACGGAT GATTGGGCAC CCATTTGTCT ACGTTTGCCT CCTCTCTTCC GTCTACACGC
CGAATCAAAA CGGAACCGCC CAATTCGACG TAGTGCTGGG CACTGCCATC GGCCGCGCTT
TCGTGCGCGT AAGATTCAAA AAATCCGAGC AAGTCCTGAA TCACTGCCAC ACGACCCCCG
CCCACTTCTG TATTCTTTTT CAATCCTTCC AAGTTGCGGG TTGTGACTTG CAGTGAGCTG
GCGAGATGCG CTGGCATTAC AAATGATCCT CTCGGAATGC TCGTAGTTGC GTACACTTTC
GTCGAAATTA TCTTGCCACT TTCGTTTTTG TGAGTTTCAA CACGGAACGA ACTGTTCTCT
TCATTTTCCA AATCAAAATC GTGCAATTCT GCCTGCATAT TCATGGTACG GTAGGCACAC
TCAAACGGCA TTGGTTCACG ACGACAGTAG ACGGTCTCCC ATCCCTTCTT GGGCCACTGA
TACGAGCGCT GGGTGGTCCC ATCGAAATAG GTCAAGGCCC GAACTTTGTT ATGCGTTCGA
ACGATCCGAT CATAAATTTC GTAGTCAACC TGATCGCTAC GAGCGTACCA GCGGCTACGG
CAAGTGACGC TCTTACAGAC
 
Protein sequence
MAAPPLDFEA LLSREYRPAR IARLIPREAY HPENNQKKKS VSIKHSTAAL EDATRRRAEE 
ETSRLVLSLV TQTSAFIQHL PESERRDVSS LLTKRPPVVS SWPAYETIPA RFHEIELGDW
ESKVRWKIEE EPEGESKYSN RDPTDLLRRP RNPYLDNLVL DESTICWDGS LEKLQEKARN
TPLILELGVA GQSVARHVYQ NTVLSAQRPT PALKSDAYQM RREREWANPI TSTAEVSKAG
SLHADKDKMA ALIEARQKQR AQMAEDKTNR VTEAMGTLAL GGGKGRTITS SLMGPGGTER
SGRPSRDVGS SGLHEAEYIE QLDMINSHSL VRDLSKVLLR QYHRPKLPLS VVRQDLSWQF
QIRFAPTSKK TEVTGASGSY QAIMTGAHAG AISKAKLRSE ADLSPTEGKL ILLEYCEERP
PIQLTKGMAS KIVNYYRGDK AHCPVSAGGG DRPARRKRAE PIAGEADARS SRAERLPRLE
GPSRETSVLE WVGKVPKKSQ KERAEQDAID ILPEGVTENL HPKVHGPFLG EVEDSTTVTG
IITNLFVAPM FLHEPETTDF LMVLTPPSGA ARPGQRESMS VILRDLPTST FTVGQTEPRV
RVFAPNTQGE KNFVGPFVSY QIARALARSQ GREGHGLRFD EIQDRVLPNL ELPSNALRPR
LKQVALYDKN TQIWTTKQIG FEEYPGVDAL GRTIAPEGVA AFESACAASR RLSDLGIHQL
LAGSHTVLSV GVIMVYISGQ LNAAKDLSRK MKKLAELRRS NKSISAVQVA FYEQAAAIIE
SHFKILRQKH EIAQFIYEQL QLAPWHLTGE FIDVHKKGDG TGMMKLTGLG DPSGQGEGFS
FIREADSKPS KSVGNAALSA EVKKITGTED DLRKLTMKQM ASLLRSYGMT QEKIDTLKRW
DRVHVIRDLS TKAASDGIGD GLERFARGEK MKLSEQKQMY RDRVRVIWRR QIAALSMDDK
VAGSTEGAAI ADGENEVSGM AQQSQSNKPD STSKLGSDSD SSDDDDDLAA ALEDEMLDRS
EANQLVAEHT GGGEADGGLG QLRAAAQDHE MNKDARELAA LKRQREEERA VREGLQSNKP
KVESFDTQMR SNRKVIRKKV VKTHPDGHQT TTFKFVLRPD EVGKIMARLQ QDNSEDHRRK
KEFQYEANSD EKPPGQALFE DEDDFEYSSR GRFADKRGGN RKRRAGGRAT PRGTLQFGKL
KSKISKEERM RKRKREEEEL EVYTASAKHK GTNNRKERGS IRNRRPHVIF SEKLEAIRSA
VEARPGALPF VKPVNRRLLP KYYEVISDPI DLQTIRDKIK RYEYRSADNL VRDFDLMKSN
AVKFNGQTSP IAQEAIAIHE FVSNQIESHR SELSALETAV QDQMNGKPKK KVKKGLMKSS
GSGNTARIGG ISVNLGDFQG MQFEGNDSDS GDEVSFTGLL DF
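
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up together with a GC-fraction calculation; the helper names are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence as printed in the record.
first_line = "ATGGCAGCCC CGCCGCTTGA CTTTGAAGCC CTTCTTTCCA GAGAATACCG GCCGGCACGA"
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Applying the same two functions to the full gene sequence block would allow its length and GC fraction to be compared with the Gene Length and GC content fields.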