Gene PHATRDRAFT_46118 details


Gene Information

Locus tag: PHATRDRAFT_46118
Symbol:
ID: 7201450
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 310110
End bp: 312470
Gene Length: 2361 bp
Protein Length: 604 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002180412
Protein GI: 219119298
COG category:
COG ID:
TIGRFAM ID:
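
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the listed Gene Length, and that the protein is followed by a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with one stop codon."""
    return protein_aa * 3 + (3 if include_stop else 0)


span = gene_length(310110, 312470)
cds = coding_length(604)

print(span)        # 2361, matches the Gene Length field
print(cds)         # 1815
print(span - cds)  # 546 bp not accounted for by codons
```

Because the gene is marked as spliced, the remainder is expected to be non-coding sequence (introns and any flanking untranslated bases) rather than an inconsistency in the record.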


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTCTGCGTTT CGTGGTTCGC CTCGGTTCGC AAAACTTCAT ACAATGAATA CATTCGCAAG 
GCAATCTCTA GACAGCATCT CCCTGTCACC GGAAACGAAC TCATTCATAA CACCTCCTTC
GCACAGACGA GACTCTTCTT GGAGGGCAAG AGGTGAAAGT AAAACATCAA AGATGGACTC
TTCGACAGAT CAAGCTATTT CCGGAGATCA GCCTATGTTC AACATCGAGC GGCCACATCA
CAACGACGTA CTTTCAGGAC GAGGTGTCAC CACCAATAGG CATGAAGGTA ACTCCAATTT
TAGAGGTTTG GTAGCTTTGA ACAAGGTAAG AAAGAAAAAG AAAGACCAGA AAATCCTCGG
CCGGAATCGT TTTTGCATTG ATTTACTAGG TATTTGATTG TGTGAAGGTA TGCGTGGCTA
GGCAGGATTC TTGATTCCCA GATTTGTTCC TGCTTTTAGT AGGGGTTAAT ACGCAGTACC
GCTTTCGCAA TAGAATCCGG CACACCGTCC ATTTAATTTG TTTCTGTTAA ACGGCACGTA
CTATCAGCCT CGTAAACTAA CCCAAATGTG TTTCTTCTGG AGCAGGAATT ATACGTCACA
AGCACGAAAA GACAAAAGAT GCAGATATCT CGAAGTATTG TCGATGCGGT GCGGAGCTTG
AATCCGCCAG GTCGATTTCT TGAAAAGAAT CCAGAAACAA GACTTTACAG CGATATCGGC
GAGCGGAAGG CAATTGAAAA AACGAGCCAG GCACTTCGCG ATGGAGCCTC CAGCCTTCGT
AAGCAGCTTT CCGAAGATCT AGGAGATCCA GATTTTCTAA CTGCAGTATT TGATGGAGAC
AGTACATCCA GCTTCATGGA CAAAACGAAA TCACTGAAGG TATGATTTAT TTGGGGGCTA
GTTTTACTTC ATTATCAGAT CGCTCAATCG AATTTCCCAA TTTATTGCTC CCCGACTACA
GACTAAAGCT ATGAAGAAAG GCCATAGACG TACTAAGTCA ACGCCTGATG CACCCACGGT
ATCGAAGCAT AAGTCCCCGG TGCGGAAGAT GAAGTTATCG GATCATCCAT TATCTCCCAA
TATGCCGCAG CGACGGTCAA TTCCGCTGAA GGCTCCACGG TCACAGCCGG CGTCCCCGAT
GGACGGGAGA CGCCGGGGTC CGTACGCAGA GTCCCCGTCG CCACCTCCCT TTCCTTCGCA
CCAGGAATTC CAAGCTTTTC AAAAATCAAA TAGGCCTCAC GGGTTCGGTG CTTCAAGCAT
CTATAGATCC CATGGTGGCC AATACGATGC TTCTTACCAT CCAGAACCAC CCGCCCAATG
GCACCATGGT CCTGCAAGGC ACCACGGGTG CTACCTTTCT CCAGTGTACA TTGGCTATCC
ACCACCTCCG TACCAACCGC ACCCATCGCA TCGCCCGACT TCTCCAGACA GTAATTTACA
TATACGTAGA CCTCCCTTGT CTCCTGGCGA TCGTTCTTGG TCATCTGGCA CATTTATTAC
GCAAGGACAG TCTCCGCGGC CGATGAAGCG ACAGCATTCC CCGCCATCGT TTACAGGATA
CTCAGGAAGC GAGCTGTCTG TTCCTCCTTT AGAACGCGGC GGACACAAGG ACTATTTTCG
CACTCCAATC CAGCCAACGT CGTCGCCTTC AACAGAACTC TCTCCACGTG CAAGGCCAAA
CGATTTCTAC AGCGACCTGT ATGGAACGCA TCTCACCAGT ACGGTCCAAG ATTTCCTTCC
ACCGCCATCA CCAGGTCGAG CAGCGCATAC GGCAGGTAGT TTTGGCTCCA GCACGGGAGG
ACGCGGATCC TACAATGCAG CGCTCAATGC GGGAAAACCG CGTAACGCAC GATCCTTTCA
GAACGATGTT TTCAGCACGA AGAGCTACTT ATCTGAAGAT GCTTGCGGGA CGAGCCCGAC
TGTTGTTTCT TCAGATATAT TCCAGGGATC ACTCTCAGGA AAGTCGATTA GTCAGGAAAT
TAGCAAGAAA CTGAAGAGCG AAACGGAAAC TGATAGTTTT TTGTGCTATC CGACCGACGA
AGACGACATT GAGGGCGTAG AACGGGCTCT TTCGCCTCTC CCGTATGATC GGAACGATGT
TGGTGATTGG ATGGATATGT CCGACGATCT TTTGAAACTT CCTATTGCAT CATGCGGACC
TTATGATGTA AATATGGACG CGAGTGTCCG GGCAATCTAA TTTGGTACTG AAAAGTGGAT
GGTCCGAAAG GGTATGGAAA GGTTGTGCAT GTTTAAAAAT GTGTTTTTGT GATGTAACAG
TCCGTGTTCT GTGAAAGTAG TTTTCATAAT GTGGATATTT CACTATCAAT GATCCTTGAA
GTAAAGATAC AATACTTTGG C
 
Protein sequence
MNTFARQSLD SISLSPETNS FITPPSHRRD SSWRARGESK TSKMDSSTDQ AISGDQPMFN 
IERPHHNDVL SGRGVTTNRH EGNSNFRGLV ALNKELYVTS TKRQKMQISR SIVDAVRSLN
PPGRFLEKNP ETRLYSDIGE RKAIEKTSQA LRDGASSLRK QLSEDLGDPD FLTAVFDGDS
TSSFMDKTKS LKTKAMKKGH RRTKSTPDAP TVSKHKSPVR KMKLSDHPLS PNMPQRRSIP
LKAPRSQPAS PMDGRRRGPY AESPSPPPFP SHQEFQAFQK SNRPHGFGAS SIYRSHGGQY
DASYHPEPPA QWHHGPARHH GCYLSPVYIG YPPPPYQPHP SHRPTSPDSN LHIRRPPLSP
GDRSWSSGTF ITQGQSPRPM KRQHSPPSFT GYSGSELSVP PLERGGHKDY FRTPIQPTSS
PSTELSPRAR PNDFYSDLYG THLTSTVQDF LPPPSPGRAA HTAGSFGSST GGRGSYNAAL
NAGKPRNARS FQNDVFSTKS YLSEDACGTS PTVVSSDIFQ GSLSGKSISQ EISKKLKSET
ETDSFLCYPT DEDDIEGVER ALSPLPYDRN DVGDWMDMSD DLLKLPIASC GPYDVNMDAS
VRAI
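
The sequences above are displayed in blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step. The helper names are hypothetical, and the GC calculation is a plain G+C fraction, which may not be the exact method used to produce the GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as a short sample.
sample = "CTCTGCGTTT CGTGGTTCGC CTCGGTTCGC AAAACTTCAT ACAATGAATA CATTCGCAAG"
dna = clean_sequence(sample)

print(len(dna))                    # 60
print(round(gc_fraction(dna), 2))  # GC fraction of the sample line only
```

Pasting the full gene sequence into `clean_sequence` should give a length equal to the Gene Length field, and `len(clean_sequence(protein))` can be compared with the Protein Length field in the same way.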