Gene PHATRDRAFT_41850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41850 
Symbol 
ID7197631 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp1093879 
End bp1096886 
Gene Length3008 bp 
Protein Length685 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178361 
Protein GI219115131 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCAAGG TAGTTCCTTT GCATGCGGCG GATCACGAAG AATCCCGGTC GGGAGATCTG 
GTGCTCCTCC AAGCAAAGGA ACCACCTAAA GTTAAGGCCG TTACCTTTGC GCCGGTTGAA
GACAGTCTCA AGCAGTTGGA ATCGGCGGAA GGAGGCGACG AAATTGCGGA GGAAGAATTG
CAAGCCAGGT TTGTCCAACC ATACGTGGAC AATCCACAAC AAGCCATGGT CAAGAAAGGC
CTCTTACTAA TGCTCCGGGA TGAAAACAAC AAGGCGCTAG AATTCATGGT GACACATATC
GATACGGAAA ATGATGCTAG TGAAAGTAAA GCCTCTGAAG GTACGTGTCA AACAACACTG
GATTGTATCA GTATTTATGT ATGTTGACTG CACGATAATT TCTAATAGTT CCATGCTTTG
ATTGTTGCGA TTTACAGACG ACGACGAAGA CGTTGCGGTG GCGGGAGAAA TGACTTCTGA
GACGGAAGTC ATCATGGGAA GCTCGACACC ACGTCTCGAA GTTGGACTAG GCTATGACTC
CGTCGGAGGC CTCGATTCAG CGATTCAGCT CATGCGTGAG CTGATCGAAC TTCCGTTGCG
TTTTCCCGAA TTGTGGACGA CTGCTGGTGT ACCTACGCCG AAGGGAGTTT TGCTGCATGG
CCCTCCTGGG TGTGGGAAAA CGCTCATTGC GAATGCTTTG GTGGAAGAGA CGGGAGCGCA
TGTCGTCGTT ATCAACGGCC CGGAAATTAT GGCACGCAAA GGAGGAGAGA GCGAAGCAAA
TCTTCGCCAA GCCTTCGAAG AAGCCATCGA AAAGGCTCCG TCGATCATAT TCATGGATGA
GCTTGACTCA ATTGCACCGA AGCGAGACCA GGCGCAGGGT GAAACGGAAA AACGCGTCGT
GTCACAATTG CTAACCTTGA TGGACTCGCT GAAACCCAGT TCAAATGTCA TGGTCATCGG
TGCCACAAAC CGCCCCAACG TAATCGAGTC GGCTCTCCGT CGTCCCGGTC GTTTTGATCG
TGAACTAGAG ATTGTCATTC CCGATGAGGA TGGCCGCCAT ACCATTTTGA AGATTAAGAC
GAAGGACATG AAAATTAGCG CTGACGTTGA CCTATTCCAA ATCGCTCGTG ACACACACGG
ATACGTGGGT GCGGACTTGC AGCAACTTAC AATGGAAGCT GCTTTGCAAT GTATTCGTTC
CAACATTGCA AATATGGATG TGGACAGCGA GGAACCTATT CCTGAAGAGA TTCTCGATAC
GTTGGAAGTC ACTAACGATC ATTTTATTTA CGCGCTAAGT GTGTGCGATC CCAGTACCCT
TCGCGACAAC AAGGTGGAGA TTCCAAACGT GAAATGGGAA GATATTGGTG GTTTGGAGGA
GACCAAACGT GAACTACAAG AAATGGTTCG GTATCCGATC GAGCATCGGC ATCTTTTTGA
GCGCTTCGGA ATGCAAGCCT CTCGTGGGGT TTTATTTTAC GGCCCACCTG GTTGCGGAAA
GACGTTGATG GCCAAGGCTA TCGCTAACGA ATGTGGCGCC AACTTCATTT CCGTGAAAGG
CCCCGAACTT TTGAATGCTT GGTTTGGAGG ATCCGAAGCC AACGTTCGTA ACCTTTTCGA
CAAGGCCCGT GCCGCCAGTC CGTGCATTCT TTTCTTTGAC GAGATGGATT CAATCGCGCG
TGCTCGCGGA GCGGGTGGTA GTGGCGGTTC CGAAACTAGT GATCGTGTCA TTAACCAAAT
CCTCTCCGAA ATCGACGGCA TGGGATCGGG CAAAACGCTT TTCATTATTG GAGCGACGAA
TCGTCCCGAT ATTCTGGATC CCGGTATCAT GCGTCCTGGG CGATTGGATC AACTGATTCA
CATTCCGCTA CCGGACCATG ATTCGCGTGT TTCAATCTTT AAGGCCAATC TACGAAAGAG
TCCTATCGAC GAAGAGGTCA ATATGAAACA GCTGGCAGAC GCTACTGAAG GGTTTTCGGG
AGCTGACATA ACTGAGATTT GTCAACGAGC CGCCAAGAAT GCTATTCGAG ACAGCATAAC
AGCCGGTATT GAGCGACAAA AGCGTGTCGA AGCAGGGGAG CTTTCGCAAG AAGAAGCCGA
TGCTCTTCCA GACCCCGTAC CGTTTATCAC CAAAGCACAC TTTGAAGCTT CCATGAGCAA
GGCGCGACGT TCGGTAGGCC CCGAAATTGT AAAACAGTAC GAAGATTTTA CTGCCAAGAT
AAAGCAACAA TGGAGTAGCT CCGGTGCCGA AGGTGCAGAA AACGTTTACG ATATCGACGC
GGCAGCAGCC GAACAGGCAC GCGAGGACTC AATGGTAGAG GGGGATGAAG AAACACTAGT
CCCAGTCGTT GGTTCAGATT CCGATAGCAA TGAATAGACC AAGCTAATGA CTGATTATGC
TGACAGTGAA TTTGAATTCT AAATAACCCC TTTGAGCGTA GTATACTGAA CTTTTTCCAT
GGAAATGGAT AGTAATTGTG TGAACAAATG TGGTAGTGGA TTGGGTCCTG TGACATAGTT
CCTATGTCCG GGTACAACGA AATATACTCG CCGTTCATTT TGCGTGTCGA AACGATATCG
GTATCCTTCG TTCCGAACAC TCCGAGATTC ACTCATGTAA GCCGAGTGTT GTAAATTTTT
GTAGACACCC AGGTGATTCT TTTCCTACGA CTTTGCAGTA AAGGATTTAT GTTAGATTAG
AGACTCGCTG GCTACCGTTA TAAATCCCAA CCCGACTGTT GAACCTTGTG GAATATGCGT
TAGACCACAT GATAGTCTAT TATTGGTAGA GCCCTCTAGT CCCACAAAGT CCAGTGTATG
TAGTCTTGTT TAACTGGAAA TCGGTGTCCG AATTGTCGAA CCGTCCTATT CCCACACACA
TATAAATATG TATGGATACA ATACCGGTAG TAATTTGATC AGCGATTTTT TCAACTTTCT
AAATATATCG CATTTACTAA AAAGCCGTAC CACTACGACA AAAACATCTC TTCCCAAAAG
AGGTCTAA
 
Protein sequence
MVKVVPLHAA DHEESRSGDL VLLQAKEPPK VKAVTFAPVE DSLKQLESAE GGDEIAEEEL 
QARFVQPYVD NPQQAMVKKG LLLMLRDENN KALEFMVTHI DTENDASESK ASEVIMGSST
PRLEVGLGYD SVGGLDSAIQ LMRELIELPL RFPELWTTAG VPTPKGVLLH GPPGCGKTLI
ANALVEETGA HVVVINGPEI MARKGGESEA NLRQAFEEAI EKAPSIIFMD ELDSIAPKRD
QAQGETEKRV VSQLLTLMDS LKPSSNVMVI GATNRPNVIE SALRRPGRFD RELEIVIPDE
DGRHTILKIK TKDMKISADV DLFQIARDTH GYVGADLQQL TMEAALQCIR SNIANMDVDS
EEPIPEEILD TLEVTNDHFI YALSVCDPST LRDNKVEIPN VKWEDIGGLE ETKRELQEMV
RYPIEHRHLF ERFGMQASRG VLFYGPPGCG KTLMAKAIAN ECGANFISVK GPELLNAWFG
GSEANVRNLF DKARAASPCI LFFDEMDSIA RARGAGGSGG SETSDRVINQ ILSEIDGMGS
GKTLFIIGAT NRPDILDPGI MRPGRLDQLI HIPLPDHDSR VSIFKANLRK SPIDEEVNMK
QLADATEGFS GADITEICQR AAKNAIRDSI TAAHFEASMS KARRSVGPEI VKQYEDFTAK
IKQQWSSSGA EAVPLRQKHL FPKEV