Gene PHATRDRAFT_40783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40783 
Symbol 
ID7198640 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp386319 
End bp388405 
Gene Length2087 bp 
Protein Length663 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184794 
Protein GI219129223 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.26017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTGG CTTCTGGGGG AAGTATGGGT ATAAAGAAAG GACCCGTTAT CCCATTAGAA 
CGGTACGTAC CGAACGAGTA CACACAGCAT TGGTTTTGCT ACTATACACT GCGAAATTAT
TTCCACGCTC ACGTTTCTTT ATATCTTGCA CTTGTAGCAA AGAATTGATT TCATTAGGCT
CCTTATGGTT TGGCCTTTTC GTAATCCTCG CCTTACTCCA CGCCGCCGAG ATTGCCATTA
CCACCCTTTA CCCCTGGAAA GTCCGCGAAA TTGCCGAAGA AGAGGAAAAG CAAGGCAACA
TGCGTGGCAC GTTCAAAGTT CTCAACGAAG ACATTACCCG CGTCCTTACC ACAATCCTGG
TTGCCTCGAC GGCCTGCTCC ATTTTTGCCA CGACTTTGTT TACGCATTTG GTGGCGAGCC
TGTTTGGATT GCAAGGCGAA CGATACGGTG CCATAGCACT AACCGGATTG ACGTTGTTCT
TCGTGGAACT TTTGCCCAAA AGTTTGGGTG TCACCAATGC CGAAACAGTC GCACGAATAA
TGGTACCACC CGTTAATGTC GCTTCGGCTA TTGTGAGTCC GCTCGGTATT TCTCTCTCTT
GGCTAGCCAA GCGCACCTTG TCCATGCTAG GTGTCAAGGA CAAAAACAGT GGCTCGGGTG
TATCCGACAG CCAGCTGCGC TTGATTGTAA CGGGCGCCTT GGATTCGGGT ACCATTGATC
ATGGTGAACA AGAAATGATT CAGGGTGTTT TAAAGTTACA AGATCAGCGG GTGAAGGAAA
TCATGCGCCC CCGCGTCGAA ATGGTAGCAG TTCCAGTAGA CATGTCGGTC GCTAGCGTAC
TAGGCGTCGT TCGAGAGTCC GGATACTCAC GAATTCCCGT GTACGATGGC GAGATCGACA
ATATCGTGGG CATTGTACTG GCCAAGTCCG TGTTGGATTT TTTCGTAAAT GGAGTGCTGG
TCGACGAAGA TTTGAGCAAA AAGTTGGGTA AGAATACCGA AGAAATCAAG GCTGCAGTAG
AAGACCTCAA GGCCGCTGAC GAAGCTCGTC GCGGCGAACT ACCTGACGAT ATACAAGCCG
ATGCGGACAA GGTCATGGAA CGCGTGGATC TTAAAATTGA TGCACTAGTC GATCAGCGTA
TCGATGCGAA CATTGACGCG TCCCTTCCAC CCAGTCTATA CACTCCGGAG CGTATTCCGA
TTCGTAGTAA GGGCAACAGC AATGAACCAC AAGGATATGT TCGATCACTG ACAGCAACCG
AGTTGGCTTC TCGGATGGAG AAATCTATCC AGGAAGCTGG CTTGATCGAG TCTTGTTATT
TTGTGCCGGA CACCGCCAAC GGGTGGTCAG TTCTACAAGA AATGCGTCGT CGTCGGGTGC
ATTTGGCAAT TGTCGTGGAC GAATTCGGCG GAACAGGAGG ACTGGTGTCG CTGGAAGATA
TTGTGGAAGA AGTGGTTGGT GAAATCTACG ACGAAGACGA CGATGAGGAT TTTCAGGTTT
CGGAGGACTC GATTGCCATG CAGGACGACG GAACCTTTTT GATTCGCGGC GACGCCGATT
TGGAAGACTG CGATACTATT TTGGAACTGA ACCTGGATGA GGAAGAAGCT CTGAAAGAAT
TTTCGACCCT ATCTGGTTTT CTGTGCATGT GTGCAGGGGA GATTCCTTCG GTTGGGGACT
TTATTATGAG CCGAGGTTGG TCGTTTGAAA TTTTGAGCGC AGATGACAAG AAAGTGTTAC
TCGTCAAGGT GGAGCGCTTG GTTGGTGCAT TCGATAATGA AGAGGAAGCA GCGAGTGAGA
ATGTTCTCAA GAACCTGCTA AAGTTAAATT CCAACAAAGA GCACAACAGC AATCATAACA
GTAATGACGG CGACTCCGAC AGCGAAAATC GAGACGGCCG GGATTCCGAG CAGGATCGCG
AACAGCAGGC CGAGGGCGAA CTCCAGAGTA CCGTCGCTGC GAATATGGCT GAAGCCAGAG
AGATCGAACG TATGGTGGAA GCCGGGGAAC GTAAACGAGC AGTACTGGAA GCAATCAAGT
TCGCATCGTT GGCCAACAAT ACGTCGCCCG ACAGAAACGA GTTGTGA
 
Protein sequence
MAVASGGSMG IKKGPVIPLE RKELISLGSL WFGLFVILAL LHAAEIAITT LYPWKVREIA 
EEEEKQGNMR GTFKVLNEDI TRVLTTILVA STACSIFATT LFTHLVASLF GLQGERYGAI
ALTGLTLFFV ELLPKSLGVT NAETVARIMV PPVNVASAIV SPLGISLSWL AKRTLSMLGV
KDKNSGSGVS DSQLRLIVTG ALDSGTIDHG EQEMIQGVLK LQDQRVKEIM RPRVEMVAVP
VDMSVASVLG VVRESGYSRI PVYDGEIDNI VGIVLAKSVL DFFVNGVLVD EDLSKKLGKN
TEEIKAAVED LKAADEARRG ELPDDIQADA DKVMERVDLK IDALVDQRID ANIDASLPPS
LYTPERIPIR SKGNSNEPQG YVRSLTATEL ASRMEKSIQE AGLIESCYFV PDTANGWSVL
QEMRRRRVHL AIVVDEFGGT GGLVSLEDIV EEVVGEIYDE DDDEDFQVSE DSIAMQDDGT
FLIRGDADLE DCDTILELNL DEEEALKEFS TLSGFLCMCA GEIPSVGDFI MSRGWSFEIL
SADDKKVLLV KVERLVGAFD NEEEAASENV LKNLLKLNSN KEHNSNHNSN DGDSDSENRD
GRDSEQDREQ QAEGELQSTV AANMAEAREI ERMVEAGERK RAVLEAIKFA SLANNTSPDR
NEL