Gene PHATRDRAFT_43594 details

Gene Information

Locus tag: PHATRDRAFT_43594
Symbol:
ID: 7197318
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 907679
End bp: 909649
Gene Length: 1971 bp
Protein Length: 553 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002178030
Protein GI: 219112557
COG category:
COG ID:
TIGRFAM ID:
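
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch in Python, not part of the record itself. It assumes the start and end positions are 1-based and inclusive (the record does not state the convention, but this reading reproduces the listed 1971 bp) and that the protein length excludes the stop codon. The function names are hypothetical.

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding bases for a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


span = gene_span_length(907679, 909649)  # 1971, matches "Gene Length"
cds = coding_length(553)                 # 1662 coding bases including the stop codon
non_coding = span - cds                  # 309 bases not accounted for by the protein

print(span, cds, non_coding)
```

Because the gene is marked as spliced, the 309 bp difference would be consistent with intron sequence inside the span, under the assumptions stated above.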


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATATCGC AGCTCAGGTA CCATGACTGG ACCGTACACG GCAGTCGATC GAGTTTGGCG 
GCGGGTCCCC CCTGGAGCGT GACCGGACTG ATTCGGAGTC GCGGAAAGCA CCCCACGTTC
ACGCCCCCCT CTCCAATTTC TCCACTTCAC TGTGCGAACG CGGTAGAAAG GGTAAGCCAC
AGAATAGGCT ACGTATTTGT GACTAACAAA ACAGCAGTAC CATCCATGGT ACTACCATGT
GGCACGCCGT AAACGCGAAT GCAAAGAATT TCGAACGCGA GTCACTCAGA TCGTCGAGTC
GTTCTCACCA CCGCATCCGC ACTGGCATGT CCGTCGAAAC TAGTCTCTGA TTCCTCAGCT
CTTGCCTCCA CACTACATTC ATAGACACAC GTATTGTTGC ATCCTTGCAC TCTTTACTGG
TGACAGCATC CATTCATTCT TTCCGTCGTC GTCGTCGCTG CACCTCATTG CCATGCCACG
TTCGTACGCT TCGCTCCCGG GCGTTTCCGG GATGTTGCTA GTCATTCTGA TCGCGTCGGA
TCGCTCGTTG CAATCCACAG CGTGGCAATT AAATACAAAT CTGGAACCTA GCGCCACCGG
GTCGACCCTG CACCGGCGTC ATGCGATTGA ACGCATGGTG TCGGGTGCTG CGGTGCTAGG
TGGATTGGGC GGGGGACTCG GGACGGTACA CGCCGCCGTA CCGTCAAACG GACTCACCTT
TGACTCGTAC CGGGTCGTTC CCGATTCCAC TGCCGCACTT AATCCTAGTC TGTTGCCCAT
TCAGGTTCGT ACCTGTCCTT CGTTATTGAG TAGAGGAGAC CCGGTGTCAG TGTTGTTGTT
GTTGTTGTTC GGCGAAGCAC CATCTCACGC TTTTCGCTCG GCCATCGTCG TCGTTCTATT
GTTGAATTCC TTGTACAGAA GGCGGATTTT CTGCAAACAA TATCGTCGCG CAACGGCGGA
GCCTTGTGGT TGGGCGAGCA CCACAACTCC GTCAAGGACC ACAATTTGCA AGTCGACATT
CTCCGCCAAG TGCATCAACT CCGCCAAGCC ACCGGGTCCC CCACAGCGGT AGGACTGGAA
CAGGTACAGA TTAAGTTTCA GCCTGTTCTG AACGACTACC TGGCCGGGAA GATATCCGCC
GCCGAAATGC GTCAACGCGT TGAATGGGAC ACGCGCTGGA TGTGGCCGTT CGAAGTGTAC
GAGCCCGTTT TTGCCACGGC CAAGGAATTG CGTATGCCTC TAGTGGCACT CAACGTCAAT
TCAGAAGATT TGGTACTCGT CGAAAAAGGA GGTCTACCGG GGTTGCCGAG TGAACGACTC
CGGCAGTATA TTAGTGACGC GTACGTTGAT AGCGTGGTCG GAGATTAAAA TCCGTGTACG
ATTGGGTTTT TTCTCGTTCC CATCCATTGC TAATTTTGTT ATTCTCTGCT ACAACACTAC
AGACCTGGTT TTGCAGCCTT TGCCAAGCCT CGTGAATTCG GAACCTATGT CGACTACGTT
ATCCGACCCT CCTACGATCT ACATGAAGCA ATGGGTCTGC TCAAGTACAG CATGTCGGGG
GAAAAGTTGG ATGAGCCCAT GCCCTTTCGC AATTTCTTCA GCGGGAGAAT TTTGTGGGAC
GAAGCTATGG CGAACGCCGC CTACTCCTGG ACCAAGGCGA ATCCCGGTGG ACTCCTCGTG
GGTTTGGTAG GGGCGGATCA CGTCAAGTTT CGCAACGGAA TTCCGGGGCG ATACGCCCGG
CTTGCGCCGA ATGACGCCGC GTGTGTTTCA GTTCTGCTGA ACCCGACATT GATTGATACG
CGACCGTCGG GCACGGTAGG CATGGAGGGT GCCGTTTCGG ATCGTCCGGA AACCATTACT
CTGCAAATCC GTTATTTGAA AGATGACGTA CAATTTGATT CCCCGGAACG AACCTTGCCA
TCGTCAACGG GTGGTGTCCT GGCTCTCGCC GATTACTTGG TGGTAGGTTG A
 
Protein sequence
MISQLRYHDW TVHGSRSSLA AGPPWSVTGL IRSRGKHPTF TPPSPISPLH CANAVERQYH 
PWYYHVARRK RECKEFRTRV TQIVESFSPP HPHWHLLPPH YIHRHTYCCI LALFTGDSIH
SFFPSSSSLH LIAMPRSYAS LPGVSGMLLV ILIASDRSLQ STAWQLNTNL EPSATGSTLH
RRHAIERMVS GAAVLGGLGG GLGTVHAAVP SNGLTFDSYR VVPDSTAALN PSLLPIQKAD
FLQTISSRNG GALWLGEHHN SVKDHNLQVD ILRQVHQLRQ ATGSPTAVGL EQVQIKFQPV
LNDYLAGKIS AAEMRQRVEW DTRWMWPFEV YEPVFATAKE LRMPLVALNV NSEDLVLVEK
GGLPGLPSER LRQYISDAPG FAAFAKPREF GTYVDYVIRP SYDLHEAMGL LKYSMSGEKL
DEPMPFRNFF SGRILWDEAM ANAAYSWTKA NPGGLLVGLV GADHVKFRNG IPGRYARLAP
NDAACVSVLL NPTLIDTRPS GTVGMEGAVS DRPETITLQI RYLKDDVQFD SPERTLPSST
GGVLALADYL VVG
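
Both sequences are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch in Python of that clean-up, plus a simple GC percentage calculation. The helper names are hypothetical, and the GC function assumes the input contains only unambiguous bases.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed as whitespace-separated blocks over several lines."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, copied as printed.
first_line = "ATGATATCGC AGCTCAGGTA CCATGACTGG ACCGTACACG GCAGTCGATC GAGTTTGGCG"
dna = clean_sequence(first_line)
print(len(dna))  # 60 bases: six blocks of ten
```

Applied to the full gene and protein sequences, `len(clean_sequence(...))` should agree with the Gene Length and Protein Length fields (1971 bp and 553 aa), and `gc_percent` on the full gene sequence can be compared with the listed GC content.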