Gene PHATRDRAFT_36312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36312 
Symbol 
ID7201908 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp135764 
End bp138180 
Gene Length2417 bp 
Protein Length740 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180950 
Protein GI219120423 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAAG ATGCCGGAGC CGGCACGATT GCGTTCCTGC AGCGTTCGCT ACAAGCGAAC 
AACTCGACGG ATTCCACCGA GGAGGCAGCC TCGTACAACG ACGGCAGCGT GTTGCGCGAT
ACCTTTACGG TGTACGGGTC GATTTTGCTC GTAATCTTCA TCGCATTTTG CTGGTTGCGG
CGCAAGTATC CACGAGCGTA CAACGTCCGA AATTGGGTCG AAGATATTAA GACCCCGCTC
GCCAAAGACC AGTTCGGTTT CTTTTCGTAA GTCCACGGAA GATCCATAGA TGTGGCGCTG
TCTCTTGTTG GGGGTAGCCA CGACAACAAC ATCTGCATTA ACAGTATCCC GCTCACGCGT
CTTTTCCTTC CACAGGTGGA TTTGGGAGAT TTCCACCATT ACCGAAGACG AGATCATGGA
CGAATGCGGC TTGGACGCCT TGTGTTTCGT CCGGATCCTT AGCATGGGCT ACCGCATCAG
CTTAATGGGT GTCTTCAACG CCATCTGGCT CATGCCCGTT TACGCCACGG CTGACGTATC
GGACGACACC CGCGGTATTG TGGACCGCAT TGTCGAAGTC TCCATTGCTC ATGTTCCTGC
GTCGTCGCCC CGACTCGTAG CAACGGCTTT GGCTGCCTGG ATCGTCTTTG GCTACACCAT
GTACCTCATT TTACAGGAAT TCGAATGGTT CATCGACAAG CGTCACAAAT TCCTCGCCAA
ACCTCGACCC CAGAACTACA CTGTCTACGT CCGAAACATT CCCATCGAAT ACCGCACGGA
CTCGGGCTTG GAAGACTTCT TTCGGCAGTG CTTTCAGTAC GAGTCGGTCC TCGAAGCCAA
CGTGCGCCTC CGGACACCCA ATCTTGCCAA GCTCGTGGCG CAACGAAGCG TGCTCATCGC
CAACCTCGAG CACGCCATTG CGATTGAAGA CATTACCGGA GAGGCGCCGC AGCGATCAGC
TTCACTCAAA TCCTCCCTCA TGATTATGGG CGGGGAAAAG GTCAACGCCA TTGAGGCTTT
CGCCGAAGAA CTCAAAGCAC TCAATGCGGA TATCAAAGCA CGTATTGAAG AGCTCGAAAC
CAAAAAATTG TCCCAACTGT TCATGCAAGA TGTGGAACAA CAGAGTTTGG CGACCTTGGG
ACACTCGGTG GCCGGGCGCG GCGATTCAAT GTATGGAGCA GGCGACAATG TGAACGCGGA
GGAGTGCGCG TCCTTGACAC CTAGTGCCCT GGCGGTTGTA CCGCGTCCCA ACGGATATGG
TACCGAAATC TCCAACGTAG AAACTGCCAT TTACAACGAC GTTATTGTAG AGGAGGAAGA
CGACGATGGA GACTTGTCGA CTCTGGCCAG TCGTCAGAAC GTGACCAACC ACAGCAGCTC
ATCGAAGTCG ATTCTGGATG CCAAAAAGTC GATCAAGCAG TCAGTCCACC TTTTTAAAAA
GGCCGCCAAT GCGGTCAAAG ACTCGGCTGT GGCGGTAGGA GAAAACGCCG CCCACATGTT
GCAAACGAAC GCCGACGGTG AGAGTTACGA AGCAGGTTTT TTAACTTTTA CCAATCTACG
AACAGCACAA GCGGCTTTAC AGATGCTACA TCACAGCAAG CCGTTTTCGA TTGAAGTGCA
AGAGGCTCCG GATCCGCAGG ACGTCTTTTG GTTCAATGTA GGACGCACGC ACAAAGAATT
GCAGATGGGA AATTTGTTGT CGTTGGCAGC CACGACTGCC TTGTGTCTTC TTTGGACGAT
TCCCATGAGT TTCATTGCTT CACTGTCCAC GATTGATGCT CTCCGCTCGG AATTTGATTT
TATTGACAGC TTGCTGGATG ATGCCCCCTT TTTGGTTCCC GTGTTTGAGA TTGGAGCGCC
CTTGTTGGTC GTGGTGGTCA ACGCGTTGTT ACCCGTGATC CTACAAGTCT TCTCCATGAT
GGAGGGCCCT GTGTCTGGGG CAGTTGTGGA AGCTTCGCTC TTTTCTAAGC TTGCAGCCTT
TATGATTATC CAAACCTTTT TCGTCAGTGC AATTTCTGGT GGACTCATGC AGGTACGTGT
CAGCAAAAAC GGAAGGCTAC AACAAGACTG TCGTGTAGTG GAATGGTGAG CTCACGCGCA
TTTGCTTTTG ATCACAGCAA CTCTCCGAGA TGATAAATGA TTATACCCTA ATTATTGATT
TGCTGGCAAC CTCCTTGCCC GCTCAAGCAA CCTATTTCAT TCAAATTATC TTTGTGACTA
CGGTTTTTTC TTGCGGTATG GAAATCTTGC GAGTCATCCC GGTAATTAAA GCAGCATTGC
GAAAGTGCAT CGGACCTCGC TTGACCAAAA GAGAGCGTCA AAAAGCATTT ATGGGTTTGC
AACCTCTGGG CGACCCGCTA GATTTTGAAT TTGCGGATTT TTCTTCGAAC ATGGTAAGCT
CCTCACGTCC AAAGTAG
 
Protein sequence
MTEDAGAGTI AFLQRSLQAN NSTDSTEEAA SYNDGSVLRD TFTVYGSILL VIFIAFCWLR 
RKYPRAYNVR NWVEDIKTPL AKDQFGFFSW IWEISTITED EIMDECGLDA LCFVRILSMG
YRISLMGVFN AIWLMPVYAT ADVSDDTRGI VDRIVEVSIA HVPASSPRLV ATALAAWIVF
GYTMYLILQE FEWFIDKRHK FLAKPRPQNY TVYVRNIPIE YRTDSGLEDF FRQCFQYESV
LEANVRLRTP NLAKLVAQRS VLIANLEHAI AIEDITGEAP QRSASLKSSL MIMGGEKVNA
IEAFAEELKA LNADIKARIE ELETKKLSQL FMQDVEQQSL ATLGHSVAGR GDSMYGAGDN
VNAEECASLT PSALAVVPRP NGYGTEISNV ETAIYNDVIV EEEDDDGDLS TLASRQNVTN
HSSSSKSILD AKKSIKQSVH LFKKAANAVK DSAVAVGENA AHMLQTNADG ESYEAGFLTF
TNLRTAQAAL QMLHHSKPFS IEVQEAPDPQ DVFWFNVGRT HKELQMGNLL SLAATTALCL
LWTIPMSFIA SLSTIDALRS EFDFIDSLLD DAPFLVPVFE IGAPLLVVVV NALLPVILQV
FSMMEGPVSG AVVEASLFSK LAAFMIIQTF FVSAISGGLM QQLSEMINDY TLIIDLLATS
LPAQATYFIQ IIFVTTVFSC GMEILRVIPV IKAALRKCIG PRLTKRERQK AFMGLQPLGD
PLDFEFADFS SNMVSSSRPK