Gene PHATRDRAFT_37109 details

Gene Information

Locus tag: PHATRDRAFT_37109
Symbol:
ID: 7202108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 182702
End bp: 184321
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002181320
Protein GI: 219121952
COG category:
COG ID:
TIGRFAM ID:
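
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the unspliced coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 182702
END_BP = 184321
GENE_LENGTH_BP = 1620
PROTEIN_LENGTH_AA = 539


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: 184321 - 182702 + 1 = 1620 bp, and 1620 / 3 - 1 = 539 aa.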


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAAC AACATGAGCA CGAAGCACGA AGCGACTTAG GAGCAGCCGA TCTTATAGAT 
GAGCATGAAG GACCGCGAGA GCAACAGGCA TGCAGCAGAA AAATTCATGC AAGTGAAAAG
AATCGAACAG ATCGGACAGC TGACGCTCAA CCAACAACTT CGCTAGAGAC CGAACCTTTC
GATACGTCTC AACTTCCCCG TTTTTACCCA CGCTGGCGTG GGAAACCCTG GTTTTGTGGT
AATGCTGAAG CGCTGGGGTG GGCACTGGAT GCCATCGGCC GAGCAGTTGC ATTCATATCA
GCGGGAGCAT TTTTGAGCAC GGCCATGTTA CGACTTGCAA AAACCGACGC CGGCTGCGCT
GTCGATCCTC CCGACGGATC AGACGAGATT CCAGAGTGTT ATGGACGAGT GTACGGAATT
CGACCATCGT CACTTTTGAC TATATACACC ATGGTGGTGG GAGTTTCAAG TTCAGCACTC
ATGCCTTTTA TGGGAGCCAT TGTCGATTAC ACTCCGCATC GCCTGCTGTT TGGTAGATGG
CTGTCGATTA TGTTCTGCCT GTTGCTTTTG CCGCTCGTGT TTCTCTCCCA AAACAACTGG
TTCTACATGG CAAATGTGCT GGTGATATTG GCGCTCGTTG GATGGGCACA AACCATGGTT
ACTTATGCAT ATCTTCCCGA ACTCACCGAT CAAGAGGAAC GTCTTAGCAC CTTCACTCGC
AGTTTTACCG TGGCAAGTTT CGGGTCAATG GTTTTGTTTC TCGCCAGCAT CGTCAGTATT
GCTACGCTCG GAGGTTTTTC TAACAACGAG CTTGCTGTCG CAAGAGTAGG ACTTTCTACT
GCGTTTGTGC TGAGTAGCTT GACTCTTTAC TTGGCCTGGT TCCGGCTTCT CCAAAGCCGT
CCTTCCGCGC GCTCGTTGCC GCCGGAAAAG TCCATCTGGA CTGCCGGCTT CATTCAAGTC
TACCGCACAA GCATTCATAT TTGCAAAAGC TTACCTACAC TTAAGTATTT CTACCTTTCC
GTTGCGTTCG TCGACGCTGG CGTCAATTCA CTGGCGACGA TTGCATTGAC GTACGTCACC
GATACTTTGG ATTTCAATAC GACCGAAAGC GGCTTTGCAA TTCTCGCCAT GCTCCTCGGA
ACCGTTCCGG GCGCGTACCT GGCCGGTGTA ACCATGAAGC GCCTCGACCC CATACGATCC
TCAATATTGG CAACCATTTT GATCACGATA AACACTATTT TGGCGGCAAT AGTTTTAAAG
GGACCCGGAC AACAAGTCGA GACAGGAGTC CTAGCGGTAG TGTGGGGTAC GGCTACTGGC
TGGAAATGGA CGACAGATAG AATCTTGGCC TCTGTTCTCA TTCCGACGGG CCAAGATGCA
GAGCTGATGG GAGTCTACTT GTTTTCGGGT CAAATTTTAA CTTGGTTGCC CCCGCTGATA
TTTACGGCCT TGAACGAAGC TGGCATAAGT CAACGTATCG GAATCGGTAC CTTGGCGATA
TGGTTTCTCA CAGGTATAGT TTTTCTAATT TGCATCGGCA GCTATCAAGA CGCGGTCGTC
AAAGCAGGGC GGGGACATTT GGTGGCAAAC GGAATGCTTA TTACCACATC AGCTACATGA
 
Protein sequence
MKQQHEHEAR SDLGAADLID EHEGPREQQA CSRKIHASEK NRTDRTADAQ PTTSLETEPF 
DTSQLPRFYP RWRGKPWFCG NAEALGWALD AIGRAVAFIS AGAFLSTAML RLAKTDAGCA
VDPPDGSDEI PECYGRVYGI RPSSLLTIYT MVVGVSSSAL MPFMGAIVDY TPHRLLFGRW
LSIMFCLLLL PLVFLSQNNW FYMANVLVIL ALVGWAQTMV TYAYLPELTD QEERLSTFTR
SFTVASFGSM VLFLASIVSI ATLGGFSNNE LAVARVGLST AFVLSSLTLY LAWFRLLQSR
PSARSLPPEK SIWTAGFIQV YRTSIHICKS LPTLKYFYLS VAFVDAGVNS LATIALTYVT
DTLDFNTTES GFAILAMLLG TVPGAYLAGV TMKRLDPIRS SILATILITI NTILAAIVLK
GPGQQVETGV LAVVWGTATG WKWTTDRILA SVLIPTGQDA ELMGVYLFSG QILTWLPPLI
FTALNEAGIS QRIGIGTLAI WFLTGIVFLI CIGSYQDAVV KAGRGHLVAN GMLITTSAT
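
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC-fraction helper and a simple translation routine. It assumes the standard genetic code, since the "Translation table" field of this record is blank; the function names are hypothetical.

```python
# Standard genetic code (an assumption; the record leaves the table unspecified).
# Codons are ordered with bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already joined DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    first_blocks = join_blocks("ATGAAACAAC AACATGAGCA CGAAGCACGA")
    print(translate(first_blocks))
```

For the first three blocks of the gene sequence this gives MKQQHEHEAR, which matches the first block of the protein sequence listed above.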