Gene PHATRDRAFT_1870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_1870 
SymbolTRD4 
ID7200137 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp678615 
End bp680131 
Gene Length1517 bp 
Protein Length467 aa 
Translation table 
GC content47% 
IMG OID 
ProductTRD4 
Protein accessionXP_002179486 
Protein GI219117383 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAAG GCAGTCAAAC GTTCCCTGTT GCACAGCATC ATAATCGTGA CAGAGTATAC 
TCACTGTATG CCGTAGCTCT AATTTTTGCC GTCCCGGCAC TGGGCGGTTT TAATTTTGGT
TTTGACATTG GCGCTACGTC GTACGCCATC GTCCAAATGC AGTCTCCTGT CTTGTCTGGA
GTGTCATGGT TTCATACTGT TTTATCTTCG CCAATATTAC GCGGTACTAT TCTGTCATCG
GGATCGGCAG GTGCTTTGAT TGGTAGTTCA CTGGCGTTTG CGATCGGTGA CAAGATTGGG
CGAAGACGGG AATTGCAGCT GGGATCTTTA CTGTATCTCC TCGGAGCATT GCTAGAAGTC
TGGACGGCCC AATCGAGCAG CTGGGGAGCA GTACTTGGTA TTACTGTTTT GATTTTAGGA
AGAGTCGTTT ACGGAATCGG CATCGGCATA TCGATGCATG CGGCGCCGAC ATATATTGCC
GAAATGGGTC CTTCCAGTAT TCGAGGTTTG TTGGTATCGC TCAAGGAAGC ATCGATTGTT
CTAGGTATTC TCACCGGATA CATGATTGGA TACGCGTGCT CGAAACATAC TGGAGGATGG
GCGTGGATTT ATGCATCGAG TACTATGTTT TCGATGCTTA TGCTGATATT GTCAACCAGG
ATTCCGAGAA GCTGTCGATG GCTCATGCTG AACAATATGG AAGACGAAGC GCTCGAATCA
CTGCAATTTG TGTTCACAGA GGAGCAAGCT CAGGTAGAAT TTTCCAACAT GAAACAATCA
CACGAGGAAG CGTGTGCGTT GCTAAGCGAT GAAGAGGAAG AAAAGACCGT CTGGCATCGG
TCATACCGAG CGCCGCTCAT TGCGGGCGTT GGACTTGTCG TACTCCAACA AATTACCGGA
CAACCGTCTG TCTTGTCCTA CGCCACTCCA ATTTTCCGAG ATGCGGGATT ATCGGACTCT
GCACCCGTAC TTCTCGCACT TTTCAAGTTG CTGGCGACTC TGTCGGCAGC TGTCACAGTC
GAAAAATACG GAAGGAAAAT GCTGTTGTAT ACAGGCTGTT CCCTAATGCT CATTGGGCTA
ACTATTCTTT CCTTTTCTCT GGATGGTGGT ACCTATATTG CCAAAGTGGC TGTCTTAGTG
GCCATGTTCG TCTATATCGG TGGTTATCAA GTGGGTTTTG GTCCCATCAC GTGGCTCCTT
ACCAGCGAAC TATATCCTTT GAGCATCCGA GGCCAAGCAG TAGCTATTGC TGTACAAATG
AACTTTTTGC TTAACACTGC AGTCCAGTTT GGAGTTCCGC TGTTGCAAGA AGTCATTGGA
TTGAGCTTCA CGTTTGCATT ATTTGGTATA CTTACAGCGT ACAGGTAAGT GAGCCGGACA
AGTGTATGGT CCCAGATAAT GTCCGTGGTA TCTCTTTCAT TCTAACACAT TGTCAAACGA
TTGTTCCACT TGACAGCATC TTTTTTGTTG CAACTCGTGT ACCAGAAACA AAAGGTTTGT
CCTTGGAGGA AATTGAG
 
Protein sequence
PVAQHHNRDR VYSLYAVALI FAVPALGGFN FGFDIGATSY AIVQMQSPVL SGVSWFHTVL 
SSPILRGTIL SSGSAGALIG SSLAFAIGDK IGRRRELQLG SLLYLLGALL EVWTAQSSSW
GAVLGITVLI LGRVVYGIGI GISMHAAPTY IAEMGPSSIR GLLVSLKEAS IVLGILTGYM
IGYACSKHTG GWAWIYASST MFSMLMLILS TRIPRSCRWL MLNNMEDEAL ESLQFVFTEE
QAQVEFSNMK QSHEEACALL SDEEEEKTVW HRSYRAPLIA GVGLVVLQQI TGQPSVLSYA
TPIFRDAGLS DSAPVLLALF KLLATLSAAV TVEKYGRKML LYTGCSLMLI GLTILSFSLD
GGTYIAKVAV LVAMFVYIGG YQVGFGPITW LLTSELYPLS IRGQAVAIAV QMNFLLNTAV
QFGVPLLQEV IGLSFTFALF GILTAYSIFF VATRVPETKG LSLEEIE