Gene PHATRDRAFT_33692 details

Gene Information

Locus tag: PHATRDRAFT_33692
Symbol:
ID: 7198005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 133025
End bp: 134572
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table:
GC content: 57%
IMG OID:
Product: predicted protein
Protein accession: XP_002178172
Protein GI: 219114753
COG category:
COG ID:
TIGRFAM ID:
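
The coordinates and lengths listed in this table are mutually consistent, and a short sketch can check that. It assumes 1-based inclusive coordinates and that the gene length counts the stop codon; neither is stated by the record itself. The function names `span_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS.

    Assumes the CDS length includes one terminal stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_len = span_length(133025, 134572)
print(gene_len)                  # 1548
print(protein_length(gene_len))  # 515
```

Under these assumptions the computed values match the Gene Length (1548 bp) and Protein Length (515 aa) fields.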


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTCCG ACGGCGCCAG CACGAATCAG TACTCTCCAC AGTCGTTCAA TAGCAGGATG 
ACTGCCGTGG TTCCTTTTGT GGTCCTTCAA CAACCCGACT GGGCCACTCT ACACCGCACA
GTGCAGGCAT TTCGCCAAGA ATCGCAGGCA GCCTTACAAG GGCTCAGTCG TGACTTGCAG
GTTGCCTTGG CTACCGAGCA AGAATATCGC CAAGTATTGC AGGAGAGCAC TGAATCCGAC
TCCGGTACTC TGGCCGATTC TCTCCAAAAT CGTATCGATC GGCTATCGAT ACTCTTGGAA
CACAACGCCA AACTTCTCGG TACCTTGCTC CAACCGTTTC CAACGATCCG CGTTTTGCCT
AGTGCCAACC GAGAGGGGCA AGTCGGTACA GTGAACTCCC CAAACCCCAA TGATACAATC
GATCACACCT TCTCCCGGGC ACACACCTTC CCCATTCCCC GCGGAGCGGC AGCATTCTTT
GAGTCCTCTT CTCCGCAGCA GCCGTACGAT TCGGGTGCCC AGATATTGGC ACACATTGTC
CGCGATTGGA GTGACGCAGG GCGTCCCATT CAGGCTTCTC TGTACGATTG GTGCGTCGAA
CAAGTACTGG CCTACCGCAC GAGGACCCCA TCGTTGCATC CAACCGTCCG CCAGGCACAG
CACGATTCGA CTCGACCAGC AGACCGGATC CTGGTTCCGG GTGCAGGGTT GGGACGCCTG
GCGTGGGAAC TCGCCGCGTT GCCAACGCCG TCGGGAAACA ATAGTGCGGA GCATCGTGCT
GTGTATGTGG AAGCAGTGGA ATGTTCCGTA TCCATGGCGG CCACCGCCGC AATGATCTTG
CCCCATACGT ATCGACACAA GTTGGACGAA TCTGTTACTA CTACACCTGG TGGCTGGAGT
GGACGGGCCG CCGCGCATTG GACGGCCTAC CCCTACGTCG TGGATGCCTT TTCTAACGAA
GTCGATAGCG AACGACGGTA TCGAGCCGTG CATTTTCCTT CCGTGGATCA ATCCGAGAAG
GCGTACGAAA CCGGTGTCGA TGTGGGCGCC GAGTCGTCCG ACCGGTACCG ACGGAGTCGC
CATCTTGACT CCCGGAACAA TCTGTCGTAT ACGATAGCGG ACTTTACGAC CTACCGAGGG
TTGACGGAAA CCAGTGGAGC GTACCGGTTC GTTGTAACAT GTTTTTTCCT CGATACCGCC
ACGAACGTGT ACGACTACGT GGCGACGATT CGGCACGTGT TGGAAGGACC ATCCCGAGAC
CGCGATATGG CACTCGATGA TGAGCGTTGT GGTGGTGACG GTGACGGCGG CCTGTGGATC
AACGTGGGTC CGTTGCAGTG GCACCGCAAT GCGGTATTGC ATCCGAGTGC GAACGAGCTG
CGTAGTATTG TGGAACGCAT GGGCTTTACT ATTCTGTATT GGAAAGTGGA CGCAGCACCC
GTGGAATACC GCGACGAAGT TGTTGGAACG AGAGGTCCGA AGGAGGAACC GCGATCCACT
CATTACGATG CCTATTGTCC GTTGCGCTTC GTCGCACGGC GCAATTAG
 
Protein sequence
MDSDGASTNQ YSPQSFNSRM TAVVPFVVLQ QPDWATLHRT VQAFRQESQA ALQGLSRDLQ 
VALATEQEYR QVLQESTESD SGTLADSLQN RIDRLSILLE HNAKLLGTLL QPFPTIRVLP
SANREGQVGT VNSPNPNDTI DHTFSRAHTF PIPRGAAAFF ESSSPQQPYD SGAQILAHIV
RDWSDAGRPI QASLYDWCVE QVLAYRTRTP SLHPTVRQAQ HDSTRPADRI LVPGAGLGRL
AWELAALPTP SGNNSAEHRA VYVEAVECSV SMAATAAMIL PHTYRHKLDE SVTTTPGGWS
GRAAAHWTAY PYVVDAFSNE VDSERRYRAV HFPSVDQSEK AYETGVDVGA ESSDRYRRSR
HLDSRNNLSY TIADFTTYRG LTETSGAYRF VVTCFFLDTA TNVYDYVATI RHVLEGPSRD
RDMALDDERC GGDGDGGLWI NVGPLQWHRN AVLHPSANEL RSIVERMGFT ILYWKVDAAP
VEYRDEVVGT RGPKEEPRST HYDAYCPLRF VARRN
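
The sequences above are printed in space-separated blocks of ten letters, so they need cleaning before any length or composition check. The following is a minimal sketch of that step and of a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical names, and the short input string is an illustrative example, not taken from this record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


example = clean_sequence("ATGC GGCC\n")
print(len(example))          # 8
print(gc_fraction(example))  # 0.75
```

Pasting the full Gene sequence block into `clean_sequence` and comparing the results with the Gene Length and GC content fields is one way to confirm that a copy of the record is intact.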