Gene PHATRDRAFT_54068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54068 
Symbol 
ID7196612 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2524994 
End bp2526574 
Gene Length1581 bp 
Protein Length450 aa 
Translation table 
GC content47% 
IMG OID 
Productn-acetylglucosaminyl-phosphatidylinositol biosynthetic protein 
Protein accessionXP_002176995 
Protein GI219110487 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.15052 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAAAGAAAAA GGAGCATCTC GAGATGGCTA TAGCCATGAT GTGTATTGGC GTTATTACGG 
CAAACATAGA CAGAGCACCA TCTATTTGTT CAGAAGAACT CACGTTAACT GATGGGTCAA
TAGGAAGAGG GAAGTATGAG GAAAAGCTAC CGAATCGCCA TGGTGTGTGA CTTTTTCTAC
CCTCGGCTCG GAGGGGTCGA GAATCATATC TGGAGTCTAT CACAAGAATT AGTAAAAATG
GGGCATAAGG TTATCGTAAT AACTCACGCC TACGGATCCC GAAGAGGTGT TCGATACATG
TCGGGACCTT TGAAGGTGTA CTATTGCCCA ATAACGCCGA TGGTTGATCA AGATGCCTTA
CCCTCGTTTA CCGCCACCTT TCCATTATTG CGGTCAATAT TGATTCGAGA AGGGATTGAG
ATTATTCATG CACATCAGGC GACAAGCACT ATGGCGAACG AAAGTATTGT GTATGCCGGT
ATTCTCGGCA TCGCCAGTGT TTACACGGAC CACTCGTTGT TTGGATTGAA CGACATTGCT
GGTATTGTAT TGAATCGCGT GCTACAGACG ACTATGTGTG CTGTAGACGC CGCCATCTGC
GTTTCTCATG CATGCCGGGA CAACTTTGTC CTGAGAGCGA ACGTGGGTGC GGAGCGCGTT
CGAGTCATTC CGAATGCTGT GGACGCATCT ATGTTTCAAC CCGACCCATC TCTTAAGTGC
CAACCTCGGT ATGTCAACCG GTGACCGTCC GTTGCCTCTT GCGCATTCTG ACTAAAAGTG
CGAATTCTCG CAGACTCACA AAGGTTCTTT TAAACCAGGC TCAATATTGT TGTTGTTTCT
CGTCTGGTGT ATCGGAAAGG GGTAGACCTT CTGGTTGGTA TTATACCAAA AATATGTCGA
AGACTTGACT CAGTTGATTT CATAATAGGT GGCGACGGCC CCAAAATGTT GGACTTGCAA
GAAATGGTGG AACTTGAACG TCTGCATGAT CGTGTTACAT TTTTAGGTTC TATCCCGCAC
GCACGGGTCC GAGACGTCTT GGTACAAGGA CATATTTTTT TGAATTGTTC TCTGACGGAG
TCCTTTTGCA TCGCTATCCT AGAAGCCGCT TCCACTGGCC TCCTTGTTGT TTCTACGAAC
GTCGGTGGAG TTCCCGAAGT TCTTCCAGAT GATATGATCC TTTTGGCCGA CTCAAACGTA
CCAAGTATTG TGCAGAGGTT GGTGGAAGCG GTTGATCGCT TTAACAATGG CAACGTTGCT
GACTCTTGGA GTGCCCACCA ACGAGTCACA GACATGTATT CATGGCAACG GGTTGCCGTC
GAAACTGAGC AAGTGTACAG TGACGTTTGC CGGCAGTCAA AACGATTGTT TTATGACCGA
ATTTCCCGAT TCTGCAGCAT AGGCGGAATT TCTGGGCCTG TTGTCGTGGT GCTGCTAGTT
ATCGTGGAAT TGTGGACTTG CGCCGCAAAT TGGTACGAGC CGTCTGCGAA TATTGACATT
GTGCCTGATG TAATCGAGAG CAAGACAAAG AAAAAAACGC GTTCGAAGAC GGAAGAACCA
CAAGTAAAGA TGCCATAGTA G
 
Protein sequence
MRKSYRIAMV CDFFYPRLGG VENHIWSLSQ ELVKMGHKVI VITHAYGSRR GVRYMSGPLK 
VYYCPITPMV DQDALPSFTA TFPLLRSILI REGIEIIHAH QATSTMANES IVYAGILGIA
SVYTDHSLFG LNDIAGIVLN RVLQTTMCAV DAAICVSHAC RDNFVLRANV GAERVRVIPN
AVDASMFQPD PSLKCQPRLN IVVVSRLVYR KGVDLLVGII PKICRRLDSV DFIIGGDGPK
MLDLQEMVEL ERLHDRVTFL GSIPHARVRD VLVQGHIFLN CSLTESFCIA ILEAASTGLL
VVSTNVGGVP EVLPDDMILL ADSNVPSIVQ RLVEAVDRFN NGNVADSWSA HQRVTDMYSW
QRVAVETEQV YSDVCRQSKR LFYDRISRFC SIGGISGPVV VVLLVIVELW TCAANWYEPS
ANIDIVPDVI ESKTKKKTRS KTEEPQVKMP