Gene PHATRDRAFT_9020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_9020 
Symbol 
ID7196421 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1198725 
End bp1199861 
Gene Length1137 bp 
Protein Length364 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176741 
Protein GI219109977 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAGCCGGCTT GGCAAGTTGC CTTCTCCAGG GATGGCCGAT ATTTGGCCGT CTGCTACGGA 
GCAATCGAAC CTTGCGTCCG GATTTGGAAG CAGCAGTCGC CCTTTCATGA AGATAGCGGG
TGGATTCTGG ACGCGACGCT AACGGGCATT CAAACACGGA CGATCCGATC CATCGCATTT
GCACCCATTC GAACGCCGCT GATCCTAGCC TCAGCATCAT TCGACGGCAC TGTTGCGGTA
TGGGAACACT ACCCTGCCAC AAATGGAGCA CTAGTCACAG CATCAGCAAA AAGTCCATCA
GGGGTGGACG AATGGGAGTG TACGGCTCAG TTGGAAGGCC ACGAGAGTGA AGTCAAGTGT
GTGCAATGGA ATGCCACTGG GTCACTTTTG GCAAGCTGTG GACGCGACAA GACCGTTTGG
ATCTGGGAAT GCTTTTTGCC TGGTGCTATT GGTGGCCCCA GCGCAGCCCA CCCGTCACCG
TCAGGCCACA ACTCTGGTGG TGGTGATTTC GAATGCATCG CTGTCCTTCA TGGTCACGAA
GGTGACGTTA AGTGCGTACA ATTTACAAGT AGTCACGACG AGTGGGGCGA CGGGGACGAG
ATTTTACTTT CCTCTTCATA CGACAATACT ATCAAGTGCT GGGCCGAAGA CGCCGGTGAT
TGGTACTGTG CGGCCTCGAT TGAAGACGTT CATTCTTCAA CTATTTGGTC ATTGGCCATG
TCTCCCAGTG GACTACGGAT GATATCGGGT TCCGACGACC AGAGCCTAGG TATTTATAAA
TGCTATACAG CTTCCGAGAA GAAGAGACAC TTCCCTGACG AAGGCAAAAA TCGGAACGGC
CTGTGGAAAT GTGTGGGGCA TCTTCCAGAT GCGCATTTGG CAAGTATATT TTCGGTTGCG
TACGCTCCGT CACGGGCCGG CCACGGACGG ATAGCAACGG CCGGGGCTGA CAACCGGATA
CAAATATTCC GAGAGGTGTC TGGTAGCGTT TCTGATCAAC CTCTTTTTAC CGTAGAAACA
TCGGCTACAA ATGAGCTAGG AGATGTCAAT TGCGTAAGTT GGCACCCTTC AGATGGCTCC
ATCCTTGCCA CTGCCGGCGA TGACGGATCC GTGTGCATCT GGAAGTTTAA CTTGTAG
 
Protein sequence
EPAWQVAFSR DGRYLAVCYG AIEPCVRIWK QQSPFHEDSG WILDATLTGI QTRTIRSIAF 
APIRTPLILA SASFDGTVAV WEHYPATNGA LVTASAKSPS GVDEWECTAQ LEGHESEVKC
VQWNATGSLL ASCGRDKTVW IWECFLPGHN SGGGDFECIA VLHGHEGDVK CVQFTSSHDE
WGDGDEILLS SSYDNTIKCW AEDAGDWYCA ASIEDVHSST IWSLAMSPSG LRMISGSDDQ
SLGIYKCYTA SEKKRHFPDE GKNRNGLWKC VGHLPDAHLA SIFSVAYAPS RAGHGRIATA
GADNRIQIFR EVSGSVSDQP LFTVETSATN ELGDVNCVSW HPSDGSILAT AGDDGSVCIW
KFNL