Gene PHATRDRAFT_42247 details


Gene Information

Locus tag: PHATRDRAFT_42247
Symbol:
ID: 7195079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 356590
End bp: 357993
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002183471
Protein GI: 219126452
COG category:
COG ID:
TIGRFAM ID:
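
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon (the gene sequence in this record ends in TGA); the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
print(gene_length(356590, 357993))  # 1404
print(protein_length(1404))         # 467
```

Under these assumptions the computed values agree with the Gene Length (1404 bp) and Protein Length (467 aa) fields.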


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGGCC CTGTTTCCAT CAAAGGCTGG GTCCGGACCG TGCGGAAGCA AAAGACTCTC 
GCCTTTGTGG AAGTCAACGA CGGCAGTAAC CTTTCCGGCA TTCAGTGCGT GATTGTTTTT
GATAAAGTCG ACGAAGCGAC CAAGGCGGAA CTCGATAAGG TCACGACCGG ATGTGCGGTG
GAACTCACCG GCCCATTGGT GGCTAGTCAA GGCGGGAAGC AAGCGGTCGA ATTGGCCGCT
ACAGTCCTGC GTGTCGTGGG AGCCTGTCCG GCCGAAACTT ACCCGCTCGC CAAGAAACGT
CACACTCTGG AATATTTGCG ATCGATTGCG CATTTGCGGC CCCGAACAAA TACCATTGCT
GCGGTAGCTC GAGTACGATC GCATTTGGCC GGCGCGATTC ACGCTTTTTT TCAAACGCAA
GGGTTCGTGT ACGTGCAGAC GCCTCTCGTG ACAGCTTCGG ATTGTGAAGG AGCCGGCGAA
CTGTTCCGCG TCACGACGCT CAATCTCGAC AGCGTCTCGA CCTTGCCCAA AGCCAAGAAC
GAGAACGGCA AAGAGCAGGA TCGAGTCGAT TACAGTGAGG ACTTTTTCGG TAAACCGGCA
TACTTGACTG TCTCGGGTCA GCTGGGGGGT GAAACACACG CCTGCGCGTT GGGTGATATT
TACACGTTTG GTCCGACGTT TCGAGCCGAA AATTCCCAAA CGAGTCGCCA TTTGGCCGAA
TTTCACATGG TCGAACCGGA AATGGCCTTT GCTGATTTGA CTTCCGCCAT GAACAACGCC
GAAAATATGC TGAAGTACGT AGTACAGCAC GTGTTGGACT CCTGTGGGGA AGATTTGGAG
TTCTTTCAAA AGTTCTACGA CAAGGCCCTA ATGACGAGAC TGGAAAAACT CGTGCAGAAA
CCATTTGTTC GCGTTTCTTA CCGGGAAGCG ATCGAGTTTT TGCAGGAAGA GATCAACAAG
GATCCCAGCA AGTGGCAATT TCCAGACGTA TCCTTTGGTA CCGACTTGGC GACGGAGCAT
GAACGATGGT TGGCGGAAAC CAAGTTTGAA AGTGCCGTGT TTGTGTACAA CTATCCCAAG
GCCATCAAAG CCTTCTACAT GCGTGATAAT GAAGAGGACG GCGGGGAAAC GGTCAATGCC
ATGGACTTGC TTGTTCCCGG CGTCGGAGAA CTGATCGGTG GGAGTCAACG TGAGGAACGG
TTGGATGTAC TGGAGCAGAA AATTGCCGAC GTTGGGCTTG ATAAGGAAGA CTACTGGTGG
TACCTGGATT TGCGCCGGTT TGGATCCGTC CCGCACGCCG GGTACGGTCT CGGATTCGAA
CGGTTGGTGA CCTACGTGTG TGGCATCGAA AACATTCGAG AGGCAATTGC CTTTCCCCGG
TATCCCGGCA ACGCCGAGTT TTGA
 
Protein sequence
MDGPVSIKGW VRTVRKQKTL AFVEVNDGSN LSGIQCVIVF DKVDEATKAE LDKVTTGCAV 
ELTGPLVASQ GGKQAVELAA TVLRVVGACP AETYPLAKKR HTLEYLRSIA HLRPRTNTIA
AVARVRSHLA GAIHAFFQTQ GFVYVQTPLV TASDCEGAGE LFRVTTLNLD SVSTLPKAKN
ENGKEQDRVD YSEDFFGKPA YLTVSGQLGG ETHACALGDI YTFGPTFRAE NSQTSRHLAE
FHMVEPEMAF ADLTSAMNNA ENMLKYVVQH VLDSCGEDLE FFQKFYDKAL MTRLEKLVQK
PFVRVSYREA IEFLQEEINK DPSKWQFPDV SFGTDLATEH ERWLAETKFE SAVFVYNYPK
AIKAFYMRDN EEDGGETVNA MDLLVPGVGE LIGGSQREER LDVLEQKIAD VGLDKEDYWW
YLDLRRFGSV PHAGYGLGFE RLVTYVCGIE NIREAIAFPR YPGNAEF