Gene PHATRDRAFT_3369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_3369 
SymbolGALK1 
ID7203303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp379926 
End bp381116 
Gene Length1191 bp 
Protein Length397 aa 
Translation table 
GC content51% 
IMG OID 
Productgalactokinase 
Protein accessionXP_002182673 
Protein GI219124778 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.949069 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCCCCGGCGT ACGTGGTGGC GGCACCAGGA CGGGTCAATT TAATTGGCGA GCACGTCGAT 
TACACGGGTG GATTTGTCTT ACCCTTGGCC ATCGACTTTT CCACCGTGAT TTACGGAACC
GGCTTTTGCC ATACCGGCAA GGGCAATGGA CCGACGTCGA TGCGTCTGCG ACTCATTTCC
GAAAAGGCCC TTAACGGAAG TATTGTTGAA GAACGCCGGC TGAACGGCAC AAGTCGTCCA
CCAGATGCGG ATGAACCGCG ATCATGGGTC AACTACGTAG TCGGGGTAGT ACAACAGTAC
ATGCACGATC TTCCCAAAGA AGGGTGCACT ATAGACCTGT CTATGGCCAT TGCTAGCGAT
GTTCCTTTGA GCTCCGGTCT GTCCAGTTCG GCATCGCTCG AAGTTGCAGT CGCGACATTC
TTCGAATGCT TCCTGCCGGA AAATATGGCC TATTCCTCCG CCAAAGAAAA GGATATTCGA
AAGGAACGCG CGTTACGTTG TCAGAAAGCC GAAAACGATT GGGCACATTC GCCATGCGGT
ATCATGGATC AACTAGCCTC GTCTGCCGCC CAAGTCGGCA AACTTATGCT CATTGACTGC
CAATCACTGG AAATTGAACA CGTCACTATG AAAGCAAATA CACCAGAAGA TCCCGTTATT
CTTATCACCA ACTCAAAGGT CACGCACAGC ATTGCCGATT CCGAGTATGG GATCCGACGA
GCCCAGTGTC ACGATGCGCT CCTAGCAATG CAGTCCATTC CGCTCTACCA CGTACTTTCT
CTGCGAGACG CCACTAAGGA TGACGTGAAA GAGGCCGAAG CTAAGATGAA TAAGATTTCT
TACCATCGAG CTCTCCACGT CGTTAACGAA AATGTTCGCA CCAAAGAGTG CAAGGTTGCT
CTGAAAATGG GACTGTGGGA TCATGTTGGA GAACTTATGA ATGCGTCCCA CGCAAGCTTG
CGAGACGAGT ACGAAGTAAG CTGTGAAGAA GTTGACTATC TCGTCGAAGT AGCTCAGGCG
TACGAAGGTG TGTACGGTTC CCGCATGACC GGCGGCGGTT TTGGCGGTTG TACGGTCACT
TTCGTTCAAC GCCGAGTTGT CGAAGGACTC ATAAAGCATT TACAATCATC TTACGAAGCC
AAGTATGGGA AACAAGCCGA GTGCTTTTTG ACAGAACCGG CGGAAGGAGC C
 
Protein sequence
PPAYVVAAPG RVNLIGEHVD YTGGFVLPLA IDFSTVIYGT GFCHTGKGNG PTSMRLRLIS 
EKALNGSIVE ERRLNGTSRP PDADEPRSWV NYVVGVVQQY MHDLPKEGCT IDLSMAIASD
VPLSSGLSSS ASLEVAVATF FECFLPENMA YSSAKEKDIR KERALRCQKA ENDWAHSPCG
IMDQLASSAA QVGKLMLIDC QSLEIEHVTM KANTPEDPVI LITNSKVTHS IADSEYGIRR
AQCHDALLAM QSIPLYHVLS LRDATKDDVK EAEAKMNKIS YHRALHVVNE NVRTKECKVA
LKMGLWDHVG ELMNASHASL RDEYEVSCEE VDYLVEVAQA YEGVYGSRMT GGGFGGCTVT
FVQRRVVEGL IKHLQSSYEA KYGKQAECFL TEPAEGA