Gene PHATRDRAFT_54703 details


Gene Information

Locus tag: PHATRDRAFT_54703
Symbol:
ID: 7201966
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 756815
End bp: 758253
Gene Length: 1439 bp
Protein Length: 472 aa
Translation table:
GC content: 50%
IMG OID:
Product: UDP-N-acetylglucosamine pyrophosphorylase
Protein accession: XP_002181257
Protein GI: 219121821
COG category:
COG ID:
TIGRFAM ID:
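
The gene length listed above matches the start and end coordinates if both are read as 1-based and inclusive. That reading is an assumption drawn from the numbers in this record, not something the record states. A minimal sketch of the arithmetic, using a hypothetical helper named `gene_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length in bp.

    Assumes 1-based, inclusive coordinates (an assumption that is
    consistent with the values in this record).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(756815, 758253))  # 1439
```

The result agrees with the listed gene length of 1439 bp. Because the gene is spliced, this span covers introns as well as coding sequence.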


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.860159
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATCAAA CCCGCAACGT GTTCACAGAC GCAGATGACG AATCGGAAAT TCGGTCTCGC 
TACGTGCAGG CCGGTCAGGA GCATGTGTTC CAACACTACG CCCAATTGTC GCCGACGGAA
AGGACATCTT TTCTGCATCA GCTGCGGGAC ATTCAGGTTG AAAACGTTGC CATTTTATTG
AAGTCAGCCG AATCTATCGA CCAAGGAGAG CCCACCGACG AGACAGCTAT TGCGCCTTTT
CCTACCAATA TTGTTGGTAG ATCTACTGAT GAGACGCTTG TGCGAGATTC CTACACAACG
GGCATGGAAG CCATTCGAAA GAATCAAGTA GCTACCCTAG TATTAGCCGG AGGTCAAGGA
ACTAGATTGG GATTTGACGG TCCCAAGGGC ATGTACAGTA TTGGCCTACC GAGCGAACGG
ACACTCTTTG CCATGATGGC GCTGAGGATC CGAAAACTTG CGGCACTAGC CGGTGAGGAA
AATGTTGCTT TACCGTTTTA TGTCATGACC TCGCCCCTCA ATCACGACGC GACAGTGGCA
TACTTCCATT CCAAAGAGTA TTTTGGGCTG CCGGAGAGTG ACGTGTTTTT CTTTCAGCAG
GGAACTCTTC CCTGCCTGAC GAAAGACGGT AAGATTATTC TCGAACGAGC AGGGAAAGTA
GCCGTCGCTC CCGACGGCAA CGGTGGTATA TACCCTGCCT TGCAGCGCTC CGGTGCGCTG
CAAGATATGA TGACCAGGGG TGTCCGATAT CTTCACGTAT TTAGCATTGA CAATGCCTTG
ATCAAACCAG CAGATCCGGT CTTTCTCGGA TACTGCATCG GACAAGGAGC CGACTGTGGC
AACAAGGTTG TGTGGAAGTC GCACGCACAT GAAAAAGTTG GAGTTGTGGC GTCTCGAGGC
GGGAAGCCTT GTATCGTGGA ATATTCCGAA ATCACAACAG AAATGGCGGA GAGCACGGAT
GATGACGGGC GATTGCTGTT TGGAGCGGGC AACATCTGCA ATCACTTTTA TACTTTAGAC
TTTCTGAGAG AGAAGATTCT ACCCAACATG GGCAACATGT ATCACATTGC GCACAAGAAG
ATTCCCTTTT ATGACGCAGC TACTCAATCC ACAGTTGCCC CGACCGAAAA TAACGGCATC
AAGCTGGAGA CTTTTATTTT TGACGTCTTT CCCCTTTCCG TGAATATGGC CGTTTTTGAA
ATTGAACGAA GCGAAGAATT TTCGCCCGTC AAGAATAAGG CAGGGTCGGA AGCGGACAGT
CCAGATACGG CTCGAGCCAT GGCTTCCGAT CAGGCTAAAA AATGGATCAA AAATGCTGGT
GGTAACTTGA TCGGAAAGGT GGATGATGGT GTTTGCGAGA TTTCACCACT CACTTCCTAT
GGCGGAGAAG GATTGGAGCA CTATGAAGGT CAGGATGTTG CCTGTCCGTT TAGCCTATG
 
Protein sequence
MDQTRNVFTD ADDESEIRSR YVQAGQEHVF QHYAQLSPTE RTSFLHQLRD IQVENVAILL 
KSAESIDQGE PTDETAIAPF PTNIVGRSTD ETLVRDSYTT GMEAIRKNQV ATLVLAGGQG
TRLGFDGPKG MYSIGLPSER TLFAMMALRI RKLAALAGEE NVALPFYVMT SPLNHDATVA
YFHSKEYFGL PESDVFFFQQ GTLPCLTKDG KIILERAGKV AVAPDGNGGI YPALQRSGAL
QDMMTRGVRY LHVFSIDNAL IKPADPVFLG YCIGQGADCG NKVVWKSHAH EKVGVVASRG
GKPCIVEYSE ITTEMAESTD DDGRLLFGAG NICNHFYTLD FLREKILPNM GNMYHIAHKK
IPFYDAATQS TVAPTENNGI KLETFIFDVF PLSVNMAVFE IERSEEFSPV KNKAGSEADS
PDTARAMASD QAKKWIKNAG GNLIGKISPL TSYGGEGLEH YEGQDVACPF SL
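
Both sequences above are printed in space-separated blocks of 10 residues, so they have to be joined before any length or composition check. The following is a minimal sketch of one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the database may compute its reported GC content differently.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small illustrative input, not taken from the record.
example = "ATGC GGCC"
print(len(clean_sequence(example)))  # 8
print(gc_percent(example))           # 75.0
```

Pasting the gene sequence block into `gc_percent`, or either block into `clean_sequence` followed by `len`, gives a quick consistency check against the listed GC content and lengths.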