Gene PHATR_44040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44040 
Symbol 
ID7204226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp772880 
End bp775318 
Gene Length2439 bp 
Protein Length812 aa 
Translation table 
GC content59% 
IMG OID 
ProductUDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase 
Protein accessionXP_002186126 
Protein GI219113085 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.69787 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATTG ACGCTCCACG CTCCCGTCCA CGTCGGCCGT TCCGACGTTG GCTGGTCCTC 
GGTGTGACCA TCCTTTCCTA CTCCGGGATG TTCGTGCGAG CGCAACAACA ATCACCGCAA
CAATCGCTGT CCGGCGACGT CCTCCGGGAC GGACCGGCGT ACTGGCAACG CGGTCGGGAT
CATTTCCGGG ACGGCCGCTA CGACGACGCC GCCACTGATT TGTGGAAGGC CGTACTCTTG
CACACGCAGA CACCACCCGC ACAAACGTAC GATGTACAGG ACGTCTTTCG GTTGTTCCTG
CAGTGCTACG TGGTGCGGGA CCGGGCCGCC GACGGATTGG CTTTCGTGGC GGGAGAATCC
TTCCGCCGGG GACAGGACGA CATGGGACGA CTCTACCTGC AACAGGCACT CGGCATGGAC
CCACGCAACG ATGCGGCGTT GCTCGTCCAA GCCGAATTCG GCGACGCCGT GGACCAGTCG
TTGTCGGCGT CGACAACGGC ACCCACGTCG CACGACAACC CCTTTCCGGG ACAAACGCCG
GAACAACTCT ACGAAGTCGC CAGTCGCCAA TTTTCCGACA AGAACTACGA AGCCTGTGCC
GACGTATTCG AACTGTCCTG CCAGCAATCG GGACGGAAAA TCGGACCCTC CTGTGCCAAC
GCCGTATACT GTCGAAACAT GTTGACGGAT TGGGGATTCA ACGGCACACA GTTTGACCGG
GACATGCAGA CCATTGCGAC GCTCGTCCGA ACCGAAACGG CGCAGTACCG ATTCCGACAC
GAAACCGACG CGAACCAGTT CGTGTGGCAG CGGGCGACGT CGCCCCATCC CCACATGATG
CTCGGTTACC CGGTAGATCC CTTGCTCAAG CGCTACGTCG CCGAGTCCGC GGCCTACTTG
GACGAACAAA TGGCACGCCT CGCCCACACC GCACCCACCG AGACCGCGTT GCCCTCTCTC
CCGCCGGGAC TACCCTACCA CGTCCACGAC GATCGCCAAC GGTTTGCTGA CGAACGCGCG
GCGGATCCTC ACGCCAAAAT ACGTGTCGGC TTTGTCGGAT CCGGCTTCAA CTCGAAAGCC
GTCCTCTATC TGTCCCAAGA TATGTTTCGA TTCTTCGGTC GCGAGTTCGA AATTCACGTC
TTTTCCTTTG GTCCACGGGA CCATCCCATG TTCATTGAGC GCGGCATGCG TGGCGTCGAT
TGGCGAGAGC GTGTCAAGTC CAACGTTCAC TTCTTTCACG ATTGCCAAGC CATGAAGCTG
GATCACATCA AAGCCGCACG CTTCATTCAC GACCAGAATA TACACATACT CATCGAATGG
GACGGATACG CACGTCAGGG CGAACGAGCG CAAGGTCTCT TTGCTCTACG ACCAGCCCCG
ATTCAGATCC TCCATCAAGA ATACCTGGGC ACCAGTGGGG CGCTCTACGT GGACTACCTC
TTTACCGATC AAGTGTCGTC ACCGCCATCC CTACAGCACC TGTACACGGA AAAACTCATC
TATTTGCCGA ACCATTTCTT CAGCAAAGGC CACGCCTACC AAAAGGAAGT CCGCGAGCCA
CGGTACGAAT ACCAACCCGT GACTCGTCCC CATCAGTTGG GGACGGGCTC TCCCCAAGAA
AATCGCTGTC TCGCTCCGCC CGACGTGGGA CCCACCGACG TTGCGTTTGT CTATTGCAAC
TTCAACAAAT TTCTCAAAAA CAACCCCGAA ACGGTCCGCG GCTGGATACA AATTCTACGG
CAGGTCCCCG ATTCGATCCT GTGCCTTCTG GACAACCCCC GCGACGGTAT CCCCTACCTC
CACAAATTCA TTCACGAAGC CGCCGGCACT TCCGACGGAA ATTCCCCGGA TTCCTTCCAA
CCGGGCGACG GGGACGACTT GGTAAACCGC GTACACTTTC TCCCCTGGGA GCCCAATCCC
TTCGATCACC AGCAGCGGAA TCGCGATTTC TGCAACGCCA TGTTGGATTC ACACCCCTAC
AACGGCCACA CGGTGGCGCA GGATGCCCTG TACGCGGGTG TCCCGATCGT AACCCGCAGC
GACGGCGACG ACATGAGTGC GCGGGTCACG ACGTCCGCCA ATCTGGTCCT GGGCTTGTCG
CATTTGAACG CCGTACACGG TCCGGCGCAG TACGTGGCGA TTGCCGTGGC GTTGGGGACC
AACGCCACGC TGTTTCGGGA AACCCGGGAG CGGTTGATCG GTACGGCACT CCAGCGGAAT
CCCATGCACC CGTACTGGGA TGTGGCTCGG TACGTACTGA ACTTTGAAAG CGGGTTGCGC
GTGGTTTGGG AACGTTTTCT TCGAGGCCAA GCGCCGGATC ACGTGGTCGT GGAGGAAACG
GCGGACGCCG CGCGGGGTAC GTACGACGAC AAGATTCGGG CGCATCCACC GCAAGGCAAC
CGGGCACGCC GTGAGCGGGC AGCGAACGAT GAACTGTAG
 
Protein sequence
MTIDAPRSRP RRPFRRWLVL GVTILSYSGM FVRAQQQSPQ QSLSGDVLRD GPAYWQRGRD 
HFRDGRYDDA ATDLWKAVLL HTQTPPAQTY DVQDVFRLFL QCYVVRDRAA DGLAFVAGES
FRRGQDDMGR LYLQQALGMD PRNDAALLVQ AEFGDAVDQS LSASTTAPTS HDNPFPGQTP
EQLYEVASRQ FSDKNYEACA DVFELSCQQS GRKIGPSCAN AVYCRNMLTD WGFNGTQFDR
DMQTIATLVR TETAQYRFRH ETDANQFVWQ RATSPHPHMM LGYPVDPLLK RYVAESAAYL
DEQMARLAHT APTETALPSL PPGLPYHVHD DRQRFADERA ADPHAKIRVG FVGSGFNSKA
VLYLSQDMFR FFGREFEIHV FSFGPRDHPM FIERGMRGVD WRERVKSNVH FFHDCQAMKL
DHIKAARFIH DQNIHILIEW DGYARQGERA QGLFALRPAP IQILHQEYLG TSGALYVDYL
FTDQVSSPPS LQHLYTEKLI YLPNHFFSKG HAYQKEVREP RYEYQPVTRP HQLGTGSPQE
NRCLAPPDVG PTDVAFVYCN FNKFLKNNPE TVRGWIQILR QVPDSILCLL DNPRDGIPYL
HKFIHEAAGT SDGNSPDSFQ PGDGDDLVNR VHFLPWEPNP FDHQQRNRDF CNAMLDSHPY
NGHTVAQDAL YAGVPIVTRS DGDDMSARVT TSANLVLGLS HLNAVHGPAQ YVAIAVALGT
NATLFRETRE RLIGTALQRN PMHPYWDVAR YVLNFESGLR VVWERFLRGQ APDHVVVEET
ADAARGTYDD KIRAHPPQGN RARRERAAND EL