Gene Pnec_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnec_0104 
Symbol 
ID6183702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolynucleobacter necessarius subsp. necessarius STIR1 
KingdomBacteria 
Replicon accessionNC_010531 
Strand
Start bp96320 
End bp97594 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content49% 
IMG OID641670838 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_001797038 
Protein GI171462925 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.332697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value0.845573 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAT TACGAATGGT TGGCGGAACC CCTCTCAAAG GAGAGGTTGT GATTGCTGGC 
GCTAAAAATG CCGCATTGCC CATTCTGTGT GCTTGTCTAT TAACCGATCA GCCAGTTGTT
CTGCGCAACG TTCCTGATTT GCAGGATGTG CGTACTATGC TCAAGCTTTT GCAAGAGATT
GGCGTCACAA TAGATTTCCC AAGCGCGGGT GATCGCAGTT ACATGGTGTT GAACGCGGCA
GTCATTAAGA GCTCTGAAGC TACATATGAG ATGGTGAAAA CCATGCGTGC CTCCATCTTG
GTTCTGGGGC CACTCCTTGC CAGAATGCAT AGCGCCAAGG TTTCTTTGCC TGGCGGCTGC
GCTATTGGTG CGCGTCCAGT AGATCAGCAC ATCAAAGGCT TGAAGGCGAT GGGTGCAACC
ATCAAGATTA AGAGCGGTTA CATACAGGCT GAAACAAAAC CGCAGTCAGA TCGATTGAAG
GGCGCCTCGA TCTTGACAGA CATGATTACG GTTACAGGTA CTGAGAATTT ATTAATGGCC
GCTACTTTGG CTTCAGGAAC AACGGTATTA GAGAATGCTG CGCGAGAACC TGAGGTTGGC
GACTTGGCGG AATTGTTAGT GAAGATGGGC GCCAAGATTT CTGGCATCGG CAGCGATCGC
TTGGTGATTG AAGGCGTTGA TAAACTTCAT GGCGCAGAGC ATTCAGTGAT TCCAGATCGT
ATTGAGGCCG GCACATTTTT ATGTGCAGTG GTTGCAACTG GTGGTGAGAT CACAGTCAAG
CACTGTCGCC CTGACACTTT AGATGCGGTT ATTGTGAAAT TAAAAGAAGC GGGTCTGCAA
ACGGAGATTG GCCCCGATTG GATTAAGGCG TCTATGCAGG GTCGCCCTAA AGCAGTTAAT
TTCCGCACCT CTGAATATCC AGCCTTTCCA ACAGATATGC AGGCACAGCT GATGACAGTA
AATGCAATTG CAGCTGGTAG CTCAATGATT ACCGAGACGA TTTTTGAAAA TCGCTTTATG
CATGTGCAGG AGCTTAATCG CTTGGGTGCT GACATTGCTA TTGAAGGAAA TACTGCAATT
GCTCAAGGGG TGGAAAAACT CTCTGGTGCC ATCGTGATGG CTACGGACCT ACGTGCTTCA
GCTAGTTTGG TGATTGCCGG ATTGGCGGCC CAAGGCGAAA CCCAGGTAGA CCGTATTTAC
CACTTGGATC GCGGCTATGA CCGTATGGAG CAGAAGTTAA CCCTTTTGGG CGCCAATATT
GAGCGTATCA AGTAA
 
Protein sequence
MDKLRMVGGT PLKGEVVIAG AKNAALPILC ACLLTDQPVV LRNVPDLQDV RTMLKLLQEI 
GVTIDFPSAG DRSYMVLNAA VIKSSEATYE MVKTMRASIL VLGPLLARMH SAKVSLPGGC
AIGARPVDQH IKGLKAMGAT IKIKSGYIQA ETKPQSDRLK GASILTDMIT VTGTENLLMA
ATLASGTTVL ENAAREPEVG DLAELLVKMG AKISGIGSDR LVIEGVDKLH GAEHSVIPDR
IEAGTFLCAV VATGGEITVK HCRPDTLDAV IVKLKEAGLQ TEIGPDWIKA SMQGRPKAVN
FRTSEYPAFP TDMQAQLMTV NAIAAGSSMI TETIFENRFM HVQELNRLGA DIAIEGNTAI
AQGVEKLSGA IVMATDLRAS ASLVIAGLAA QGETQVDRIY HLDRGYDRME QKLTLLGANI
ERIK