Gene Plav_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_3066 
Symbol 
ID5455743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp3271601 
End bp3272959 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content61% 
IMG OID640878655 
Productnucleotidyl transferase 
Protein accessionYP_001414330 
Protein GI154253506 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGA ACGCTCCCGG CGCCGTCATT CTTGCCGCGG GCAAAGGCAC GCGGATGAAA 
TCGCGCTTGC CCAAGGTGCT GCATCCCATC GCCGGCAAGC CCATGCTGGG CCATGTTCTG
TCCGCTGTCA GCGCGCTTGG CAGCGAACGT CCGGTGCTGG TTGTCGGACC GGGCATGGAC
GAAGTCGCCA CTTATGCCCG TGGTCTCGTT TCCGGTCTTA CTATTGCGGT CCAGGAAAAG
CAGCTCGGTA CGGGCGACGC GGTTCGTGCC GCCGCACCGC ATATCGATAA GAAAGAAAGT
GTGGTCCTCG TCGTCTTTGG CGATACGCCG CTTGTCCGCG CGGAAACGCT GGCGGATATG
ACGCGGCGTT GCGAAGAAGG CAGCGACATC GTCGTTCTGG GTTTTGAGGC GGCCGATCCC
ACCGGTTACG GACGGCTCAT CCTCGATGGC AACGATGTGG TCAGGATTGT CGAGCACAAG
GATGCGAGCG AGGAAGAGCG CAAGAACAAG CTCTGTTTCG GCGGGCCGAT GGCGGTTCGC
GCCGCGCATC TGCCGGCGCT CCTCGCCAAG CTCACCAACA AGAACGCTCA GGGTGAATTC
TACATGACGG ACTTCGTCGC GCATGGCCGC GCCGCCGGTC TTGTCTGCAG CGCGGTCTTC
TGTCTCGAAG CGGATATGCA GGGTGTCAAC AGCCGCGCTG ATCTGGCGGC GGCCGAAGCC
ACCATGCAGC AAAGGCTGCG CATGGCCGCC ATGGCCGGTG GCGTTACGAT GCTCGATCCG
TCGAGCGTTT ACTTGAGCAT GGATACCGAG TTCGGAGAGG ACGTGACGGT GGGCCAGAAT
GTTGTTTTCG GGCCCGGTTG CGTCATTGCC AACGGCGTCA CTATCAAGGC TTTCTCTCAC
CTCGAAGGTG CGCATGTTGC AGAAGGAGCG GAGATCGGTC CTTTTGCACG CATCCGGCCG
GGTTCCGAAA TCGGCCGCAA GGCGCGGATC GGCAACTTCG TGGAAACAAA GAAGGCGCGG
ATCGAGGACG GCGCCAAGGT CAATCATCTG TCCTATATCG GCGATGCGCG CGTCGGTGCG
GGCGCCAATA TCGGCGCGGG TACCATCACG TGCAATTACG ACGGATATAA CAAGTTTTTC
ACGGATATCG GTGCTGGCGC CTTTATCGGC TCCAACAGTT CGCTGGTCGC TCCGGTCAGC
ATCGGCGATG GGGCTTATCT CGGCTCCGGC AGTGTCGTGA CAAAAGATGT AGCCGCGGAT
GCTTTGGGCG TTGCCCGCGC AAGACAATTC GAAAAACCGG GCTGGGCGGC GGCGTTCCAC
GCGAAACACA AAGACAAGAA GAAGGCTTCA GGCGAATAG
 
Protein sequence
MTTNAPGAVI LAAGKGTRMK SRLPKVLHPI AGKPMLGHVL SAVSALGSER PVLVVGPGMD 
EVATYARGLV SGLTIAVQEK QLGTGDAVRA AAPHIDKKES VVLVVFGDTP LVRAETLADM
TRRCEEGSDI VVLGFEAADP TGYGRLILDG NDVVRIVEHK DASEEERKNK LCFGGPMAVR
AAHLPALLAK LTNKNAQGEF YMTDFVAHGR AAGLVCSAVF CLEADMQGVN SRADLAAAEA
TMQQRLRMAA MAGGVTMLDP SSVYLSMDTE FGEDVTVGQN VVFGPGCVIA NGVTIKAFSH
LEGAHVAEGA EIGPFARIRP GSEIGRKARI GNFVETKKAR IEDGAKVNHL SYIGDARVGA
GANIGAGTIT CNYDGYNKFF TDIGAGAFIG SNSSLVAPVS IGDGAYLGSG SVVTKDVAAD
ALGVARARQF EKPGWAAAFH AKHKDKKKAS GE