Gene RPB_2855 details


Gene Information

Locus tag: RPB_2855
Symbol: glmU
ID: 3910648
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3254379
End bp: 3255737
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 67%
IMG OID: 637884755
Product: bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase
Protein accession: YP_486468
Protein GI: 86749972
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
TIGRFAM ID: [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase
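
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the record above.
START_BP = 3254379
END_BP = 3255737


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1359
print(protein_length(length_bp))  # 452
```

Both results match the Gene Length and Protein Length fields listed in the record.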


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.548657
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGA GAACCAGCCT GACGATCGTA CTTGCTGCCG GAGAGGGCAC CCGGATGCGA 
TCGTCGCTAC CAAAGGTGCT TCACCCGGTG GCCGGCCGCC CGCTGTTGGC TCATGTGCTG
GCCGCAGCAC CGCATGGCGA AGGCGACAAA CTCGCCGTGG TGATTGGCCC CGATCATCAG
GCGGTCGCCG ACGAGGCCAG ACGAATCCGA CCCGATGCGC AGATCTACGT GCAGACGGAG
CGTCTCGGCA CCGCGCACGC CGTACTGGCG GCGAAGCAGG CGGTCGCGGG CGGCGCCGAC
GATCTGCTGA TCGCCTTCGG CGACACCCCG CTGATTTCGG CGGAGACCTT CGCCCGCCTG
CGCGAACCGC TGCGCAATGG CGGCGCGCTG GTGGTGCTCG GCTTCCGGGC GGTCGATCCG
ACCGGTTACG GCCGGCTGGT CGTCGAGGAC GATCGATTGG CCGCGATCCG CGAGCAGGCC
GATGCCAGCC CGGACGAACT GAAGATCACG CTGTGCAATG CCGGCGTGAT GGCGATCGAC
GGGCGCATCG CGCTCGACGT GCTGGGCGCG ATCGGCAACG CCAACAAGAA GGGCGAGTAC
TATCTGACTG ACGCGGTCGG CATCGTGCGC GAACGCGGCC TCGTCGCCGG CGTGATCGAG
ACCGACGAGG ACGAGGTCCG CGGCATCAAC ACAAAGGCGC AACTCGCCGA GGCCGAAGCG
GTGATGCAGG CGCGGCTGCG CCAGGCCGCG ATGGCCGCGG GTGTGACGCT GATCTCGCCG
GAGACGATTC ACCTCGCCGC CGATACCACG TTCGGCCGGG ACGTGACGAT CGAACCATTC
GTGGTGATCG GCCCCGGCGT CAGCATCGGC GACGGTGCGG TGATCCATTC GTTCTCGCAC
ATCGTCGACA CCTCGCTCGG CAAGAATACA TCGATCGGCC CCTATGCGCG GCTGCGGCCC
GGCACGTCGC TCGGCGACGG CGCCAAGATC GGCAATTTCG TCGAGACCAA AGCAGCGCAG
ATCGATGCCG GCGCCAAGGT CAATCATCTG ACCTATATCG GCGACGCGCA TATCGGCCCG
GGCGCCAATA TCGGCGCCGG AACGATCACC TGCAATTACG ACGGCTTCAA CAAGCACAAG
ACCGAAATCG GCGCCGGCGC CTTCGTCGGC TCGAACTCGT CGCTGGTCGC GCCGGTCAGG
ATCGGCGCGG GCGCGTATAT CGGTTCGGGA TCGGTGATTA CCAGGAACGT GCCGGACGAC
GCGCTGGCGG TCGAACGCAA CGACCAGAGT GTGCGCGAGG GCTGGGCAAC GCGGTTCCGT
GAAGCCAAGC TGCGCGCGAA GAAACCCAAA GCCGGCTGA
 
Protein sequence
MTTRTSLTIV LAAGEGTRMR SSLPKVLHPV AGRPLLAHVL AAAPHGEGDK LAVVIGPDHQ 
AVADEARRIR PDAQIYVQTE RLGTAHAVLA AKQAVAGGAD DLLIAFGDTP LISAETFARL
REPLRNGGAL VVLGFRAVDP TGYGRLVVED DRLAAIREQA DASPDELKIT LCNAGVMAID
GRIALDVLGA IGNANKKGEY YLTDAVGIVR ERGLVAGVIE TDEDEVRGIN TKAQLAEAEA
VMQARLRQAA MAAGVTLISP ETIHLAADTT FGRDVTIEPF VVIGPGVSIG DGAVIHSFSH
IVDTSLGKNT SIGPYARLRP GTSLGDGAKI GNFVETKAAQ IDAGAKVNHL TYIGDAHIGP
GANIGAGTIT CNYDGFNKHK TEIGAGAFVG SNSSLVAPVR IGAGAYIGSG SVITRNVPDD
ALAVERNDQS VREGWATRFR EAKLRAKKPK AG
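
The GC content field in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted exactly as displayed here (blocks of 10 bases separated by spaces and line breaks); `gc_percent` is a hypothetical helper name, and the sample input is only the first row of the gene sequence, not the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, used only as sample input.
first_row = "ATGACGACGA GAACCAGCCT GACGATCGTA CTTGCTGCCG GAGAGGGCAC CCGGATGCGA"
print(round(gc_percent(first_row), 1))
```

Running the function on the full 1359 bp sequence, rather than on a single row, is what would be comparable to the reported GC content value.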