Gene Rleg_4130 details

Gene Information

Locus tag: Rleg_4130
Symbol:
ID: 8014925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4212584
End bp: 4213735
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 66%
IMG OID: 644826700
Product: N-acetylglucosamine-6-phosphate deacetylase
Protein accession: YP_002977910
Protein GI: 241206814
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1820] N-acetylglucosamine-6-phosphate deacetylase
TIGRFAM ID: [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase
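
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the final codon is an untranslated stop; both assumptions agree with the listed values but are not stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4212584
end_bp = 4213735

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1152
print(protein_length_aa)  # 383
```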


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.318316
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.204694
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCCGCA AGATCTTCCT CGGCGCCCGC ATCTTCGACG GCGAGCACTT CCATGACGAC 
AAAGCCCTCA TCGTTGCCGG CGGCCGCGTC GAAGCGATCG TCGCGAGAAA CGATCTGCCG
GACGGCGAAG TGGTGACGCT TGCCGGTGGT GTTCTGTCGG CCGGCTTCAT CGATGCGCAG
GTCAATGGCG GCGCCGGGCG GATGCTGAAC GACGAGCCTT CCGCCGCCTC GATGGACATT
ATCGCCGGCG GGCACCGGCC CTATGGTACG ACGTCGCTGC TGCCAACGCT GATCACCGAT
ACATCAGAGG CCTCCATTGC CGCGATCGAG GCGGCCAAGG AGGCAGTGAA AATGAACCGC
GGCGTCGCCG GTCTGCATCT CGAAGGTCCG CACTTGGCGC CTGCGAGGAA GGGCGCGCAT
CTGGCCGAAC TGATGCGGCC GGTGGAGGAC CGCGACGTCA AGGCTTTCAT CCGGGCGCGC
GAGGCGATCG GCACGCTGCT GGTCACCATG GCCGCCGAGC AGGTGACGGT TGCCCAGGTG
CGCGAACTTG CGGAAGCCGG CGTCACCGTC AGCATCGGCC ATTCCGATTG TTCGAGCGAG
GCGGCGGAAG ACCGTTTCGA TGCCGGCGCG CGGGGCGTCA CGCATCTCTT CAACGCCATG
AGCCAGCTGG GACACCGTGC GCCCGGTCTT GTCGGCGCGG CAATCGATCA TCCCTCAACC
TGGTGCGGCA TCATCGCCGA TGGCCATCAC GTAGATCCGA AGGCCTTGCG CACAGCGCTC
CGCGCCAAAC GCGGCGAAGG CAAGCTGTTC TTCGTCACCG ACGCGATGTC GCTCGTCGGG
TCGGAGAAGG ATTCGTTCAC GCTGAACGGG CGCACCGTCC GGCGTGAAAG GGGCGGCTTT
TGCTCGAAGC TGGTGCTGTC CGACGGCACG CTGGCCGGTT CCGATGTCGA CATGATCTCG
ACGATCCGTT ACGGCGTCAC CTATCTCGAC CTGACGCTCG CCGAGGCCTT GCGCATGGCG
ACCCTTTATC CCGCGCGGTT TCTCAGGCTT GCCGATCGCG GCCATCTCTC GCCGGGCGCG
CGTGCCGATC TCGTGCATCT CACCGATGCG CTTGCCGTCA CCGCCACCTG GCTCAGCGGC
GAAGCGGCCT AA
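
The GC content field in the record can in principle be recomputed from the gene sequence above. The sketch below shows one way to do that, assuming the value is simply the percentage of G and C bases over the whole sequence; the record does not say how its figure was derived or rounded. The function name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # The sequence is printed in space-separated blocks of ten bases,
    # so remove all whitespace before counting.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


# Small illustration on a toy string; paste the full gene sequence
# in its place to compare against the GC content field.
print(gc_percent("ATGC"))
```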
 
Protein sequence
MVRKIFLGAR IFDGEHFHDD KALIVAGGRV EAIVARNDLP DGEVVTLAGG VLSAGFIDAQ 
VNGGAGRMLN DEPSAASMDI IAGGHRPYGT TSLLPTLITD TSEASIAAIE AAKEAVKMNR
GVAGLHLEGP HLAPARKGAH LAELMRPVED RDVKAFIRAR EAIGTLLVTM AAEQVTVAQV
RELAEAGVTV SIGHSDCSSE AAEDRFDAGA RGVTHLFNAM SQLGHRAPGL VGAAIDHPST
WCGIIADGHH VDPKALRTAL RAKRGEGKLF FVTDAMSLVG SEKDSFTLNG RTVRRERGGF
CSKLVLSDGT LAGSDVDMIS TIRYGVTYLD LTLAEALRMA TLYPARFLRL ADRGHLSPGA
RADLVHLTDA LAVTATWLSG EAA
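
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline: it uses the amino acid assignments that NCBI tables 1 and 11 share and does not model table 11's alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above; both begin in frame.
first_line = "ATGGTCCGCA AGATCTTCCT CGGCGCCCGC ATCTTCGACG GCGAGCACTT CCATGACGAC"
last_line = "GAAGCGGCCT AA"
print(translate(first_line))  # MVRKIFLGARIFDGEHFHDD
print(translate(last_line))   # EAA
```

The two outputs match the first twenty and last three residues of the protein sequence listed above.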