Gene Rleg_3860 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3860 
Symbol 
ID8014684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3930727 
End bp3931995 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content63% 
IMG OID644826430 
Productdiaminopimelate decarboxylase 
Protein accessionYP_002977642 
Protein GI241206546 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0818449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCATT TCGAGTACCG CGACGGCGTC CTTCATGCGG AGAACGTCCC CGTTCCCGAG 
ATCGCCAAGG CGGTCGGCAC CCCTTTCTAC GTCTACTCCA CCGCGACGCT GGAGCGCCAT
TACCGCGTCT TTTCGGAAGC CTTCGCCGAC ATGGACTCCA TGGTCTGCTA TGCGATGAAG
GCGAATTCGA ACCAGGCGGT GCTGAAGACG CTGGGCCGCC TCGGCGCCGG CATCGATGTC
GTCTCTGAGG GCGAACTGCG CCGCGCGCTT GCCGCCGACA TTCCAGCGAG CCGCATCATG
TTCTCAGGTG TCGGCAAGAC GCCGTCCGAG ATGGATTTCG CCCTCGAAGC CGGTATCTAC
TGTTTCAACG TCGAATCCGA GCCCGAGCTC GAGATCCTCA ATCAGCGCGC CGTCAGCGCG
GGCAAGAAAG CGCCGGTCTC CTTCCGCATC AACCCTGATG TCGATGCGAA GACGCATTCC
AAGATCTCGA CCGGCAAGAA GGAAAACAAG TTCGGCATCT CCTGGGAGCG CGCCCGCGCC
ATCTATGCCC ATGCCGCCAA GCTGCCGGGC ATCGAGGTCA CCGGCATCGA CATGCATATC
GGCAGCCAGA TCACCGAATT GCAGCCCTTC GACGACGCCT TCAAGCTGCT GCACGACCTT
GTCGCGACGC TGCGCGCCGA CGGCCACACC ATCCATCACG TCGATATCGG CGGCGGCCTC
GGTATCCCCT ACAAGGACGA CAACAATCCG CCGCCGCTGC CGGACGCCTA TGCGGCAATC
GTCAAGAACC AGCTGCGCGG TCTAAACTGC AAGATCATCA CCGAACCCGG ACGCCTGATC
GTCGGCAATG CCGGCATCCT CGTGACCGAG GTCCTCTATG TGAAGGATGG CGGCGAAAAG
ACCTTCGTCA TCGTCGACGG CGCGATGAAC GATCTCATCC GCCCGACGCT TTACGAGGCC
TATCACGAGA TTCGGCCGGT AACGATTTCG GCGGCCAACG CGCCGCGCAT CCGCGCCGAT
GTCGTCGGCC CCGTTTGCGA GACCGGCGAC TATCTGGCGC TCGACCGCGA GATGGCGATG
CCGAAGCCCG GCGACCTGAT GGCCGTCAGC ACCGCCGGCG CCTACGGCGC AGTCCAGGCC
GGCACCTATA ACAGCCGGCT GCTGGTGCCC GAAGTTCTGG TCAGGGGCAG CGATTTCCAC
ACGATTCGAC CGCGTAGAAC CTATGCCGAA CTGATCAGCC TCGACTCCGT TCCGGCCTGG
CTCGACTGA
 
Protein sequence
MNHFEYRDGV LHAENVPVPE IAKAVGTPFY VYSTATLERH YRVFSEAFAD MDSMVCYAMK 
ANSNQAVLKT LGRLGAGIDV VSEGELRRAL AADIPASRIM FSGVGKTPSE MDFALEAGIY
CFNVESEPEL EILNQRAVSA GKKAPVSFRI NPDVDAKTHS KISTGKKENK FGISWERARA
IYAHAAKLPG IEVTGIDMHI GSQITELQPF DDAFKLLHDL VATLRADGHT IHHVDIGGGL
GIPYKDDNNP PPLPDAYAAI VKNQLRGLNC KIITEPGRLI VGNAGILVTE VLYVKDGGEK
TFVIVDGAMN DLIRPTLYEA YHEIRPVTIS AANAPRIRAD VVGPVCETGD YLALDREMAM
PKPGDLMAVS TAGAYGAVQA GTYNSRLLVP EVLVRGSDFH TIRPRRTYAE LISLDSVPAW
LD