Gene Rleg2_5443 details

Gene Information

Locus tag: Rleg2_5443
Symbol:
ID: 6978537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 1086356
End bp: 1087309
Gene Length: 954 bp
Protein Length: 317 aa
Translation table: 11
GC content: 66%
IMG OID: 643394544
Product: dihydrodipicolinate synthetase
Protein accession: YP_002279362
Protein GI: 209547444
COG category: [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase
TIGRFAM ID:
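
The coordinate and length fields in this table agree with one another, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates (a common convention, but not stated on this page) and that the last codon of the CDS is a stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1086356
END_BP = 1087309


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 954 317
```

Under those assumptions the result matches the listed Gene Length (954 bp) and Protein Length (317 aa).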


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.344282
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00661581
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGAGCATCG CGATCATGCG CAAGGCGCTC ACGGGCGTTT CGGGTGTTCC CGTCACGGCC 
TATGACGGCA AAGGCGAAGT CGAACCGCGG ATTACGGCCA AGGTCTATGC GCGGGTGGCG
GCCGCCCGCA TTCACAACAT TGTCGCTGCC GGCAATACCG GGGAATTCTA CGCGCTGACG
CCGCAGGAAA TCCGGATCGT CCACGAAGCC GCGGTATCAG GCGTCGACGG CCGCGCGCCG
GTGACGGCGG CGATCGGCCG GTCGCTACGC GAGGCGATTG GCATGGCGCG GGATGCGGCC
GCGATCGGCG CAACCGCCGT CATGTCGCAT CAGCCCGTCG ATCCTTTCGC AGCACCTTCG
GCCCAGATCG GCTATTTCTG CAATCTTGCC GATGCGTCCA CCCTGCCGCT CGTCGCCTAT
GTCAGGGCCG AGGGTTTCGG TGTCGACGAT ATCGTCCGCC TCGCCAACCA CGGCAACATC
GCCGGCATCA AGTTTGCCAC GACTGATCTG ATGCTCTTGT CGCGCGCGAT CCCGGCGGCC
GATCCCGATG GTGCGCTGTT CGTCTGCGGC CTGGCGGAGA GCTGGGCGCC GACATTCACC
GCAGCCGGGG CGCGCGGCTT CACGTCGGGC CTCGTCAACG TTGCGCCGCA GCTTTCGCTT
GCCGTCCACG CCGCGCTCGA AAAAGGCGAC TTTGCCGCTG CACGGGCGAT CGTCAACACG
CTCGAGCCGT TCGAGCGGAT GCGAACCAAA TTCCGCAACG GCGCCAACGT GACGGTCGTG
AAAGAGGCCG TCACCTATTC CGGCCTCGAT GTCGGCCCCG TGCGCGTGCC GGGGTTGCCG
CTGCTCGACC AGCATGATCG CGAGGAACTT CATCGGCTGC TTCGAGGCTG GGAGGCCGAG
GGCAGCATTC AAACTGATCC GGACCGGCAG CAGTCCGCCA AGGCGACCGG CTGA
 
Protein sequence
MSIAIMRKAL TGVSGVPVTA YDGKGEVEPR ITAKVYARVA AARIHNIVAA GNTGEFYALT 
PQEIRIVHEA AVSGVDGRAP VTAAIGRSLR EAIGMARDAA AIGATAVMSH QPVDPFAAPS
AQIGYFCNLA DASTLPLVAY VRAEGFGVDD IVRLANHGNI AGIKFATTDL MLLSRAIPAA
DPDGALFVCG LAESWAPTFT AAGARGFTSG LVNVAPQLSL AVHAALEKGD FAAARAIVNT
LEPFERMRTK FRNGANVTVV KEAVTYSGLD VGPVRVPGLP LLDQHDREEL HRLLRGWEAE
GSIQTDPDRQ QSAKATG
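
Both sequences are printed in space-separated blocks of ten characters. The following is a minimal sketch in Python for collapsing that layout into one contiguous string and computing a GC fraction. The names `collapse_blocks` and `gc_fraction` are hypothetical helpers, not part of any database tool, and only the first line of the gene sequence is used as input here.

```python
def collapse_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as printed above.
first_line = "ATGAGCATCG CGATCATGCG CAAGGCGCTC ACGGGCGTTT CGGGTGTTCC CGTCACGGCC"

seq = collapse_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to all sixteen lines of the gene sequence, the collapsed string can be compared against the Gene Length and GC content fields in the Gene Information table.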