Gene Rleg_3818 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3818 
Symbol 
ID8014646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3892336 
End bp3893595 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content60% 
IMG OID644826387 
Productaspartate kinase 
Protein accessionYP_002977600 
Protein GI241206504 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCG GCGGCACGTC CGTCGCTGAC CTGGACCGCA TCAAGAACGT TGCCCGCCAT 
GTAAAACGCG AAGTCGATGC CGGGCACGAG GTGGCGGTGG TGGTGTCGGC TATGTCCGGC
AAGACCAATG AACTGGTCGG CTGGGTCCAG GGCACGCCGA AGGTGATCGG CGCCAATTCT
CCGTTCTACG ATGCGCGGGA GTATGATGCG GTGGTCGCAT CGGGTGAGCA GGTGACCTCC
GGTCTGCTGG CGATCGCATT GCAGGCGATG GATATCAATG CGCGTTCCTG GCAGGGATGG
CAGATCCCGA TCCGCACCGA CAACGCGCAC GGCGCGGCAC GGATCATGGA GATCGACGGT
TCCGACATCG TCAAACGGAT GGGCGAGGGG CAGGTCGCCG TGATCTCCGG CTTCCAGGGG
CTGGGACCCG ACAACCGTAT CGCGACGCTC GGGCGCGGCG GGTCGGACAC TTCGGCCGTC
GCAATCGCGG CAGCTGTCAA GGCGGATCGC TGCGATATCT ATACCGATGT CGACGGCGTC
TACACGACCG ACCCGCGCAT CGTGCCAAAG GCGCGCAGGC TGAAGAAGAT CGCCTTCGAG
GAAATGCTGG AAATGGCCTC GCTCGGCGCC AAGGTACTGC AGGTGCGCTC GGTCGAGCTT
GCCATGGTAC ACAAGGTGCG CACCTTCGTG CGCTCATCAT TCGAAGATCC CGATGCTCCG
GGCATGGGCG ATCTGTTGAA CCCGCCCGGA ACGCTGATTT GTGACGAGGA AGAAATCGTG
GAACAGGAAG TAGTCACCGG CATCGCCTAT GCCAAGGATG AGGCTCAGAT CTCGCTTCGC
CGTCTTGCCG ACCGGCCGGG CGTTTCCGCG GCGATCTTCG GGCCGCTCGC CGAATCCCAT
ATCAATGTCG ACATGATCGT CCAGAATATT TCCGAGGACG GGTCGAAGAC CGACATGACC
TTCACCGTGC CATCAGGCGA CGTCGAGAAG GCGATCAAGG TGCTCGGCGA CCATAAGGAG
AAGATCGGCT ACGATGTCGT GCAGAACGAA TCGGGGCTGG TGAAAGTATC GGTCATCGGC
ATCGGCATGC GCAGCCATGC GGGCGTTGCC GCTACCGCAT TTCGTGCACT TGCCGAAAAA
GGCATCAACA TCAAGGCGAT CACCACCTCC GAGATCAAGA TTTCCATCCT GATCGACGGT
CCCTATGCAG AACTCGCTGT CAGGACTTTG CATTCCTGCT ACGGTCTGGA TAAGAATTGA
 
Protein sequence
MKFGGTSVAD LDRIKNVARH VKREVDAGHE VAVVVSAMSG KTNELVGWVQ GTPKVIGANS 
PFYDAREYDA VVASGEQVTS GLLAIALQAM DINARSWQGW QIPIRTDNAH GAARIMEIDG
SDIVKRMGEG QVAVISGFQG LGPDNRIATL GRGGSDTSAV AIAAAVKADR CDIYTDVDGV
YTTDPRIVPK ARRLKKIAFE EMLEMASLGA KVLQVRSVEL AMVHKVRTFV RSSFEDPDAP
GMGDLLNPPG TLICDEEEIV EQEVVTGIAY AKDEAQISLR RLADRPGVSA AIFGPLAESH
INVDMIVQNI SEDGSKTDMT FTVPSGDVEK AIKVLGDHKE KIGYDVVQNE SGLVKVSVIG
IGMRSHAGVA ATAFRALAEK GINIKAITTS EIKISILIDG PYAELAVRTL HSCYGLDKN