Gene Rleg2_4205 details


Gene Information

Locus tag: Rleg2_4205
Symbol:
ID: 6982978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4379897
End bp: 4382818
Gene Length: 2922 bp
Protein Length: 973 aa
Translation table: 11
GC content: 65%
IMG OID: 643398936
Product: hypothetical protein
Protein accession: YP_002283693
Protein GI: 209551776
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases
        [COG2334] Putative homoserine kinase type II (protein kinase fold)
TIGRFAM ID:
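
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so explicitly, but the numbers agree with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4379897
end_bp = 4382818
gene_length_bp = 2922
protein_length_aa = 973

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed to be the
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)


def record_is_consistent() -> bool:
    """Return True if the coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa
    )


print(span, codons, record_is_consistent())  # prints: 2922 974 True
```

Under these assumptions the record is internally consistent: 2922 bp gives 974 codons, and removing the stop codon leaves 973 residues.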


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACG AGGCGCTTGT TGATCGCATG TCGCTGCCGC GTCCCAACGT GACCGTTGCC 
GACGCGGAAG AGATCCTTCT TGCCCATTAC AGCCTGTCAG GCACCATCGC CGAACTCGGC
AGTCAGCAGG ATCGGAACTA CCGCGTCGAT AGCGATCGCG GCCGCTACGT CCTGAAGATC
TGCCATGCTG CCTACGAGAC CCGCGAACTC GAAGCGCAGA ATGCCGCGAT CCATCACCTC
AAGAGCAAGC AGGATGCGCC ACGCGTTCCC AATGTGATCG CCAGCAATGA GGGGCGGGAG
ATCGTCGTGC TGACGGTGCG CGGGCAAGGC TACCAGGTCC GGCTGCTGGA ATATCTGGAA
GGTCAGGGGC TGACGGAACT GACCTACCTG GCGCCGGCTT CCGTTGCGGC GCTCGGCGCG
CTCTGCGCCA AGCTGGCGCA GGCGCTTGCC GATTTCAACC ATCCCGGCCT CGACCGCAGC
CTCCAATGGG ATCTGAGGCG GGCCGGGCCG GTCGCGGTGC AGCTGCTTTC CGCCATCACC
GACAGTGCCG CGCGCGACCG GATCGCCAAG ACCATGGTGA TGGCCGTTCG CCGCATCCAG
CCCTTGGCGC CGTCGCTTCG GCTGCAGGCG GTGCATCACG ACGTCACCGG CGACAATGTC
GTCGGTCATC GCAGTGCCCA TGGCCATATC ATCCCCGACG GAGTGATCGA TTTCGGCGAC
ATCATCCGCG GCTGGCTGGT CGGCGACCTC GCCGTCACCT GCGCCTCGCT GCTGCATCAC
GCAGAGGGCG ACCCCTTTCA TATCCTGCCG GCCGTCACTG CCTATCAGGC AATCTATCCG
CTGACCGAGG AGGAATTGAA GGCGCTGTGG CCATTGATCG TCGCACGCGC GGTCATTCTC
GTCGCCAGCA GCGAGCAGCA GATTTCGATC GATCCCGAGA ATGACTATGT CCGCGGCAAT
CTCGACCGCG AGCGGGCGAT CTTCGATACG GCGATGTCCG TTCCCTTCGA ATTGATGGAA
GCCGCAATCC TCCAGGCAGC CGGCGCGGAT GTCGCCACCC CGGAACCATC GGGATGGCTG
CCGCTGCTGC CTGACATCAA TCCCGCCGGG ATCGCCTATG TCGATCTTGG GGTGCGGAGC
CCGCATCTTT CCGCCGGCAA CTGGCTGAAT GCCGACATGG ACTGGCGGCT GCTAGCCCGC
GCTGCGACCG AAAACGGCAC GGCAGCGACG CGCTACGGCG AATATCGGCT TTCCCGGGCG
GGGACCGCCG GAAGACAGGC GACCTGCGCT CTCCATATCG ATATATGCCT TGCCGCCGGC
AGCGCGGTCG CAGCACCCTT TGCCGGCCGT ATCGGCTGGA AGGACCAGCA TCTGACACTG
ACCGGCGACG GCATGACCCT GCACCTCGAC GGGCTCGACC TCTCGGTCGA GGATGACGCC
GAGCTTGCCG GCGGCGACGC GCTCGGCACG GTTGTCGGCG AGGCTTCGTC GCTCGGCGGG
CTGCGCGCCC AGCTTTGCCG CGTCGCGGGG CTCGAGCCGC CGCTCTTTGC CACAGCGCGC
GAGGCGGGGG CCTGGTCGGC GCTCTGCCCT TCTCCCTCGC TGCTTCTCGG CCCTGGAGCC
TATGCGCCGA AACCTGAAAC CGCCGAGCTG CTCGCCAGGC GGCAGGCACA TCTGGCGAGG
GCGCAGAAGA ATTATTACGC GGCGCCGCCG CAGATCGAGC GCGGCTGGAA GGAGCATCTC
TTCGATGTCG AGGGCCGCGC CTATCTCGAC ATGGTCAACA ACGTCTCCAT TCTCGGCCAT
GGCCATCCCA GGCTTGCGGC GGCGATCAGC GCCCAGTGGC TGCGGCTCAA CACCAACTCA
CGCTTCCACT ATGCCGCGAT CACGGAATTT TCCGAGCGGC TCGCAGCACT TTCGCCCGAC
GGGCTCGATA CGGTCTTCCT GGTCAACAGC GGCTCGGAGG CGAACGATCT GGCGATCCGG
CTGGCGCAGG CTCATAGCGG CGCGCGCAAC ATGCTTTGCC TGCTCGAAGC CTATCATGGC
TGGTCGTCGG CAAGTGACGC CGTCTCCACC TCGATCGCCG ACAATCCGCA GGCGCTGACC
ACCCGGCCGG ACTGGGTGCA TGCCGTCGCG TCGCCGAATA CCTATCGCGG CGCATTCCGG
GGTCCCGATA CGGCGGCCGG CTATCTCAGC GCGATAACGC CGGTGCTGGA GGCGATCGAC
GCCGGCGGCG CGGGCCTTGC CGGCTTCATC TGCGAATCGG TCTACGGCAA TGCCGGCGGC
ATTCCGCTGC CGGATGGGTA TCTCGGCCAG GTCTATGCGC AGGTGCGCGC CCGCGGCGGC
CTCTGTATCG CCGATGAAGT GCAGGTCGGT TATGGCAGGC TCGGCCATTA TTTCTGGGGC
TTCGAACAGC AAGGGGTCGT GCCCGACATC ATCACCATCG CCAAGGGCAT GGGCAACGGC
CATCCGCTCG GCGCGGTCAT CACCACCCGG GAAATCGCCG GGTCGCTGGA GAAGGAGGGC
ACGTTCTTTT CCTCCACCGG CGGCAGCCCG GTCAGTTGCG TCGCCGGCAT GACGGTGCTC
GACATCATGG CCGAGGAAAA GCTGCAGGAA AATGCCCGAG AGGTCGGCGA TCATCTGAAG
GCGCGGCTTG CCGCGCTGAT CGACCGCCAT CCGATCGCCG GCGCCGTGCA CGGCATGGGG
CTCTATCTCG GGCTCGAATT CGTCCGCGAC AGAACGACGC TGGAGCCGGC GAGCGAAGAG
ACGGCAGCAA TCTGCGAGCG GTTGCTGACC CTCGGCGTCA TCATGCAGCC GACCGGCGAT
CACCAAAATG TGCTGAAGAT CAAACCGCCC CTCTGCCTCA GCATCGAAAG CGCGGATTTC
TTCGCGGACA TGCTGGAAAA GGTGCTCGAT GAAGGCTGGT GA
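
The gene sequence above is printed in blocks of ten bases separated by spaces, so it needs to be joined before any analysis. The following is a minimal sketch, using only the Python standard library, of how one might strip the block formatting and compute a GC fraction comparable to the "GC content" field. The function names are hypothetical, and only the first line of the sequence is used as demo input, so its result is not expected to equal the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGACCGACG AGGCGCTTGT TGATCGCATG TCGCTGCCGC GTCCCAACGT GACCGTTGCC"

print(len(clean_sequence(first_line)))     # number of bases after joining
print(round(gc_fraction(first_line), 2))   # GC fraction of this line only
```

Passing the full gene sequence as a single string to `gc_fraction` should give a value that can be compared with the GC content reported in the Gene Information section.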
 
Protein sequence
MTDEALVDRM SLPRPNVTVA DAEEILLAHY SLSGTIAELG SQQDRNYRVD SDRGRYVLKI 
CHAAYETREL EAQNAAIHHL KSKQDAPRVP NVIASNEGRE IVVLTVRGQG YQVRLLEYLE
GQGLTELTYL APASVAALGA LCAKLAQALA DFNHPGLDRS LQWDLRRAGP VAVQLLSAIT
DSAARDRIAK TMVMAVRRIQ PLAPSLRLQA VHHDVTGDNV VGHRSAHGHI IPDGVIDFGD
IIRGWLVGDL AVTCASLLHH AEGDPFHILP AVTAYQAIYP LTEEELKALW PLIVARAVIL
VASSEQQISI DPENDYVRGN LDRERAIFDT AMSVPFELME AAILQAAGAD VATPEPSGWL
PLLPDINPAG IAYVDLGVRS PHLSAGNWLN ADMDWRLLAR AATENGTAAT RYGEYRLSRA
GTAGRQATCA LHIDICLAAG SAVAAPFAGR IGWKDQHLTL TGDGMTLHLD GLDLSVEDDA
ELAGGDALGT VVGEASSLGG LRAQLCRVAG LEPPLFATAR EAGAWSALCP SPSLLLGPGA
YAPKPETAEL LARRQAHLAR AQKNYYAAPP QIERGWKEHL FDVEGRAYLD MVNNVSILGH
GHPRLAAAIS AQWLRLNTNS RFHYAAITEF SERLAALSPD GLDTVFLVNS GSEANDLAIR
LAQAHSGARN MLCLLEAYHG WSSASDAVST SIADNPQALT TRPDWVHAVA SPNTYRGAFR
GPDTAAGYLS AITPVLEAID AGGAGLAGFI CESVYGNAGG IPLPDGYLGQ VYAQVRARGG
LCIADEVQVG YGRLGHYFWG FEQQGVVPDI ITIAKGMGNG HPLGAVITTR EIAGSLEKEG
TFFSSTGGSP VSCVAGMTVL DIMAEEKLQE NAREVGDHLK ARLAALIDRH PIAGAVHGMG
LYLGLEFVRD RTTLEPASEE TAAICERLLT LGVIMQPTGD HQNVLKIKPP LCLSIESADF
FADMLEKVLD EGW