Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4205 |
Symbol | |
ID | 6982978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4379897 |
End bp | 4382818 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643398936 |
Product | hypothetical protein |
Protein accession | YP_002283693 |
Protein GI | 209551776 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACG AGGCGCTTGT TGATCGCATG TCGCTGCCGC GTCCCAACGT GACCGTTGCC GACGCGGAAG AGATCCTTCT TGCCCATTAC AGCCTGTCAG GCACCATCGC CGAACTCGGC AGTCAGCAGG ATCGGAACTA CCGCGTCGAT AGCGATCGCG GCCGCTACGT CCTGAAGATC TGCCATGCTG CCTACGAGAC CCGCGAACTC GAAGCGCAGA ATGCCGCGAT CCATCACCTC AAGAGCAAGC AGGATGCGCC ACGCGTTCCC AATGTGATCG CCAGCAATGA GGGGCGGGAG ATCGTCGTGC TGACGGTGCG CGGGCAAGGC TACCAGGTCC GGCTGCTGGA ATATCTGGAA GGTCAGGGGC TGACGGAACT GACCTACCTG GCGCCGGCTT CCGTTGCGGC GCTCGGCGCG CTCTGCGCCA AGCTGGCGCA GGCGCTTGCC GATTTCAACC ATCCCGGCCT CGACCGCAGC CTCCAATGGG ATCTGAGGCG GGCCGGGCCG GTCGCGGTGC AGCTGCTTTC CGCCATCACC GACAGTGCCG CGCGCGACCG GATCGCCAAG ACCATGGTGA TGGCCGTTCG CCGCATCCAG CCCTTGGCGC CGTCGCTTCG GCTGCAGGCG GTGCATCACG ACGTCACCGG CGACAATGTC GTCGGTCATC GCAGTGCCCA TGGCCATATC ATCCCCGACG GAGTGATCGA TTTCGGCGAC ATCATCCGCG GCTGGCTGGT CGGCGACCTC GCCGTCACCT GCGCCTCGCT GCTGCATCAC GCAGAGGGCG ACCCCTTTCA TATCCTGCCG GCCGTCACTG CCTATCAGGC AATCTATCCG CTGACCGAGG AGGAATTGAA GGCGCTGTGG CCATTGATCG TCGCACGCGC GGTCATTCTC GTCGCCAGCA GCGAGCAGCA GATTTCGATC GATCCCGAGA ATGACTATGT CCGCGGCAAT CTCGACCGCG AGCGGGCGAT CTTCGATACG GCGATGTCCG TTCCCTTCGA ATTGATGGAA GCCGCAATCC TCCAGGCAGC CGGCGCGGAT GTCGCCACCC CGGAACCATC GGGATGGCTG CCGCTGCTGC CTGACATCAA TCCCGCCGGG ATCGCCTATG TCGATCTTGG GGTGCGGAGC CCGCATCTTT CCGCCGGCAA CTGGCTGAAT GCCGACATGG ACTGGCGGCT GCTAGCCCGC GCTGCGACCG AAAACGGCAC GGCAGCGACG CGCTACGGCG AATATCGGCT TTCCCGGGCG GGGACCGCCG GAAGACAGGC GACCTGCGCT CTCCATATCG ATATATGCCT TGCCGCCGGC AGCGCGGTCG CAGCACCCTT TGCCGGCCGT ATCGGCTGGA AGGACCAGCA TCTGACACTG ACCGGCGACG GCATGACCCT GCACCTCGAC GGGCTCGACC TCTCGGTCGA GGATGACGCC GAGCTTGCCG GCGGCGACGC GCTCGGCACG GTTGTCGGCG AGGCTTCGTC GCTCGGCGGG CTGCGCGCCC AGCTTTGCCG CGTCGCGGGG CTCGAGCCGC CGCTCTTTGC CACAGCGCGC GAGGCGGGGG CCTGGTCGGC GCTCTGCCCT TCTCCCTCGC TGCTTCTCGG CCCTGGAGCC TATGCGCCGA AACCTGAAAC CGCCGAGCTG CTCGCCAGGC GGCAGGCACA TCTGGCGAGG GCGCAGAAGA ATTATTACGC GGCGCCGCCG CAGATCGAGC GCGGCTGGAA GGAGCATCTC TTCGATGTCG AGGGCCGCGC CTATCTCGAC ATGGTCAACA ACGTCTCCAT TCTCGGCCAT GGCCATCCCA GGCTTGCGGC GGCGATCAGC GCCCAGTGGC TGCGGCTCAA CACCAACTCA CGCTTCCACT ATGCCGCGAT CACGGAATTT TCCGAGCGGC TCGCAGCACT TTCGCCCGAC GGGCTCGATA CGGTCTTCCT GGTCAACAGC GGCTCGGAGG CGAACGATCT GGCGATCCGG CTGGCGCAGG CTCATAGCGG CGCGCGCAAC ATGCTTTGCC TGCTCGAAGC CTATCATGGC TGGTCGTCGG CAAGTGACGC CGTCTCCACC TCGATCGCCG ACAATCCGCA GGCGCTGACC ACCCGGCCGG ACTGGGTGCA TGCCGTCGCG TCGCCGAATA CCTATCGCGG CGCATTCCGG GGTCCCGATA CGGCGGCCGG CTATCTCAGC GCGATAACGC CGGTGCTGGA GGCGATCGAC GCCGGCGGCG CGGGCCTTGC CGGCTTCATC TGCGAATCGG TCTACGGCAA TGCCGGCGGC ATTCCGCTGC CGGATGGGTA TCTCGGCCAG GTCTATGCGC AGGTGCGCGC CCGCGGCGGC CTCTGTATCG CCGATGAAGT GCAGGTCGGT TATGGCAGGC TCGGCCATTA TTTCTGGGGC TTCGAACAGC AAGGGGTCGT GCCCGACATC ATCACCATCG CCAAGGGCAT GGGCAACGGC CATCCGCTCG GCGCGGTCAT CACCACCCGG GAAATCGCCG GGTCGCTGGA GAAGGAGGGC ACGTTCTTTT CCTCCACCGG CGGCAGCCCG GTCAGTTGCG TCGCCGGCAT GACGGTGCTC GACATCATGG CCGAGGAAAA GCTGCAGGAA AATGCCCGAG AGGTCGGCGA TCATCTGAAG GCGCGGCTTG CCGCGCTGAT CGACCGCCAT CCGATCGCCG GCGCCGTGCA CGGCATGGGG CTCTATCTCG GGCTCGAATT CGTCCGCGAC AGAACGACGC TGGAGCCGGC GAGCGAAGAG ACGGCAGCAA TCTGCGAGCG GTTGCTGACC CTCGGCGTCA TCATGCAGCC GACCGGCGAT CACCAAAATG TGCTGAAGAT CAAACCGCCC CTCTGCCTCA GCATCGAAAG CGCGGATTTC TTCGCGGACA TGCTGGAAAA GGTGCTCGAT GAAGGCTGGT GA
|
Protein sequence | MTDEALVDRM SLPRPNVTVA DAEEILLAHY SLSGTIAELG SQQDRNYRVD SDRGRYVLKI CHAAYETREL EAQNAAIHHL KSKQDAPRVP NVIASNEGRE IVVLTVRGQG YQVRLLEYLE GQGLTELTYL APASVAALGA LCAKLAQALA DFNHPGLDRS LQWDLRRAGP VAVQLLSAIT DSAARDRIAK TMVMAVRRIQ PLAPSLRLQA VHHDVTGDNV VGHRSAHGHI IPDGVIDFGD IIRGWLVGDL AVTCASLLHH AEGDPFHILP AVTAYQAIYP LTEEELKALW PLIVARAVIL VASSEQQISI DPENDYVRGN LDRERAIFDT AMSVPFELME AAILQAAGAD VATPEPSGWL PLLPDINPAG IAYVDLGVRS PHLSAGNWLN ADMDWRLLAR AATENGTAAT RYGEYRLSRA GTAGRQATCA LHIDICLAAG SAVAAPFAGR IGWKDQHLTL TGDGMTLHLD GLDLSVEDDA ELAGGDALGT VVGEASSLGG LRAQLCRVAG LEPPLFATAR EAGAWSALCP SPSLLLGPGA YAPKPETAEL LARRQAHLAR AQKNYYAAPP QIERGWKEHL FDVEGRAYLD MVNNVSILGH GHPRLAAAIS AQWLRLNTNS RFHYAAITEF SERLAALSPD GLDTVFLVNS GSEANDLAIR LAQAHSGARN MLCLLEAYHG WSSASDAVST SIADNPQALT TRPDWVHAVA SPNTYRGAFR GPDTAAGYLS AITPVLEAID AGGAGLAGFI CESVYGNAGG IPLPDGYLGQ VYAQVRARGG LCIADEVQVG YGRLGHYFWG FEQQGVVPDI ITIAKGMGNG HPLGAVITTR EIAGSLEKEG TFFSSTGGSP VSCVAGMTVL DIMAEEKLQE NAREVGDHLK ARLAALIDRH PIAGAVHGMG LYLGLEFVRD RTTLEPASEE TAAICERLLT LGVIMQPTGD HQNVLKIKPP LCLSIESADF FADMLEKVLD EGW
|
| |