Gene Information |
Locus tag | Rleg_4495 |
Symbol | |
ID | 8015257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4626831 |
End bp | 4629752 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644827071 |
Product | hypothetical protein |
Protein accession | YP_002978272 |
Protein GI | 241207176 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases |
TIGRFAM ID | |
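The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start and End positions are 1-based and inclusive, and that the gene is an unspliced CDS whose final codon is a stop codon. The helper names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon is counted in the gene length but is not translated.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4626831, 4629752)
print(length)                     # 2922
print(protein_length_aa(length))  # 973
```

Both values agree with the Gene Length and Protein Length fields listed above.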
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACG AGGCGCTTGT TGATCGCATG GCGCTGCCGC GCCCCGATGT GACCGCCACT GATGCGGAAG AGATCCTCCT TGCTCATTAC AGCCTATCCG GCACGCTTGC CGAACTCGGC AGCCAGCAGG ATCGAAACTA TCGCGTCGAT AGCGAGAGGG GTCGCTACGT CCTGAAGATC TGCCACGCCG CCTACGATAT CCGTGAGCTC GAAGCACAGA ATGCGGCGAT CCGTCACCTC AAAAGCCGGC AAGATGCGCC GCGCGTTCCG AAGGTGATCC CCACCAATGA GGGGCGGGAG ATCGTCGTCC TTACCGTGCG CGGGCAGGGC TATCAGGTCC GGCTGCTGGA ATATCTGGAA GGCGAGGGGC TGACGGAGCT GACCTATCTA GCGCCGGCTT CCGTGGCGGC GCTGGGCGCG CTCTGCGCCA GGCTGGCGCA GGCGCTTGCC GATTTCAACC ATCCCGGTCT CGACCGCAGC CTGCAATGGG ATCTCAGACG GGCAGGTCCC GTTGCCGTGC AGCTGCTTTC GGCCATCACC GACAGTGCGG CGCGCGACCG GATCGCCAAG ACCATGGTGA TGGCCGTCCG CCGCATCCAG CCGTTGGCGC CGGCGCTTCG GCTACAGGCC GTGCATCACG ACGTCACCGG CGACAATGTC GTCGGCCACC GTGACGCCCA TGGCCATATC ATCCCCGACG GGGTGATCGA TTTCGGCGAC ATCATCCGCG GCTGGCTGGT CGGCGATCTC GCCGTCACCT GCGCTTCGCT GCTGCATCAG GCCGATGGCG ACCCCTTTCA TATTCTGCCC GCCGTCACCG CCTATCAGGC GATCTATCCG CTGAGCGAGG AGGAACTGAA GGCGCTCTGG CCGCTGATCG TCGCACGCGC GGTCATTCTC GTCGCCAGCG GCGAGCAGCA GATTTCGGTC GATCCCGACA ATGACTATGT CCGCGACAAT CTCGACCGCG AGCGGGCGAT CTTCGATACG GCGATGTCGG TTCCCTTCGA TCTCATGGAA GCCGCGATCC TCAAGGCAGC CGGCGTAGAT GTCACCGCGC CGGAAACATC GGGATGGCTG CCGTTGCTGC CCGACATCGA TCCCGCCGGG ATCGCCTATG TCGATCTTGG GGTACGGAGC CCGCATTTTT CCGCCGGCAA CTGGCTGAAT ACGGACATGG ACTGGCGGCT GCTTGCCCGC ATGGCAACCG AAAACGGTAC GGCAGCGACG CGCTACGGCG AATATCGGCT TTCCCGCGCG GGAACCGCGA AGGGACAGGC GACCTGCGCT CTGCATGTCG ACATATGCCT TGCCGCAGGC AGCGCGATTG CCGCGCCCTT TGCCGGCCGC ATCGGCTGGA AGGACCAGCA TCTGACGCTC GCCAGCGATA CGATGACCCT GCATCTCGAC GGGCTCGACC TCTCCGTCGA GGATGGAGCC GAGATCGCCG CCGGCGATTC GCTCGGCACT GTTTTCGGAG AGGCGTCGTC GCTCGGCGGG CTGCGCGTCC AGCTTTGCAG CGTCGCCGGT CTCGAACCGC CGCTTTTTGC GACGCCGCGC ACGGCGGCGG CCTGGTCGGT GCTCTGCCCT TCGCCTTCGC TGCTTCTCAG CCCGCAAGCG GATGCGCCGC AACCTGAAAC CGCCAAACTC TTCGCCAGGC GGCGGGCGCA TCTCGCAAGG CCGCAGAAGA ATTATTATGC GGCGCCGCCG CAGATCGAGC GCGGCTGGAA GGAGCATTTG TTCGATGTCG AGGGCCGCGC CTATCTCGAC ATGGTCAACA ACGTCACCAT TCTCGGCCAT 
GGCCATCCCA AGCTCGCGGC GGCGATCAGC GCGCAATGGC TGCGGCTCAA CACGAATTCG CGCTTTCACT ACGCCGCGAT TACAGAATTT TCCGAACGCC TCGCCGCCCT GTCGCCGGAT GGGCTCGATG CGGTCTTCCT GGTCAACAGC GGCTCGGAGG CGAACGATCT GGCGCTTCGG CTGGCGCAAG CCCATAGCGG CGCGCGCAAC ATGCTCTGCC TGCTCGAAGC CTATCATGGC TGGTCAGCGG CGAGCGACGC CGTCTCCACC TCGATCGCCG ACAATCCGCA GGCACCGACC ACCCGGCCGG ACTGGGTGCA TACCATCGTT TCGCCGAACA CCTATCGCGG CGACTTCCGT GGTCCCGATA CGGCAGCGGA TTATCTCGGC ATGGCGACGC CGGTGCTGGA GGCAATAGAT GCTGCGGGCG AAGGCCTTGC GGGCTTCATC GCCGAATCGG TCTACGGCAA TGCCGGCGGC ATTCCATTGC CGGAAGGCTA TCTCAAGGAG CTCTATGCGC AGGTACGCGC TCGCGGCGGC GTCTGCATCG CCGACGAAGT GCAGGTCGGC TATGCCAGGC TCGGGCACTA TTTCTGGGGC TTCGAGCAGC AGGGCGTCGT GCCTGATATC ATCACCGTCG CCAAGGGCAT GGGCAACGGC CATCCGCTCG GCGCCGTCAT CACCACGCGG GAGATTGCGC AATCGCTGGA GAAGGAGGGC ACTTTCTTTT CCTCCACCGG CGGCAGCCCC GTCAGTTGCA TCGCCGGCAT GACGGTGCTC GACATCATGG CCGAGGAAAA GTTGCAGGAA AATGCCCGGG CGGTCGGCGA TCATCTGAAG GCGCGGCTCG CCGCGCTGAT CGACCGCCAT CCGATTGCGG GCGCCGTGCA CGGCATGGGC CTCTATCTCG GCCTCGAATT CGTCCGCGAC AGAACGACGC TGGAGCCGGC GACGGAAGAG ACGGCTGCAA TCTGCGACCG GCTTCTCGAC CTCGGCGTCA TCATGCAGCC GACCGGCGAT CACCAAAATG TGCTGAAGAT CAAACCGCCG CTCTGCCTCA GCATCGACAG CGCGGATTTT TTCGCGGACA CGCTGGAGAA GGTGCTCGAA GAAGGCTGGT GA
|
Protein sequence | MTDEALVDRM ALPRPDVTAT DAEEILLAHY SLSGTLAELG SQQDRNYRVD SERGRYVLKI CHAAYDIREL EAQNAAIRHL KSRQDAPRVP KVIPTNEGRE IVVLTVRGQG YQVRLLEYLE GEGLTELTYL APASVAALGA LCARLAQALA DFNHPGLDRS LQWDLRRAGP VAVQLLSAIT DSAARDRIAK TMVMAVRRIQ PLAPALRLQA VHHDVTGDNV VGHRDAHGHI IPDGVIDFGD IIRGWLVGDL AVTCASLLHQ ADGDPFHILP AVTAYQAIYP LSEEELKALW PLIVARAVIL VASGEQQISV DPDNDYVRDN LDRERAIFDT AMSVPFDLME AAILKAAGVD VTAPETSGWL PLLPDIDPAG IAYVDLGVRS PHFSAGNWLN TDMDWRLLAR MATENGTAAT RYGEYRLSRA GTAKGQATCA LHVDICLAAG SAIAAPFAGR IGWKDQHLTL ASDTMTLHLD GLDLSVEDGA EIAAGDSLGT VFGEASSLGG LRVQLCSVAG LEPPLFATPR TAAAWSVLCP SPSLLLSPQA DAPQPETAKL FARRRAHLAR PQKNYYAAPP QIERGWKEHL FDVEGRAYLD MVNNVTILGH GHPKLAAAIS AQWLRLNTNS RFHYAAITEF SERLAALSPD GLDAVFLVNS GSEANDLALR LAQAHSGARN MLCLLEAYHG WSAASDAVST SIADNPQAPT TRPDWVHTIV SPNTYRGDFR GPDTAADYLG MATPVLEAID AAGEGLAGFI AESVYGNAGG IPLPEGYLKE LYAQVRARGG VCIADEVQVG YARLGHYFWG FEQQGVVPDI ITVAKGMGNG HPLGAVITTR EIAQSLEKEG TFFSSTGGSP VSCIAGMTVL DIMAEEKLQE NARAVGDHLK ARLAALIDRH PIAGAVHGMG LYLGLEFVRD RTTLEPATEE TAAICDRLLD LGVIMQPTGD HQNVLKIKPP LCLSIDSADF FADTLEKVLE EGW
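The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the method used to generate this record. It assumes the gene sequence is already shown in coding orientation, which is consistent with it starting with ATG and ending with TGA despite the minus strand annotation. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons. This sketch does not handle alternative start codons such as GTG or TTG, which would be read as methionine at position 1. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    # Trailing bases that do not form a full codon are ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence above.
print(translate("ATGACCGACG AGGCGCTTGT"))  # MTDEAL
print(translate("GAAGGCTGGT GA"))          # EGW
```

The two fragments match the first six and last three residues of the protein sequence listed above.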