Gene Rleg_4495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4495 
Symbol 
ID8015257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4626831 
End bp4629752 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content64% 
IMG OID644827071 
Producthypothetical protein 
Protein accessionYP_002978272 
Protein GI241207176 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG AGGCGCTTGT TGATCGCATG GCGCTGCCGC GCCCCGATGT GACCGCCACT 
GATGCGGAAG AGATCCTCCT TGCTCATTAC AGCCTATCCG GCACGCTTGC CGAACTCGGC
AGCCAGCAGG ATCGAAACTA TCGCGTCGAT AGCGAGAGGG GTCGCTACGT CCTGAAGATC
TGCCACGCCG CCTACGATAT CCGTGAGCTC GAAGCACAGA ATGCGGCGAT CCGTCACCTC
AAAAGCCGGC AAGATGCGCC GCGCGTTCCG AAGGTGATCC CCACCAATGA GGGGCGGGAG
ATCGTCGTCC TTACCGTGCG CGGGCAGGGC TATCAGGTCC GGCTGCTGGA ATATCTGGAA
GGCGAGGGGC TGACGGAGCT GACCTATCTA GCGCCGGCTT CCGTGGCGGC GCTGGGCGCG
CTCTGCGCCA GGCTGGCGCA GGCGCTTGCC GATTTCAACC ATCCCGGTCT CGACCGCAGC
CTGCAATGGG ATCTCAGACG GGCAGGTCCC GTTGCCGTGC AGCTGCTTTC GGCCATCACC
GACAGTGCGG CGCGCGACCG GATCGCCAAG ACCATGGTGA TGGCCGTCCG CCGCATCCAG
CCGTTGGCGC CGGCGCTTCG GCTACAGGCC GTGCATCACG ACGTCACCGG CGACAATGTC
GTCGGCCACC GTGACGCCCA TGGCCATATC ATCCCCGACG GGGTGATCGA TTTCGGCGAC
ATCATCCGCG GCTGGCTGGT CGGCGATCTC GCCGTCACCT GCGCTTCGCT GCTGCATCAG
GCCGATGGCG ACCCCTTTCA TATTCTGCCC GCCGTCACCG CCTATCAGGC GATCTATCCG
CTGAGCGAGG AGGAACTGAA GGCGCTCTGG CCGCTGATCG TCGCACGCGC GGTCATTCTC
GTCGCCAGCG GCGAGCAGCA GATTTCGGTC GATCCCGACA ATGACTATGT CCGCGACAAT
CTCGACCGCG AGCGGGCGAT CTTCGATACG GCGATGTCGG TTCCCTTCGA TCTCATGGAA
GCCGCGATCC TCAAGGCAGC CGGCGTAGAT GTCACCGCGC CGGAAACATC GGGATGGCTG
CCGTTGCTGC CCGACATCGA TCCCGCCGGG ATCGCCTATG TCGATCTTGG GGTACGGAGC
CCGCATTTTT CCGCCGGCAA CTGGCTGAAT ACGGACATGG ACTGGCGGCT GCTTGCCCGC
ATGGCAACCG AAAACGGTAC GGCAGCGACG CGCTACGGCG AATATCGGCT TTCCCGCGCG
GGAACCGCGA AGGGACAGGC GACCTGCGCT CTGCATGTCG ACATATGCCT TGCCGCAGGC
AGCGCGATTG CCGCGCCCTT TGCCGGCCGC ATCGGCTGGA AGGACCAGCA TCTGACGCTC
GCCAGCGATA CGATGACCCT GCATCTCGAC GGGCTCGACC TCTCCGTCGA GGATGGAGCC
GAGATCGCCG CCGGCGATTC GCTCGGCACT GTTTTCGGAG AGGCGTCGTC GCTCGGCGGG
CTGCGCGTCC AGCTTTGCAG CGTCGCCGGT CTCGAACCGC CGCTTTTTGC GACGCCGCGC
ACGGCGGCGG CCTGGTCGGT GCTCTGCCCT TCGCCTTCGC TGCTTCTCAG CCCGCAAGCG
GATGCGCCGC AACCTGAAAC CGCCAAACTC TTCGCCAGGC GGCGGGCGCA TCTCGCAAGG
CCGCAGAAGA ATTATTATGC GGCGCCGCCG CAGATCGAGC GCGGCTGGAA GGAGCATTTG
TTCGATGTCG AGGGCCGCGC CTATCTCGAC ATGGTCAACA ACGTCACCAT TCTCGGCCAT
GGCCATCCCA AGCTCGCGGC GGCGATCAGC GCGCAATGGC TGCGGCTCAA CACGAATTCG
CGCTTTCACT ACGCCGCGAT TACAGAATTT TCCGAACGCC TCGCCGCCCT GTCGCCGGAT
GGGCTCGATG CGGTCTTCCT GGTCAACAGC GGCTCGGAGG CGAACGATCT GGCGCTTCGG
CTGGCGCAAG CCCATAGCGG CGCGCGCAAC ATGCTCTGCC TGCTCGAAGC CTATCATGGC
TGGTCAGCGG CGAGCGACGC CGTCTCCACC TCGATCGCCG ACAATCCGCA GGCACCGACC
ACCCGGCCGG ACTGGGTGCA TACCATCGTT TCGCCGAACA CCTATCGCGG CGACTTCCGT
GGTCCCGATA CGGCAGCGGA TTATCTCGGC ATGGCGACGC CGGTGCTGGA GGCAATAGAT
GCTGCGGGCG AAGGCCTTGC GGGCTTCATC GCCGAATCGG TCTACGGCAA TGCCGGCGGC
ATTCCATTGC CGGAAGGCTA TCTCAAGGAG CTCTATGCGC AGGTACGCGC TCGCGGCGGC
GTCTGCATCG CCGACGAAGT GCAGGTCGGC TATGCCAGGC TCGGGCACTA TTTCTGGGGC
TTCGAGCAGC AGGGCGTCGT GCCTGATATC ATCACCGTCG CCAAGGGCAT GGGCAACGGC
CATCCGCTCG GCGCCGTCAT CACCACGCGG GAGATTGCGC AATCGCTGGA GAAGGAGGGC
ACTTTCTTTT CCTCCACCGG CGGCAGCCCC GTCAGTTGCA TCGCCGGCAT GACGGTGCTC
GACATCATGG CCGAGGAAAA GTTGCAGGAA AATGCCCGGG CGGTCGGCGA TCATCTGAAG
GCGCGGCTCG CCGCGCTGAT CGACCGCCAT CCGATTGCGG GCGCCGTGCA CGGCATGGGC
CTCTATCTCG GCCTCGAATT CGTCCGCGAC AGAACGACGC TGGAGCCGGC GACGGAAGAG
ACGGCTGCAA TCTGCGACCG GCTTCTCGAC CTCGGCGTCA TCATGCAGCC GACCGGCGAT
CACCAAAATG TGCTGAAGAT CAAACCGCCG CTCTGCCTCA GCATCGACAG CGCGGATTTT
TTCGCGGACA CGCTGGAGAA GGTGCTCGAA GAAGGCTGGT GA
 
Protein sequence
MTDEALVDRM ALPRPDVTAT DAEEILLAHY SLSGTLAELG SQQDRNYRVD SERGRYVLKI 
CHAAYDIREL EAQNAAIRHL KSRQDAPRVP KVIPTNEGRE IVVLTVRGQG YQVRLLEYLE
GEGLTELTYL APASVAALGA LCARLAQALA DFNHPGLDRS LQWDLRRAGP VAVQLLSAIT
DSAARDRIAK TMVMAVRRIQ PLAPALRLQA VHHDVTGDNV VGHRDAHGHI IPDGVIDFGD
IIRGWLVGDL AVTCASLLHQ ADGDPFHILP AVTAYQAIYP LSEEELKALW PLIVARAVIL
VASGEQQISV DPDNDYVRDN LDRERAIFDT AMSVPFDLME AAILKAAGVD VTAPETSGWL
PLLPDIDPAG IAYVDLGVRS PHFSAGNWLN TDMDWRLLAR MATENGTAAT RYGEYRLSRA
GTAKGQATCA LHVDICLAAG SAIAAPFAGR IGWKDQHLTL ASDTMTLHLD GLDLSVEDGA
EIAAGDSLGT VFGEASSLGG LRVQLCSVAG LEPPLFATPR TAAAWSVLCP SPSLLLSPQA
DAPQPETAKL FARRRAHLAR PQKNYYAAPP QIERGWKEHL FDVEGRAYLD MVNNVTILGH
GHPKLAAAIS AQWLRLNTNS RFHYAAITEF SERLAALSPD GLDAVFLVNS GSEANDLALR
LAQAHSGARN MLCLLEAYHG WSAASDAVST SIADNPQAPT TRPDWVHTIV SPNTYRGDFR
GPDTAADYLG MATPVLEAID AAGEGLAGFI AESVYGNAGG IPLPEGYLKE LYAQVRARGG
VCIADEVQVG YARLGHYFWG FEQQGVVPDI ITVAKGMGNG HPLGAVITTR EIAQSLEKEG
TFFSSTGGSP VSCIAGMTVL DIMAEEKLQE NARAVGDHLK ARLAALIDRH PIAGAVHGMG
LYLGLEFVRD RTTLEPATEE TAAICDRLLD LGVIMQPTGD HQNVLKIKPP LCLSIDSADF
FADTLEKVLE EGW