Gene Rleg_6099 details


Gene Information

Locus tag: Rleg_6099
Symbol:
ID: 8016056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012852
Strand:
Start bp: 136401
End bp: 137561
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 63%
IMG OID: 644827405
Product: fumarylacetoacetate (FAA) hydrolase
Protein accession: YP_002978605
Protein GI: 241258721
COG category: [R] General function prediction only
COG ID: [COG3970] Fumarylacetoacetate (FAA) hydrolase family protein
TIGRFAM ID:
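
The coordinate and length fields in this record are related by simple arithmetic. The sketch below is an illustration only: the function names are hypothetical and not part of any database API. It assumes 1-based inclusive coordinates and a coding sequence whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(136401, 137561)
print(length_bp, protein_length(length_bp))
```

With the Start bp and End bp values listed for Rleg_6099, this reproduces the reported 1161 bp gene length and 386 aa protein length.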


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.59157
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCAAC CTCTGCTCGA TGTCGCCGCT TCGGATGGTC TTTTTGTCGG TCGCATCTGG 
AATCCCGAAG TGCAGGGACC GAGCATTGTG ACGCTGCGCG AGGGTATACT GGTCGACATC
ACGTCGCGCG AGGCGCCGAC GCTGAGCGCC CTGCTCGAGC GGCAGGATGC CGCCACCTTC
GTCCGTGCAG CAAGTGGCAA GGCGGTTGGC TCGCTGGCGG ACATCGCCGC CAACAGTACC
GGAGCTCCGG ATCAAACGCA CCCTTATCTC CTTGCGCCCG TCGACCTGCA GGCAGTGAAA
GCCTGCGGCG TCACCTTTGC GCAGTCGATG ATCGAGCGCG TCATCGAGGA GAAGGCGGCC
GGCAGTCCGG AGCGTGCCGC CTCGATCCGC GAGCGCGTCA GCACGCTGAT CGGTGGCAGC
CTCACCAATC TGAAGGCCGG CTCACCGGAG GCTGCCAAGG TCAAGCAGGC ACTGATCGAC
GAAGGCATGT GGTCGCAATA TCTGGAGGTC GGTATCGGGC CGGACGCCGA AGTCTTCACC
AAGTCGCCGG TGCTCTCCTC CGTCGGCTGG GGTGCGGATG TCGGCCTGCA TCCGATCTCG
ACCTGGAACA ATCCCGAGCC GGAAATCGTG CTCGCGGTCA ACAGCCGCGG CGAAATCACG
GGGGCGACTC TCGGCAACGA CGTCAACCTG CGCGACGTCG AGGGCCGCTC GGCGCTGCTG
CTCGGCAAGG CCAAGGATAA CAATGCCTCC TGCTCGATCG GTCCTTTCAT CCGCCTGTTC
GATGCCGGCT ACAGCCTCGA TGATGTACGC AAGGCCGAAC TCGACCTGAA GGTGTCAGGC
CAGGATGGCT TCGTGATGCA CGGCAAGAGT TCGATGTCGC AGATCAGTCG CGATCCGACC
GATCTCGTCA AGCAGACGGT CGGCGCCCAT CATCAATATC CCGACGGTTT CATGCTTTTC
CTCGGCACGC TGTTTGCGCC GACTCAGGAC CGCGACGCGC CGAAGCAAGG CTTTACCCAC
AAGATCGGCG ATGTCGTCGA GATTTCCTCG GCAGGCCTCG GCGCGCTCAT CAACACCGTG
CGCCTCTCCA CCGAATGCCC GCCCTGGACC TTCGGCATTT CGGCGCTGAT GAGCAATCTG
GCAAAGCGCG GTCTTCTCTA A
 
Protein sequence
MSQPLLDVAA SDGLFVGRIW NPEVQGPSIV TLREGILVDI TSREAPTLSA LLERQDAATF 
VRAASGKAVG SLADIAANST GAPDQTHPYL LAPVDLQAVK ACGVTFAQSM IERVIEEKAA
GSPERAASIR ERVSTLIGGS LTNLKAGSPE AAKVKQALID EGMWSQYLEV GIGPDAEVFT
KSPVLSSVGW GADVGLHPIS TWNNPEPEIV LAVNSRGEIT GATLGNDVNL RDVEGRSALL
LGKAKDNNAS CSIGPFIRLF DAGYSLDDVR KAELDLKVSG QDGFVMHGKS SMSQISRDPT
DLVKQTVGAH HQYPDGFMLF LGTLFAPTQD RDAPKQGFTH KIGDVVEISS AGLGALINTV
RLSTECPPWT FGISALMSNL AKRGLL
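
The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that layout, estimate GC content, and translate the gene sequence. It is an illustration under stated assumptions: the helper names are hypothetical, and the codon table is the standard one written out by hand, on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits.

```python
# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# The first three blocks of the gene sequence from the record.
first_blocks = clean_sequence("ATGTCTCAAC CTCTGCTCGA TGTCGCCGCT")
print(translate(first_blocks))
print(round(gc_content(first_blocks), 2))
```

Applied to the first three blocks of the gene sequence, `translate` yields `MSQPLLDVAA`, which matches the first block of the protein sequence listed above.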