Gene Rleg_1198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1198 
Symbol 
ID8012307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1173742 
End bp1174785 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content59% 
IMG OID644823782 
Productaldo/keto reductase 
Protein accessionYP_002975032 
Protein GI241203936 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.5324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.017013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTACA ATTCGCTAGG CCGCACCGAG ATTTCCGTTT CAGAGATTTG CCTTGGCACC 
ATGACCTGGG GTTCGCAGAA CAGCGAAGCC GATGCTCATG CGCAGATGGA CTACGCCGTC
GAAAAGGGCG TCAATTTCTT CGATACGGCC GAACTTTATC CGACCACCCC GATTTCGGCC
GCTACGCAGG GCTGGACGGA AGACTATATC GGCAGCTGGT TCAAGAAGAC CGGCAAGCGC
GGCGATATCG TGCTCGCCAC CAAGGTCGCC GGCCGCGGCC GCGACTATAT ACGTGGTGGC
GAAGGTGCCG ATGCAAAGAA TATCCGCCTG GCGCTCGAAG CCAGCCTGGC GCGGCTGAAG
ACGGATTACG TCGACCTCTA CCAGATCCAC TGGCCGAACC GCGGCCATTT CCATTTCCGT
CAGAATTGGA GCTACAATCC CTTCAACCAG AACCGCGACG AGGCCGTCGC CAATATGCTC
GACATCCTGG AAACGCTTGG CGTGCTGGTG AAGGAAGGCA AGATCCGCGC GATCGGCCTT
TCCAACGAAA CCACCTGGGG CATACAGAAA TATCTGACGC TCGCCGAACA GAAGAGCCTG
CCGCGGGTCG CCTGCGTCCA GAACGAATAC AACTTGCTCT ACCGCCATTT CGACCTCGAT
CTCGCCGAAC TCTCGCATCA TGAGGATGTC GGGCTGCTCG CCTATTCTCC GCTCGCCGGC
GGCATCCTCT CCGGAAAATA TGTCGATGGC GGCAGGCCGA AGGGTTCGCG CGGCTCGATC
AACCACGATA TCGGCGGTCG CCTGCAGCCG CTACAGGAGC CGGCGACCAA AGCCTATCTG
GAGATCGCTG CAACATACCG CCTCGACCCG GCAGCAATGG CGCTCGCCTT CTGCCTTTCC
AGGCCCTTCA TGGCCTCGGC CATCATCGGC GCGACCTCGA TGGAGCAGTT GAAAATCGAT
ATCGGCGCGG CCGACATTAC GCTTTCGAAC GAGATCCTGG CGGAAATCGC CAAGGTGCAC
CGGCAGTATC CGCTGACGCT TTGA
 
Protein sequence
MKYNSLGRTE ISVSEICLGT MTWGSQNSEA DAHAQMDYAV EKGVNFFDTA ELYPTTPISA 
ATQGWTEDYI GSWFKKTGKR GDIVLATKVA GRGRDYIRGG EGADAKNIRL ALEASLARLK
TDYVDLYQIH WPNRGHFHFR QNWSYNPFNQ NRDEAVANML DILETLGVLV KEGKIRAIGL
SNETTWGIQK YLTLAEQKSL PRVACVQNEY NLLYRHFDLD LAELSHHEDV GLLAYSPLAG
GILSGKYVDG GRPKGSRGSI NHDIGGRLQP LQEPATKAYL EIAATYRLDP AAMALAFCLS
RPFMASAIIG ATSMEQLKID IGAADITLSN EILAEIAKVH RQYPLTL