Gene Rleg_5228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5228 
Symbol 
ID8007396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp639186 
End bp640223 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content58% 
IMG OID644822136 
Productoxidoreductase domain protein 
Protein accessionYP_002973396 
Protein GI241113561 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.596346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.783327 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTGA TCATTCTCGG AACCGGAGGA TGGGCAAACA CCCATGCCAT GAATTTTTCC 
GAAATCGCCG ACGTGAAAAT TGTTGCTGCT GTTGATACGG ACGAGGTCCG GCTACGGGCC
TTCGCGCTAA GGCATGGTAT CCCACTTACC TTCACGTCGC TTGATGACGC TCTTGCCTGG
GGAGAGTTTG ACGCCGTGAC CAATGTCACG CCGGATCGCG CGCATTATTC CACGACAATG
AAGATACTCG GCGCCGGCAA GCATGTCCTC TGCGAGAAGC CGCTGGCGGT TAACTACCGC
GAAGCCAAAG AGATGGCCGA CGCTGCCGCT GCGTCCGGCA AGGTCACGAT GGTAAACCTT
ACCTATCGTA ATGTAGCGCC GCTGCAAGCA GCGCGTAAGA TGGTGTTGGA CGGACGCCTC
GGCGCGATCC GCCACTTCGA AGCATCCTAT CTCCAGAGCT GGCTGGTCTC AAAGGCGTGG
GGCGACTGGA CCAAGGAATC GCAATGGCTT TGGCGGCTGT CGACAAAGCA CGGCTCCAAT
GGCGTGCTGG GCGATGTCGG TATCCATATT CTCGACTTCG CGGTTTTTGC CGCCGGAAGT
GACGTCAAGG CGGCTGCATC GCATCTTAAG GTGTTCGACA AGAGCCCCGG AAATCGGATC
GGCGAGTATG ATCTCGACGC CAACGACAGT TTCCTGATGA TGGCTGAACT CGAAAACGGT
GCCGCTGGCG TCATCCACGC AACGCGCTGG GCAACCGGCC ATCTGAACGA ATTGCGCCTG
CGCCTGCATG GAGACAAGGG CGCGCTGGAG GTGGTGCATA CGCCTGAAGG TTCGACGCTC
AGGGCCTGTG AAGGTCCCGA TGCCGACAAG GCAATCTGGC GTAAGATCGA CGTCGAACCG
GTTATCACTA ATTTCCAGCG CTTTGCAAAC GCCGTGCAAA AGGGGCAGCT GGATGAGCCT
GGTTTTGGCC ACGCAGCCAA GCTGCAATTC GTTCTTGACC ACGCAGTGAA GACGGCCGGC
GCTCTGATCG AACTTTAA
 
Protein sequence
MRLIILGTGG WANTHAMNFS EIADVKIVAA VDTDEVRLRA FALRHGIPLT FTSLDDALAW 
GEFDAVTNVT PDRAHYSTTM KILGAGKHVL CEKPLAVNYR EAKEMADAAA ASGKVTMVNL
TYRNVAPLQA ARKMVLDGRL GAIRHFEASY LQSWLVSKAW GDWTKESQWL WRLSTKHGSN
GVLGDVGIHI LDFAVFAAGS DVKAAASHLK VFDKSPGNRI GEYDLDANDS FLMMAELENG
AAGVIHATRW ATGHLNELRL RLHGDKGALE VVHTPEGSTL RACEGPDADK AIWRKIDVEP
VITNFQRFAN AVQKGQLDEP GFGHAAKLQF VLDHAVKTAG ALIEL