Gene Rleg2_2276 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2276 
Symbol 
ID6981015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2333382 
End bp2334662 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content65% 
IMG OID643396992 
Productprotein of unknown function DUF442 
Protein accessionYP_002281780 
Protein GI209549863 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0491] Zn-dependent hydrolases, including glyoxylases
[COG3453] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01244] conserved hypothetical protein TIGR01244 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCCG TAAGGGTCAA TGAGCTGATA TCGGTGGCGG GGCAGCCCGA TGCCGCCGGT 
TTTGCTGCTT TTGCGGCAGA CGGTTTTGCC GGCGTCATCA ATGCGCGGCC GGATGGCGAG
GAGCCGGGGC AGCCGGGCGA TGCGGCGGAA AAGGCTTCTG CCGCCGCCGC GGGGCTCGCC
TACAGCTTCG TGCCGGTGAA AGGGGCCGAG ATCACCGAGG CCGATATCCG CGCCTTCCAG
AGGGCGATGA TGCAAGCGAA GGGGCCGGTC GTCGCTCATT GCAAGAGCGG CACGCGGGCG
CTGACGCTCT ATGCGCTCGG AGAGGTGCTC GACGGGCGGA TGCAGCCCGG GGACGTCGAG
GCCTTCGGTC AAAACCTCGG CTTCGATCTT GCCGGCGCAC GCCGCTGGCT GGAAAAGCGG
GCAGGGCAGA CGGCTGTCGT GAAGGCCTTC TTCGAACCCC GCACCTGCAG TGTGCAATAT
GTCGTTTCAG ACCCGGTCAC GAAACGCTGC GCCATCATCG ACCCGGTGCT CGATTTCGAC
GAGATGTCGG GTGCGACGGG AACGACCAAT GCCGACGCCA TCCTCGCGCA TATCGACAGC
GAAGGGCTGA CGGTCGAGTG GGTCCTCGAC ACGCATCCGC ATGCCGATCA TTTCTCCGCC
GCGCACTATC TCAAGGGAAA GACCGGCGCG CCGACAGCGA TCGGCGCCCA TGTCACCGAG
GTCCAGCTGC TCTGGAAGGA AATCTACAAC TGGCCGGCAC TCGAAACCGA CGGCTCGCAA
TGGAACCGGC TGTTTGCCGA GGGCGACACT TTCGAGATCG GCGGGCTTGA AGCCCGTGTG
ATGTTCTCGC CCGGCCATAC GCTTGCCTCG GTGACCTATG TGATCGGTGA CGCCGCCTTC
GTGCACGACA CCGTATTCAC CCCGGATTCC GGCACGGCGC GCACTGATTT TCCCGGCGGC
AGCGCCAGCG CCCTCTGGCA CTCGATCCAG GCCATCCTGT CGCTGCCCGA GGAGACCCGG
CTCTTTTCCG GCCACGACTA TCAGCCTGGC GGTCGCCATC CCCGCTGGGA AAGCACGGTG
ACCGCCCAGA AGACCGCCAA TCCGCATATA TCGGGTATCG ATGAGGCCGG CTTTGTGACG
CTGCGCCAGG CGCGCGACCG TACGCTACCG AAACCGAAGC TGATGCTGCA CGCGCTACAG
GTGAATATTC GGGGCGGACG GCTGCCCGAG CCGGAGGAGA ACGGCCGGCG CTATCTGAAA
ATCCCGCTGG ATGCGTTGTA G
 
Protein sequence
MTSVRVNELI SVAGQPDAAG FAAFAADGFA GVINARPDGE EPGQPGDAAE KASAAAAGLA 
YSFVPVKGAE ITEADIRAFQ RAMMQAKGPV VAHCKSGTRA LTLYALGEVL DGRMQPGDVE
AFGQNLGFDL AGARRWLEKR AGQTAVVKAF FEPRTCSVQY VVSDPVTKRC AIIDPVLDFD
EMSGATGTTN ADAILAHIDS EGLTVEWVLD THPHADHFSA AHYLKGKTGA PTAIGAHVTE
VQLLWKEIYN WPALETDGSQ WNRLFAEGDT FEIGGLEARV MFSPGHTLAS VTYVIGDAAF
VHDTVFTPDS GTARTDFPGG SASALWHSIQ AILSLPEETR LFSGHDYQPG GRHPRWESTV
TAQKTANPHI SGIDEAGFVT LRQARDRTLP KPKLMLHALQ VNIRGGRLPE PEENGRRYLK
IPLDAL