Gene Rleg_2474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2474 
Symbol 
ID8015689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2473400 
End bp2474911 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content66% 
IMG OID644825055 
ProductProtein of unknown function DUF1800 
Protein accessionYP_002976285 
Protein GI241205189 
COG category[S] Function unknown 
COG ID[COG5267] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGT CTTTCCCGAC CATGGCGGCG ATCCGGTTCG GCTATGGTTT CCGGCCGGGC 
GAGGCGCCGC CGAGCAGCAA GGACGAGCTC ATCGACCAGC TGCGCAAGGG GGCGGCGGCG
ACGCCGGACT TTCCCCTCGG CGGCCCCAAC ATGCGCCACC AGGCGATCCT CAGCCTGCAG
GAGCAGTTGC AGCAGATCCG ACAAGACGCC AAGACGGTGA CCGACGATAC GACGCAGCGC
GAGATGCGCA AAGGGGTGCA GCGTCAGGCG CAGCAGCAAT TCCAGCACGA TGCGAACCTG
CGGCTGATGC AGGCCGTGTT GTCGCCGTAC GGCTTCTACG AGAGGCTTTC GACCTTCTGG
ACCAATCATT TCTCCACCAG CGCCAACAAG AGCCTGCCGA TGCGCCTCAT CGTGCCGCTC
TACGAGGCCG AGGCGATCCG GCCGTTCATA TCAGGCACGT TCGGCGATCT CTTGCGCAAT
GCCACCGCCC ATCCGGCCAT GCTGATCTAT CTCGATCAGG CGGATTCGCT CGGGCCGGAT
TCGGCCGGCG GCATCAAGCG CAACAAGGGG CTCAATGAAA ATCTCGGCCG CGAACTGCTG
GAACTGCACA CGCTCGGCGC CGGCAGCGGC TATAGTCAAG CGGACGTCAC GGCGGCAGCC
ATGGTGCTGA CGGGGCTCAC CATCGACCGC AAGGAGATGG ACATCGCGTT CCGGCCGAAT
ATTTCGGAGC CCGGGACACA TGAGGTGCTC GGCGTCAGTT ATGGCGGGCG CAGGCGCTCG
CGCGACGATT ATCTCGACAT GCTCGACGAT CTCGCCCTCC ATCCGAAGAC GGCGGCGCAT
ATCAGCCGCA AGCTGGCGGT GCATTTCATC GCCGACCAGC CCGATGAGGG GATGGTGTCC
GACATGGCCG AAGCCTGGAA GAAAACGGAT GGCGACCTGA CCGCCGTCTA CACCGCCATG
CTCGACCATC CCGCCGCCTG GCGCGACGAG GGCGCCAAGG CGCGCCAGCC TTTCGACTAT
GTCGTCACCG GCCTGAGGGC GTTGAATGCG GGACCGGTCA ACGGCGTTGT CGGCAGTTTC
CTGGCGGCCA ACCAGCAGGG CACGGACGAG GGCGACATGG CGGCGAATAC GCCTGGCATG
GCTGGATCGC CGGTGACGAC CGATCCCGCC GGCGAGGCAA GGGAGAAGCG CCTCAAAGCC
TTCCAGACGG CGCGGGCGCT GGGGCAGGGG GCACTCAGGC GCATGGGCCA GCCGACCTGG
CTGCCGCCGA GTCCGGCCGG TTTCGAGGAA GGCTTCTCCG CCTGGATCAC CGGCAGCCAG
CTCGCCGAGC GGCTGGCCTG GGCAAGGCGG GCCGCAGCCC AGTTCGGCCG GGATGAGGAT
CCGCGCGAAT TCCTGAAGTC GACGCTTGCC GATGCCGCCC GAGACGAGAC GATCCGCGTG
GTGTCGCAGG CGCCGAACAA GATCAGCGGG TTGACGCTGG TGCTGGCATC GCCCGAATTC
AATCGCCGCT GA
 
Protein sequence
MSLSFPTMAA IRFGYGFRPG EAPPSSKDEL IDQLRKGAAA TPDFPLGGPN MRHQAILSLQ 
EQLQQIRQDA KTVTDDTTQR EMRKGVQRQA QQQFQHDANL RLMQAVLSPY GFYERLSTFW
TNHFSTSANK SLPMRLIVPL YEAEAIRPFI SGTFGDLLRN ATAHPAMLIY LDQADSLGPD
SAGGIKRNKG LNENLGRELL ELHTLGAGSG YSQADVTAAA MVLTGLTIDR KEMDIAFRPN
ISEPGTHEVL GVSYGGRRRS RDDYLDMLDD LALHPKTAAH ISRKLAVHFI ADQPDEGMVS
DMAEAWKKTD GDLTAVYTAM LDHPAAWRDE GAKARQPFDY VVTGLRALNA GPVNGVVGSF
LAANQQGTDE GDMAANTPGM AGSPVTTDPA GEAREKRLKA FQTARALGQG ALRRMGQPTW
LPPSPAGFEE GFSAWITGSQ LAERLAWARR AAAQFGRDED PREFLKSTLA DAARDETIRV
VSQAPNKISG LTLVLASPEF NRR