Gene Rleg2_0823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0823 
Symbol 
ID6979541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp840209 
End bp841777 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content65% 
IMG OID643395534 
Productprotein of unknown function DUF853 NPT hydrolase putative 
Protein accessionYP_002280343 
Protein GI209548426 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.953941 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.874075 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGG ATGGCAAGAT TTTCATCGGC GCGAGCCGCA ATCCCGACGA CAGCATCAAC 
AAGCCTGAAT ATCTCGATCT GAAATTCGGC AACCGCCACG GCCTCGTCAC CGGCGCCACC
GGCACCGGCA AGACCGTGAC GCTGCAGGTG CTGGCCGAGG GCTTCTCCCG CGCCGGCGTC
CCGGTTTTTG CCGCCGACAT CAAGGGCGAT CTTTCCGGCA TCGCCGCCAA GGGCGAGGCC
AAGGATTTCC TCACCAAGCG GGCCGAGCAG ATCGGCTTCA CCGACTATGA ATTCGATCAG
TTTCCGGTGA TTTTCTGGGA TCTGTTCGGC GAGAAGGGCC ACCGGGTGCG CACCACCGTC
GCCGAGATGG GGCCGCTGCT GCTCGCCCGG CTGATGGATG CCTCCGAACC GCAGGAGGGC
GTCATCAACA TCGCCTTCAA GATCGCCGAC CAGGCCGGGC TGCCGCTGCT CGATCTCAAG
GATTTCACCT CGCTGCTGAA TTATATGGGC GAGAATGCCG GCGAGCTTTC CAATCAGTAC
GGGCTGATCT CCAAAGCCTC GGTCGGCTCG ATCCAGCGGG CGCTGCTGGT TCTCGAACAG
CAGGGTGCGG AACATTTCTT CGGCGAGCCG GCGCTGAAGA TTTCCGACAT CATGCGCACC
AGCAATAACG GCTACGGCCA GATCTCGGTG CTTGCCGCCG ACAAGCTGAT GATGAACCCG
CGCCTTTACG CCACCTTCCT GCTCTGGCTG CTGTCGGAGC TGTTCGAGGA ACTGCCCGAA
GTCGGCGATC CGGACAAGCC GAAGCTAGTG TTCTTTTTCG ACGAAGCGCA TCTGCTCTTC
AACGACGCGC CGAAGGTGCT GACCGAACGG GTCGAGCAGG TGGTGCGGCT GATACGCTCC
AAGGGAGTCG GCGTTTATTT CGTGACGCAG AACCCGCTCG ACGTGCCGGA AACGGTGCTC
GCCCAGCTCG GCAACCGCGC CCAGCATGCG CTGCGCGCCT ATTCGCCGCG CGAGCAGAAG
GCGGTGAGGA CCGCGGCCGA TACCTTCCGC CCCAACCCGG CCTTCGATTG CGCCACCGTC
ATCACCAATC TCGGCACCGG CGAGGCGCTG GTCTCGACGC TCGAAGCCAA GGGCGCGCCG
TCGATCGTCG AGCGCACGCT GATCCGCCCG CCATCCGGCC GCGTCGGGCC GGTGACGGAT
GCCGAGCGCC AGCAGATCAT GGACAGGAGC CCGGTTCTCG GCGTCTATGA CGAGGACATC
GACCGCGAAT CCGCCTTCGA GATATTGGCG GCCCGCGCCA AGAAGGCAGC CGATGCCGAT
GCCGCCAAAC GGGCGCAGGA CGAAGCCGCA GAACAGCAGC CGGGCACCAC CACCTCCGGC
TGGAGCTTGC CGGGCTTCGG CGGCAGCAAT GACGACAACC AGCAGGGCCG CGGCCAATCA
CGCGGCCGGT CTTCCGGCTA CCAGCGCGAA ACGGTGGTGG AAGCGGCGAT GAAGAGCGTG
GCACGCACAG TGGCGACCCA GGTCGGCCGG GCGCTGGTGC GCGGGATCTT GGGAAGCTTG
AAGCGGTAG
 
Protein sequence
MIEDGKIFIG ASRNPDDSIN KPEYLDLKFG NRHGLVTGAT GTGKTVTLQV LAEGFSRAGV 
PVFAADIKGD LSGIAAKGEA KDFLTKRAEQ IGFTDYEFDQ FPVIFWDLFG EKGHRVRTTV
AEMGPLLLAR LMDASEPQEG VINIAFKIAD QAGLPLLDLK DFTSLLNYMG ENAGELSNQY
GLISKASVGS IQRALLVLEQ QGAEHFFGEP ALKISDIMRT SNNGYGQISV LAADKLMMNP
RLYATFLLWL LSELFEELPE VGDPDKPKLV FFFDEAHLLF NDAPKVLTER VEQVVRLIRS
KGVGVYFVTQ NPLDVPETVL AQLGNRAQHA LRAYSPREQK AVRTAADTFR PNPAFDCATV
ITNLGTGEAL VSTLEAKGAP SIVERTLIRP PSGRVGPVTD AERQQIMDRS PVLGVYDEDI
DRESAFEILA ARAKKAADAD AAKRAQDEAA EQQPGTTTSG WSLPGFGGSN DDNQQGRGQS
RGRSSGYQRE TVVEAAMKSV ARTVATQVGR ALVRGILGSL KR