Gene Rleg_0936 details


Gene Information

Locus tag: Rleg_0936
Symbol:
ID: 8015498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 922455
End bp: 924020
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 64%
IMG OID: 644823520
Product: protein of unknown function DUF853 NPT hydrolase putative
Protein accession: YP_002974771
Protein GI: 241203675
COG category: [R] General function prediction only
COG ID: [COG0433] Predicted ATPase
TIGRFAM ID:
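
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a final stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above; everything else is a hypothetical helper.
START_BP = 922455
END_BP = 924020


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "1566 521"
```

Under these assumptions the result agrees with the listed Gene Length (1566 bp) and Protein Length (521 aa).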


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.222531
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGAGG ATGGCAAGAT TTTCATCGGC GCGAGCCGCA ATCCCGATGA CAGCATCAAC 
AAGCCAGAAT ATCTCGACCT GAAATTCGGC AACCGTCACG GCCTCGTCAC CGGCGCCACC
GGTACCGGCA AGACGGTGAC ACTGCAGGTG CTGGCCGAAG GCTTCTCGCG GGCCGGCGTT
CCGGTATTTG CGGCCGATAT CAAGGGCGAT CTTTCCGGCA TCGCCGCCAA GGGCGAGCCG
AAGGATTTCC TGACGAAGCG CGCCGAGCAG ATCGGTTTCA CCGACTATGA ATTCGACCAG
TTTCCGGTGA TTTTCTGGGA TCTGTTCGGC GAGAAGGGCC ACCGGGTGCG CACCACCATC
GCCGAGATGG GACCGCTGCT GCTCGCCCGT CTGATGGATG CCTCCGAACC GCAGGAAGGC
GTCATCAACA TTGCCTTCAA GATCGCCGAC CAGGGCGGGC TGCCGCTGCT CGACCTCAAG
GATTTCAGCT CGCTGCTCAA CTATATGGGC GAGAACGCCA GCCAACTTTC CAACCAGTAC
GGCCTGATCT CCAAGGCCTC GGTCGGCTCG ATCCAGCGGG CGCTGCTCGT TCTCGAACAG
CAGGGTGCGG AGCACTTCTT CGGCGAACCG GCGCTGAAGA TTTCCGACAT CATGCGCACC
AGCAACAACG GCTACGGCCA GATCTCGGTG CTGGCCGCCG ACAAGCTGAT GATGAACCCG
CGGCTTTACG CCACTTTCCT GCTCTGGCTG CTTTCCGAAC TCTTCGAGGA ACTGCCCGAG
GTGGGCGACC CCGACAAGCC GAAGCTCGTC TTCTTCTTCG ACGAGGCGCA CCTGCTCTTC
AACGATGCGC CGAAGGTGCT GACCGAACGT GTCGAGCAGG TGGTGCGGCT GATCCGTTCC
AAGGGCGTCG GCGTCTATTT CGTGACGCAG AACCCGCTCG ACGTGCCGGA AACGGTGCTC
GCCCAGCTCG GCAACCGGGC GCAGCACGCG CTTCGCGCCT ATTCGCCGCG CGAGCAGAAG
GCGGTGCGGA CGGCGGCCGA TACATTCCGC GCCAATCCGG CCTTCGATTG CGCCACCGTC
ATCACCAATC TCGGCACCGG CGAGGCGCTG GTCTCGACGC TGGAGGCCAA GGGCGCGCCT
TCGATCGTCG AGCGCACGCT GATCCGCCCA CCCTCCGGTC GCGTCGGCCC GGTGACCGAT
GACGAGCGCC GTCAGATCAT GGACAGGAGC CCGGTTCTCG GCGTCTATGA CGAGGATATC
GACCGCGAAT CCGCCTTCGA ACTGCTGGCC GCACGGGCGA AGAAGGCAGC CGATGCCGAA
GCCGCCAAAC GGGCGCAGGA AGAAGCGCCT CAGCAACAGG GCGGCACAAC CTCCGGCTGG
AACCTGCCGG GCTTCGGCGG CGGCAATGAC GACGACAACC AGGGCCGCGG CCAATCGCGC
GGCCGGACGT CCAGCTATCA GCGCGAAACG GTGGTGGAAG CGGCAATGAA GAGCGTGGCC
CGCACGGTGG CAACACAAGT CGGCCGGGCG CTGGTGCGCG GGATCTTGGG GAGCTTGAAG
CGGTAG
 
Protein sequence
MIEDGKIFIG ASRNPDDSIN KPEYLDLKFG NRHGLVTGAT GTGKTVTLQV LAEGFSRAGV 
PVFAADIKGD LSGIAAKGEP KDFLTKRAEQ IGFTDYEFDQ FPVIFWDLFG EKGHRVRTTI
AEMGPLLLAR LMDASEPQEG VINIAFKIAD QGGLPLLDLK DFSSLLNYMG ENASQLSNQY
GLISKASVGS IQRALLVLEQ QGAEHFFGEP ALKISDIMRT SNNGYGQISV LAADKLMMNP
RLYATFLLWL LSELFEELPE VGDPDKPKLV FFFDEAHLLF NDAPKVLTER VEQVVRLIRS
KGVGVYFVTQ NPLDVPETVL AQLGNRAQHA LRAYSPREQK AVRTAADTFR ANPAFDCATV
ITNLGTGEAL VSTLEAKGAP SIVERTLIRP PSGRVGPVTD DERRQIMDRS PVLGVYDEDI
DRESAFELLA ARAKKAADAE AAKRAQEEAP QQQGGTTSGW NLPGFGGGND DDNQGRGQSR
GRTSSYQRET VVEAAMKSVA RTVATQVGRA LVRGILGSLK R
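
The gene and protein sequences above are printed in space-separated blocks of ten. The following is a minimal sketch of how such a block could be cleaned, translated, and checked for GC content. It assumes the codon assignments of translation table 11, which match the standard genetic code for sense and stop codons; `clean`, `translate`, and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in T, C, A, G order at each position.
# Assumption: table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons; '*' marks a stop, 'X' an unrecognized codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence as displayed in the record.
    first_line = "ATGATCGAGG ATGGCAAGAT TTTCATCGGC GCGAGCCGCA ATCCCGATGA CAGCATCAAC"
    print(translate(first_line))  # prints "MIEDGKIFIGASRNPDDSIN"
    print(translate("CGGTAG"))    # last two codons of the gene; prints "R*"
```

The first 60 bases translate to the first 20 residues of the listed protein sequence, and the final two codons give the terminal arginine followed by a stop. Applying `gc_fraction` to the full gene sequence would allow a comparison with the GC content listed in the record.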