Gene Rleg2_4079 details


Gene Information

Locus tag: Rleg2_4079
Symbol:
ID: 6982851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4255079
End bp: 4256038
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 64%
IMG OID: 643398809
Product: NUDIX hydrolase
Protein accession: YP_002283567
Protein GI: 209551650
COG category: [L] Replication, recombination and repair
COG ID: [COG2816] NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding
TIGRFAM ID:
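
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, includes_stop=True):
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(4255079, 4256038)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the values agree with the record: 960 bp and 319 aa.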


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCATT CGCTTTTCGA TTCGGATGTG CCGCATCCCG AACCCAGCAA TCTCACGGCT 
TTTGCCGCCA ACGACCTCAA TCGTGATTCC GAGCATCGCG ACGAACACTC CGTCGAAAAG
GCGCTGGCAA GGGACGGCAC CCATATCTTC GCCTTCGCTG GGGATAAGCT GGTGCTGAAG
CATGACGGCC AGGTGCTCGA TCCGCTCTTT GCCCGCTACG AGCTGAAGGA ATTGCAGCCG
GATTGGGACG AGACGGTGCT TCTCGGCTAC CGCAAATCAG GCGAGCCGCG CCTTGCCGTC
CCCGTCGGCA TCGACGTAGG GAACCTCGCC AGCCAGTATA AGCCCGCCGA CGGCCGCACG
CTGTTTCGCG AAATGCTGAT CGACGAGGTG CTGCTCGGGG AATTCGCCCA GGCCGCGAGC
CTCATCCGTT GGAATGGCGA CAACCGCTTC TGCGGTCGCT GCGGCTCGGC GATGGAGATC
CACATCGGCG GCTACAAGCG TGTCTGCGCC GCCTGCGAAC ACATGATCTT CCCGCGCACC
GACCCCGTCG TCATCATGCT GACCATCGAC GAGAGCCGCG ACCTCTGCCT GCTCGGCCGC
AGCCCGCATT TCGCGCCCGG CATGTATTCC TGTCTTGCCG GCTTCGTCGA ACCCGGCGAG
ACCATCGAAA ACGCCGTGCG CCGCGAAACG CTGGAGGAAT CGGGCATCCG CACCGGCCGC
ATCCGCTACC ACGCCTCGCA GCCCTGGCCG ATGCCGCATT CGCTGATGAT CGGCTGTTAC
GCCGAGGCCA AATCCACCGA AATCACCCGC GACGAGACGG AGCTGGAGGA TTGCCGCTGG
TTCACCCGCG AGGAAACGAT CGAGATGCTG GAACGCCCGA GCGCGACAGG CAAGGCCTCG
CCGCCGAAAG GGGCGATCGC CCACCGCTTG ATGCGCGACT GGGTGGAGTG GAAGCGGTAA
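
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any calculation such as the GC content field. A minimal sketch, assuming the sequence is pasted in as a plain string; `clean_sequence` and `gc_content` are hypothetical helper names, not part of the source database.

```python
def clean_sequence(raw):
    """Join whitespace-separated blocks into one uppercase sequence string."""
    return "".join(raw.split()).upper()


def gc_content(raw):
    """Percentage of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence only, as a small illustration.
print(gc_content("ATGAGTCATT"))
```

Applying `gc_content` to the full 960 bp sequence should give a value comparable to the GC content field in the record, provided the database computes it the same way.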
 
Protein sequence
MSHSLFDSDV PHPEPSNLTA FAANDLNRDS EHRDEHSVEK ALARDGTHIF AFAGDKLVLK 
HDGQVLDPLF ARYELKELQP DWDETVLLGY RKSGEPRLAV PVGIDVGNLA SQYKPADGRT
LFREMLIDEV LLGEFAQAAS LIRWNGDNRF CGRCGSAMEI HIGGYKRVCA ACEHMIFPRT
DPVVIMLTID ESRDLCLLGR SPHFAPGMYS CLAGFVEPGE TIENAVRRET LEESGIRTGR
IRYHASQPWP MPHSLMIGCY AEAKSTEITR DETELEDCRW FTREETIEML ERPSATGKAS
PPKGAIAHRL MRDWVEWKR
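
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an assumption-laden illustration rather than the database's own method: it uses the standard codon assignments, which translation table 11 shares (table 11 differs only in permitting alternative start codons, and this gene starts with ATG), and the `translate` function name is hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna):
    """Translate a coding sequence up to, but not including, the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGAGTCATT CGCTTTTCGA TTCGGATGTG"))  # MSHSLFDSDV
```

The output matches the first 10 residues of the protein sequence listed above.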