Gene Rleg_1114 details

Gene Information

Locus tag: Rleg_1114
Symbol:
ID: 8012235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1095775
End bp: 1097277
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 62%
IMG OID: 644823697
Product: AMP nucleosidase
Protein accession: YP_002974948
Protein GI: 241203852
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0775] Nucleoside phosphorylase
TIGRFAM ID: [TIGR01717] AMP nucleosidase
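
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1095775, 1097277)
print(length)                     # 1503
print(protein_length_aa(length))  # 500
```

Both results agree with the Gene Length and Protein Length values listed in the record.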


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0677524
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAAC GAATCCCCCC TTTGTCGCCC CTCGACATTT CCTCTCCACC GCCTTTCCAG 
CCTGAGAGTT TCGATGATCC CGCCAAGGCG GTCGAGGCGC TGACGGCGCT TTACGAGCGC
AATACGGCGT TTCTCATCCA GAGCTTCGCC GAACTCGCAC AGGGCGCGCC GATCTCCTCG
CGCTACCGCG CTTTTTATCC GCAGGTCAGC ATCGAGACGA CAAGCTTCGG CCATGTCGAT
TCGCGGCTTT CCTATGGTCA TGTCACGGCA CCCGGCATCT ATACGACGAC CGTCACCCGG
CCGAAGCTCT TCAAGCATTA CCTGAAAGAG CAGCTGGCGC TGCTGGTGAA GAGCCACAAT
GTGCCTGTCA TCGTCTCGGA ATCGACGACG CCGATCCCCT TGCACTTCGC CTTCGGCGAG
GGTGCGCATG TGGAAGCATC GACCAACGCC TTCATCGACG TTCCGATGCG CGATATCTTC
GACACACCCG ATCTCAACAC CACCGACGAC GAGATCGCCA ACGGCGAGTA TATCCCGCCG
CCTGGAGAGC CCTCGCCGCT TGCGCCATTT ACCGCACAAC GCATCGACTA CTCGCTCGCC
CGACTGTCGC ATTATACGGC GACGCATGCC GAGCATTTCC AGAATTTCGT GCTGTTTACG
AACTACCAGT TCTATATCGA CGAATTCTGC AGCTGGGCGC GCAAGCTGAT GGCCGAGGGC
GGCGACGGCT ATACTGCCTT CGTCGAGCCC GGCAATGTCG TCACCCTGCC CGGCTCGAAC
GCGCCGGAAA CGGATTCCGC CCTCACCCGC CTGCCACAGA TGCCAGCCTA CCATCTGAAG
AAGAAGGGCC ATGCCGGGAT CACCATGATC AATATCGGCG TCGGCCCCTC CAACGCCAAG
ACGATCACCG ACCACGTCGC CGTGCTGCGC CCGCATGCCT GGCTGATGCT CGGCCATTGC
GCCGGTCTTC GCAACAGCCA GCGGCTCGGC GACTATGTAC TCGCCCATGC CTATATGCGC
GAGGACCATG TCCTCGACGA CGACCTGCCG GTCTGGGTGC CGATCCCGGC GCTGGCCGAA
GTGCAGGTGG CGCTGGAAGC CGCCGTTGCC GAAATCACCG GTTACGAAGG TTTCGAGCTG
AAGCGCATCA TGCGCACCGG CACCGTCGGC ACAATCGACA ACCGCAACTG GGAACTGCGC
GACCAGCGCG GGCCGGTGAA GCGGCTCTCC CAGGCGCGCG CGATCGCGCT CGATATGGAA
TCGGCAACGA TCGCCGCTAA CGGCTTCCGC TTCCGCGTGC CCTATGGCAC TCTGCTCTGC
GTCTCCGACA AGCCGCTGCA CGGCGAATTG AAGCTGCCGG GCATGGCAAC GGCCTTCTAC
CGCACGCAGG TCAACCAGCA TCTGCAGATC GGCATCCGCG CCGTCCAGAA GCTTGCCGCC
ATGCCGAAGG AAGCGCTGCA TTCACGCAAG CTGCGCAGCT TCTTCGAAAC GGCCTTCCAA
TAG
 
Protein sequence
MNKRIPPLSP LDISSPPPFQ PESFDDPAKA VEALTALYER NTAFLIQSFA ELAQGAPISS 
RYRAFYPQVS IETTSFGHVD SRLSYGHVTA PGIYTTTVTR PKLFKHYLKE QLALLVKSHN
VPVIVSESTT PIPLHFAFGE GAHVEASTNA FIDVPMRDIF DTPDLNTTDD EIANGEYIPP
PGEPSPLAPF TAQRIDYSLA RLSHYTATHA EHFQNFVLFT NYQFYIDEFC SWARKLMAEG
GDGYTAFVEP GNVVTLPGSN APETDSALTR LPQMPAYHLK KKGHAGITMI NIGVGPSNAK
TITDHVAVLR PHAWLMLGHC AGLRNSQRLG DYVLAHAYMR EDHVLDDDLP VWVPIPALAE
VQVALEAAVA EITGYEGFEL KRIMRTGTVG TIDNRNWELR DQRGPVKRLS QARAIALDME
SATIAANGFR FRVPYGTLLC VSDKPLHGEL KLPGMATAFY RTQVNQHLQI GIRAVQKLAA
MPKEALHSRK LRSFFETAFQ
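
The sequences above are printed in blocks of 10 characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. It assumes GC content means (G + C) / total length, which the record does not spell out, and the helper names are hypothetical. Only the first two blocks of the gene sequence are used as sample input.

```python
def join_blocks(blocked: str) -> str:
    """Remove the spaces and line breaks used to print a sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence, used only as sample input.
sample = join_blocks("ATGAACAAAC GAATCCCCCC")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.5
```

Passing the full gene sequence through `join_blocks` gives a string whose length can be compared with the Gene Length field, and `gc_fraction` can be compared with the GC content field.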