Gene Rleg_3101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3101 
Symbol 
ID8014009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3098811 
End bp3100508 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content63% 
IMG OID644825668 
Productadenine deaminase 
Protein accessionYP_002976896 
Protein GI241205800 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1001] Adenine deaminase 
TIGRFAM ID[TIGR01178] adenine deaminase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.326559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGA AGCTCGAACG TCTCATCGAT CAGGGCGTTG GCCGTGTGCC GGCCGATATC 
GTGCTGAAGG GCGGCTGCTT CTTCGATCTC GTCACCGGCG AACTCGTCCA GTCGGACATT
GCCATCGGCG CCGATCGCAT CGTCGGCACC TCAGGCAATT ATGAGGGCGA AACCGAGATC
GACATCTCGG GCAGGATCGT CGTTCCAGGC TTCATCGATA CGCATCTGCA TATCGAATCT
TCGCTGGTGA CCCCGCATGA ATTCGATCGC TGCGTTCTGC CCTATGGTGT CACCACCGCC
ATCTGCGATC CGCACGAAAT CGCCAATGTG CTCGGAACCG CCGGCATCGA ATTCTTTGTT
GAATCCGCGC TGGAGACGAT CATGGACATC CGCGTCCAGC TCTCTTCCTG CGTTCCGGCG
ACACATCTCG AAACCTCCGG CGCCGACCTG CCGATCGAAA GCCTCCTGCC CTACCGTGAC
CATCCGCAGG TCATCGGCCT TGCCGAATTC ATGAATTTCC CCGGCGTCAT CCACAAAGAC
CCCATCTGCA TGGCCAAGCT CGAAGCCTTC CAGAGCGGCC ATATCGACGG TCACGCGCCG
CTGCTCTCGG GCAACGACCT CAACGGTTAC CTCTCGGCCG GCATCCGCAC CGAGCACGAA
TGCACGACCG CCGCCGAAGC GCTGGAAAAG ATCCGCAAGG GCATGCACAT TCTCGTGCGC
GAAGGTTCGG TATCCAAGGA TCTCGCTGCG CTGATACCCA TTATCACCGA GCGGCTTTCA
CCCTTCCTCG CGCTTTGCAC CGACGACCGC AATCCGCTCG ATATCGCCGA ACAGGGCCAT
CTTGATCATA TGATCCGCAC AGCGATCGCG AGCGGCGTCG AACCCCTGGC GATCTACCGC
GCCGCCTCGA TTTCGGCCGC CCGCGCCTTC GGTCTGAGGG ACCGCGGCCT GGTCGCGCCC
GGCTGGCGCG CCGATCTGGT GGTGCTCGAC AGCCTGGAAA ACTGCCGCGC CGACATGGTC
TTTTCCGCCG GCCGCCGCGT CACCGATGCG CTCTTTTCCT CGCGCAGGCC GGTTGCCCCG
ATCGGCCTCG ACAGCGTCAA GGCCCGTCCC GTCAACGCCG CCCATTTCGG CGTCCCGGTC
GCCGAAGGCG AGACGCCTGT CATCGGCGTC ATCCCGGGCA AGATCATCAC CGAGCATCGC
CGCTATCGCC TGCCCGTCAG GGGCAACGAG GCGACAGTCG ATCTTGCGAA TGATATCATC
AAGGTCGCCG TCATCGAGCG CCACGGCAAG AACGGCAACC ATGCCAACGG CTTCGTCCAG
GGCTTCGGCC TGAAGAAGGG TGCGATCGCC TCGACCGTCG GCCATGACAG CCACAATATC
TGCGTCGTCG GCGTCAACGA GGACGACATG GCGCGCGCCG CAAACCGCCT CGGCGAGATC
AAGGGCGGCT TCGTTGTCGT CGAAGACGGC AAGGTCACCG GCGAAATCGC CCTGCCCATC
GCCGGCTTGA TGAGCCTCGA ACCCTATGAG ACGGTCCGCG ATACGCTCCA CCAACTGCGC
AAAGCCGCCT TGGCGCTCGG CGCCACGCTG GAAGAACCCT TCCTCCAGCT CGCTTTCTTG
CCGCTGCCGG TCATCCCGCA CCTGAAAATA TCCGACCGCG GCATGGTTGA TGTCGACAAG
TTCGCGCTCA TCGGGTGA
 
Protein sequence
MTTKLERLID QGVGRVPADI VLKGGCFFDL VTGELVQSDI AIGADRIVGT SGNYEGETEI 
DISGRIVVPG FIDTHLHIES SLVTPHEFDR CVLPYGVTTA ICDPHEIANV LGTAGIEFFV
ESALETIMDI RVQLSSCVPA THLETSGADL PIESLLPYRD HPQVIGLAEF MNFPGVIHKD
PICMAKLEAF QSGHIDGHAP LLSGNDLNGY LSAGIRTEHE CTTAAEALEK IRKGMHILVR
EGSVSKDLAA LIPIITERLS PFLALCTDDR NPLDIAEQGH LDHMIRTAIA SGVEPLAIYR
AASISAARAF GLRDRGLVAP GWRADLVVLD SLENCRADMV FSAGRRVTDA LFSSRRPVAP
IGLDSVKARP VNAAHFGVPV AEGETPVIGV IPGKIITEHR RYRLPVRGNE ATVDLANDII
KVAVIERHGK NGNHANGFVQ GFGLKKGAIA STVGHDSHNI CVVGVNEDDM ARAANRLGEI
KGGFVVVEDG KVTGEIALPI AGLMSLEPYE TVRDTLHQLR KAALALGATL EEPFLQLAFL
PLPVIPHLKI SDRGMVDVDK FALIG