Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5499 |
Symbol | |
ID | 8016808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | + |
Start bp | 84643 |
End bp | 86037 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644827666 |
Product | Amidohydrolase 3 |
Protein accession | YP_002978866 |
Protein GI | 241518238 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.544812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCTG AGACTCAAAT CGTCATCGTC GGCGGACGAA TCGTCACCGG CGATGGAACC ACGCTCCACG AGCGGGGGGT CGTGCGCATC CGTGGGACAA GGATCATCGA CGTCGCGCCA GGTGAGGCCG ATGCCCGAAG CGAGGCTGTC GTCCTCAACG CCGCCGGATG CACGGTCATT CCCGGTATCG TCAATGCGCA TGCTCATGGG TGCATCCACG GCCCGTCCAT GCCGAGCGGG TCGGTGCCCG TCCGGCCCAC GGATGTCGAC TACTTCAGAA ACAGACATCT CCTGTCCGGA ACAACCACCC TGCTCAACGT CTGCGGCTTA GCGCTGCCGG ATGAGATCGA TGGCCCCTCG AACCAGCGCC ATGCGATGGA TATCCATCTG ACGACGGCGC ATACCACGAG CAATCTGGCC GCGGCGATCG CGATTGATGG TGGCGGATTA TCGGAGCGGC ACAAGATCGC CCGCATCGAT GACATGGTCG CCAAAGGCGC AAAGGCACTC GGAGAAGCAG GCGGCGGGCA GACGCTTGGT GGCGGGGCGC AGGACTATCG CTTTATACCT GCGGCTATAG AGGCCGCTAC GGGAACATCA ATCCATCCGA AGGAGGCGCG CGCACTCAAG GAAGCCGTGC TCGGCCGCTA TCTCGATCGT GGCTTGCCCG ATCTTCCGCG ACTCAATGCT CTTCTGATCG AGTGCGGTCT CGCCGCTAAA ATTTGTACGT CCGACCTCAC AAAGCTGATC CGCGACACGG TGATGCCGCC CGTGGCGCTG TCACTGAAGG GTTTTGACGA GATCGCGGCA GCGTCTGAAC GACTCAATTT TCCGGCGATC TTTCACAATG CGGCACCCAC CGCGGCGACG CTGCTGAAGC TTGCGGAGAC ATATCCGAAG GCCCGCATCA TCGCTGGGCA TTCGAACCAC CCGATGTTCC TTCCCGAAGA GGCAGTGCGT TTCGGGCTGC AACTGAGGGA GCGCGGCGTC GCCATCGACG TCTCGACGCT CGATTGCATC GAAACCCGCT GGCGCAATGA CACCGCAAAT ATCGATGCTC TGGTCGAAGC GGGGCTCGTC GATACCCTAT CGACAGACTT TGCCGGCGGA GACTGGGATA GTATTCTGTC GGCCATCCAG CGAATGGTGC GCAAGAGCCA GCTTTTGCTG CCGGCCGCAA TCGCGCTTGC GACCGGCAAC GTCTCGAAAA CACTCCCGGA GCTTGCAGCC GATCGTGGCT TGCTGGAAAC GGGTAAACGC GCCGACGTGG TCATCGTTGA GAACCACAAT CTCGGTCGAG TCCGCCATGT CGTGGCAAAC GGAGAGCTGG TGGTGTTCAA CGCGGCGATG GGGGTTGGGG ATTTGCACGC TTACGCCATG GCAGCGGGCC GTTAA
|
Protein sequence | MPPETQIVIV GGRIVTGDGT TLHERGVVRI RGTRIIDVAP GEADARSEAV VLNAAGCTVI PGIVNAHAHG CIHGPSMPSG SVPVRPTDVD YFRNRHLLSG TTTLLNVCGL ALPDEIDGPS NQRHAMDIHL TTAHTTSNLA AAIAIDGGGL SERHKIARID DMVAKGAKAL GEAGGGQTLG GGAQDYRFIP AAIEAATGTS IHPKEARALK EAVLGRYLDR GLPDLPRLNA LLIECGLAAK ICTSDLTKLI RDTVMPPVAL SLKGFDEIAA ASERLNFPAI FHNAAPTAAT LLKLAETYPK ARIIAGHSNH PMFLPEEAVR FGLQLRERGV AIDVSTLDCI ETRWRNDTAN IDALVEAGLV DTLSTDFAGG DWDSILSAIQ RMVRKSQLLL PAAIALATGN VSKTLPELAA DRGLLETGKR ADVVIVENHN LGRVRHVVAN GELVVFNAAM GVGDLHAYAM AAGR
|
| |