Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4464 |
Symbol | |
ID | 6977558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 95644 |
End bp | 97023 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643393642 |
Product | amidohydrolase |
Protein accession | YP_002278460 |
Protein GI | 209546542 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0716528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGAT ATCTCCTGAA GAACTGCACG GCCGTGATTG TCGACGATGG CAAGGGGCCG GTCGTCCGTC GCAATGTCGA TCTGCTAACC GACGGTTCGG CGATTGTGGC GATCGGTGAA AATCTGGCGG CGGACGCGCT TCGCGACGGC ACGACCGTTC AGGATGCCAC CGGCTGGTTC GTCTATCCCG GCCTCGTCAA TACCCATCAT CATTTTTTTC AATGCTTCGT GCGCAATCGC GCCGATCTCG ACTGGACGAA GCTTTCGGTT ATCGAGTGGC TGGACCGGAT CTACCCGATC TTTTCGCAGC TCAATGAGGA GTGTTTTTAC CACTCCTCCG TCACGGCGAT GGCCGAGATG ATCAAACATG GCTGTACCAC GGCTTTCGAC CATCAATATA ATTTCCCCCG GCACGCCGGG AAGCGGCTGA TCGACCGCCA GTTCGAAGCC GCCGAGCTCT TCGGCATGCG CTTCCACGCC GGCCGCGGCG GCAATACTCT GCCGAAGTCG GAGGGCTCGA CCATTCCCGA CGAGATGCTT GAGACGACCG ACGAATTTAT CGCCGACTGC GCCCGGCTGA TCGACACTTA CCACGATGCC AACCCGTTCA GTATGCGCCA GGTCGTGGTG GCGCCCTGCC AGCCGGTGAA TTGCTATCGC GAGACCTTTG CGGAATCGGT GGCGCTGGCG CGTGATCGCG GCGTCATGAT GCACACCCAT GTCGGCGAGG GCGAAAGCCC GGTCATTCAC GCTCGCCATG GCGTGCGCAC CGTCGATTAT CTGGAAGAGC TCGGCTTTGC CGGCCCCGAC ACATTTTATG CCCATTGCTG GGAGTTGACC CACGACGAAC TCAGGACAAT GGCGGCGAGC GGCACCGGCG TGGCGCACTG CCCCGAACCG GTCTATCTCG TCGGCGCCGA GGTCACCGAC ATTCCCGCCA TGGCTGCCTT CGGCTTGCGC ATCGGCCTTG GCTGCGACGG TGCTGCCTCC AACGACAATT CCAACGTGAT GCACTGCCTG CACTCCGCCT ATATGTTGCA GTGCCTGGTT TCCTCGAGCC GCGCTCATCC CGTGCCGCCG CCAGTCGATT TCCTCGGCTA CGGCACGACA GGCGGAGCGA GCCTGCTCGG CAGGCGCGAC ATCGGCCGGC TGGCGCCTGG CATGGCAGCA GATCTGTTTG CGATTAACAC GCGTCGCATG GATTATGTCG GTACGCGGCA CGATCCGTTG AGCCTGATTG CCCGCGTCGG CATCGGCATG GCGACCGATA TGACGATGAT CAATGGCCGC ATCGTCTGGC AGAAGGGGGA ATTCCCCGGT CTCGACGAGG CCAAGCTCTC TGCCGACGCC GAGGCTGCAC TCGCAGCCGT AGAATTTTAA
|
Protein sequence | MAGYLLKNCT AVIVDDGKGP VVRRNVDLLT DGSAIVAIGE NLAADALRDG TTVQDATGWF VYPGLVNTHH HFFQCFVRNR ADLDWTKLSV IEWLDRIYPI FSQLNEECFY HSSVTAMAEM IKHGCTTAFD HQYNFPRHAG KRLIDRQFEA AELFGMRFHA GRGGNTLPKS EGSTIPDEML ETTDEFIADC ARLIDTYHDA NPFSMRQVVV APCQPVNCYR ETFAESVALA RDRGVMMHTH VGEGESPVIH ARHGVRTVDY LEELGFAGPD TFYAHCWELT HDELRTMAAS GTGVAHCPEP VYLVGAEVTD IPAMAAFGLR IGLGCDGAAS NDNSNVMHCL HSAYMLQCLV SSSRAHPVPP PVDFLGYGTT GGASLLGRRD IGRLAPGMAA DLFAINTRRM DYVGTRHDPL SLIARVGIGM ATDMTMINGR IVWQKGEFPG LDEAKLSADA EAALAAVEF
|
| |