Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3594 |
Symbol | |
ID | 6982355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3719469 |
End bp | 3720779 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643398319 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002283087 |
Protein GI | 209551170 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4536] Putative Mg2+ and Co2+ transporter CorB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.798731 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTCG AAGGCGCTCT GGCATTTCTT GCGACATATT GGCCGGAGAT CCTTTCGATC ACGGCGCTCG TGCTGATGTC CGCCTTCTTT TCCGGCTCGG AAACCGCGCT GACCGCCGTG TCGCGCAGCC GCATCCATAC GCTCGAGGTC AACGGCGACG AACGCGCCGG CCTCGTCCGG CAACTGATCG AGCGGCGCGA CCGGTTGATC GGCGCGCTGC TGATCGGCAA TAATCTCGCC AATATCCTGT CCTCGTCGAT CGCCACCAGC CTGTTCCTCG GGCTGTTCGG CAGTTCCGGC GTGGCGCTGG CGACGCTGGC GATGACCGTC ATCCTCGTCA TCTTCGCCGA AGTGCTGCCG AAGAGCTGGG CGATTTCGAC ACCCGACCGT TTCGCGCTTG CGATCGCCGT TCCCGCCAGG CTCTTCGTTC TCGTCGTCGG CCCGATTTCC TCCTTCGTCA ATGCGATCGT CCGGCGGATC CTGGCGCTGT TCGGCATCAA TCTCTCACGC GAGGTATCGA TGCTGACGGC GCATGAGGAG CTGCGCGGCG CCGTCGACCT GCTGCACCGC GAGGGATCGG TGGTGAAAGC CGACCGCGAC CGGCTGGGCG GCGTGCTCGA TCTCGGCGAG CTCGAACTCT CAGACATCAT GGTCCACCGC ACGGCGATGC GGGCGATCAA CGCCGACGAT CCGCCTGAGG CGGTGGTGCG CGCCATTCTC GAAAGTCCCT ATACGCGCAT GCCTTTATGG CGCGGCACGA TCGACAACAT CATCGGCGTC GTTCACGCCA AGGATCTGCT GCGGGCGCTG GCCGAGCCGA ACATGGAGCC GCAGAACCTC GATATCGTGA AGATCGCGCA GAAGCCGTGG TTCGTGCCCG ACAGCACCAA CCTCGAAGAT CAGCTGAACG CCTTCCTGCG CCGCAAGCAG CATTTCGCCG TCGTCGTCGA CGAATATGGC GAGGTGCAGG GCATCGTCAC GCTGGAGGAT ATTCTCGAGG AAATCGTCGG CGATATTTCC GATGAGCACG ATATCGAGAT CCAGGGTGTG CGCCAGGAGG CCGACGGTTC GGTCGTGGTC GACGGCGGCG TGCCGATCCG CGACTTGAAC CGCGCGCTCG ACTGGAACCT GCCCGATGAG GAGGCGACGA CGATTGCCGG CCTGGTCATC CATGAATCGA TGACCATTCC GGAAGAGCGC CAGGCCTTCA CCTTTTACGG CAAGCGCTTC ATCGTCATGA AGCGGGAAAA GAACCGCATC ACCAAGCTGC GCATCCGCCC CGCCGGGGAA GACGGCGCAA AACCGGTCTG A
|
Protein sequence | MSVEGALAFL ATYWPEILSI TALVLMSAFF SGSETALTAV SRSRIHTLEV NGDERAGLVR QLIERRDRLI GALLIGNNLA NILSSSIATS LFLGLFGSSG VALATLAMTV ILVIFAEVLP KSWAISTPDR FALAIAVPAR LFVLVVGPIS SFVNAIVRRI LALFGINLSR EVSMLTAHEE LRGAVDLLHR EGSVVKADRD RLGGVLDLGE LELSDIMVHR TAMRAINADD PPEAVVRAIL ESPYTRMPLW RGTIDNIIGV VHAKDLLRAL AEPNMEPQNL DIVKIAQKPW FVPDSTNLED QLNAFLRRKQ HFAVVVDEYG EVQGIVTLED ILEEIVGDIS DEHDIEIQGV RQEADGSVVV DGGVPIRDLN RALDWNLPDE EATTIAGLVI HESMTIPEER QAFTFYGKRF IVMKREKNRI TKLRIRPAGE DGAKPV
|
| |