Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2034 |
Symbol | |
ID | 6980773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2096195 |
End bp | 2097175 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643396756 |
Product | Monosaccharide-transporting ATPase |
Protein accession | YP_002281544 |
Protein GI | 209549627 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0584323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGCGC TCGACATCAA TGAACACAGG CTTTCGTCCG GAGCCTGGCT GAGCAAGCTC AAGGGAGCAA CCGGCCCGCT CGTCGGACTG CTCGCGCTGT GCGTCTTTCT GAGCCTGAGC ACCGACACGT TTCTTTCGGT TCGAAACGGC CTCAACATCC TCGATCAGAT CACCGTTCTC GGCATCATGG CGGTTGGAAT GACCTTCGTC ATCCTAATCG GCGGCATCGA TCTCTCGGTC GGCTCGGCGC TTGCCCTGGC GATGATGGTC ATGGGCTGGA CCGCCAATGT CGCCGGCCTG CCGCTGCCGG TCGCGATCGC TTTTGCTCTG GTCGCATCGG GAGTTTCGGG CCTGATCGTC GGACTTCTGG TGACGCAGTT CAGGGTCCCG GCCTTTATTG CCACTCTTGC GATGATGTCC GCCGCTCGCG GGGTCGCCAA CATGATCACC GACGGTCAGC AGATCGTCGG ATTCCCGGAC TGGTTCATGA TGCTGGCAAT CGATCGTCAT TTCGGCGTGT TGACCGCCAC CGTGTTTCTC ATGCTTGCGG TGGTTCTTGC GGCATGGCTT TTCCTGCACT TCCGCTCCGA AGGGCGCATG CTCTATGCGG TCGGCGGAAA TCCGGAAGTC GCGCGCCTTG CGGGTATCAA CGTCCCGCTC GTGACGATTG GCGTCTACGT CGTAAGTTCA GTCCTTGCCG GCCTCGCAGG CATCGTACTC GCCGCCAGGC TGGATTCCGT CCAACCATCA AGCGGTCTGG GCTATGAGCT GGACACCATC GCCGCGGTCG TCATCGGCGG CACGTCGCTC TCCGGCGGCG CCGGCGGGAT AGGAGGAACA TTGATCGGTG TTCTTATCAT CGGCGTCCTT CGCAACGGGC TCAATCTTCT CAACGTCTCG CCGTTCCTGC AGCAGGTGAT CATCGGCATC GTCATCGTGC TCGCGGTCGG CGCGGAGACT ATTCGTCGGC GTCGCGCTTG A
|
Protein sequence | MVALDINEHR LSSGAWLSKL KGATGPLVGL LALCVFLSLS TDTFLSVRNG LNILDQITVL GIMAVGMTFV ILIGGIDLSV GSALALAMMV MGWTANVAGL PLPVAIAFAL VASGVSGLIV GLLVTQFRVP AFIATLAMMS AARGVANMIT DGQQIVGFPD WFMMLAIDRH FGVLTATVFL MLAVVLAAWL FLHFRSEGRM LYAVGGNPEV ARLAGINVPL VTIGVYVVSS VLAGLAGIVL AARLDSVQPS SGLGYELDTI AAVVIGGTSL SGGAGGIGGT LIGVLIIGVL RNGLNLLNVS PFLQQVIIGI VIVLAVGAET IRRRRA
|
| |