Gene Information |
Locus tag | Rleg_1956 |
Symbol | |
ID | 8012995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1945955 |
End bp | 1947463 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644824545 |
Product | transcriptional regulator domain protein |
Protein accession | YP_002975777 |
Protein GI | 241204681 |
COG category | [S] Function unknown |
COG ID | [COG5616] Predicted integral membrane protein |
TIGRFAM ID | |
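The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count of a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1945955, 1947463)
print(length_bp)                  # 1509
print(protein_length(length_bp))  # 502
```

Under these assumptions the results agree with the Gene Length (1509 bp) and Protein Length (502 aa) fields.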
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00706169 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGGAT CGCGTTTTGC CTTTGGACCA TTCGTGCTTG ATCCGGCTGC GGGAACGCTT CTTCGGAACG ATGATCCCGT TGCCGTCGGC CACCGCGGGG TCAAGCTGCT TGCAGCGCTT GTCGGACGAC CCGGCGAAAT CTTGGGCAAG GCCGAGTTGA TGGACGCGGC GTGGCCGGGC ATATCAGTCG AGGAGGGCAA CCTGACTGTC CAGATTGCGC AGCTGCGCAA GCTGCTTGGT CCGGCCGCGA ACGGCGGTGA ATGGATCTCC ACGGTTCCGC GCATCGGCTA CCGCTTCATA GGCGCCATCA ACCAGCTTGG CGGCGTGAAG CGAAAAGCTT TGCCGCTGCC TGACAAACCA TCGATAGCAG TGCTGCCATT TGTCAATATC AGCAACGATC CCGAGCAGGA ATCCTTCGCC GACGGGCTGA CGGAAGACCT GATCACCGAC TTATCCAGAA TGCCGGGCCT GTTCGTCATC GCCCGCAACT CGGCCTTCGC CTACAAGGGA AAGGCGAGGG ACGTAGGCGA GATCGCCGAG GAGCTCGGCG TACGCTACCT GGTGGAGGGA AGCGCAAGAC GCGTAGCAGG GCACGTGCGC GTCAACGCCA AGCTGGTCGA TGCGGCAAGT GGCGATCATC TATGGGCGGA ACGCTTCGAT CGCAGCCTCG ACGATATCTT TGCCGTTCAG GACGAGGTCA CCGGCAAGAT CGTCGAAGCG CTGCTCGGGC GGCTGCGCGC ACCGCCATCG CGCAATCGGC CCAAAAATTT AGAGGCTTAC GATCTCTGCG TACGGGCGCG CAGGCTGATG GATGATACGC CGCAGACGGC GCGGGAAGCG CATCTGATGC TGACGCGCGC GATTGCCCTC GACCCTGATT ATGCCGAGGC GTACCGCTGG CTTGCCATGA ACCACTGGAT GGGAGAGGTC CATTCCGGCG GACCGACGGA ACCCACACGC GGGACTGCTC TGGAACTGGC GCGCAAGGCG GTGGCGATCG ATCCCAACGA TGCTGGCTGC CGCTGGATAC TGGCTTACCT GCTTGCCTAT GAGCGCAACT TTGCCGAGGC GGATGCCGAA TTTGCCAAGG CGATCGAACT CGACCCGAAC GAGGCCGACA CCTTTGCGGC ACTATCCGAC ATCGCGGTTT TAGCCGGGCG GGTCGGGGAG GGCCTCGAGC ATATCGCCAA GGCTTTCCGG CTGAACCCGT TTCCGGCAAG CTGGTACTAT CTGGCGCTCG GACAGGCGCA ATATGCCGCC GGCCAATACG CAGCCGCTGT CGACACGCTG CGGAGCGACG AGACCTATCG CACGAGCTCA CGCCGTTTCC TGGCGGCAAG CCTTGCTCAA CTCGGCCGGC TCGACGAGGC GCGCGCCGAA GCCGAACTGT TTCTCGTCGC CAACCCGCAT TTTTCAACCC GCCACTGGGC GAAGACCGAG CCATTCCGCG ACGCTCGGAC GCTTAAGCAT TTCATCGACG GCTACCGTAA GGCCGGACTT CCGGAGTGA
|
Protein sequence | MQGSRFAFGP FVLDPAAGTL LRNDDPVAVG HRGVKLLAAL VGRPGEILGK AELMDAAWPG ISVEEGNLTV QIAQLRKLLG PAANGGEWIS TVPRIGYRFI GAINQLGGVK RKALPLPDKP SIAVLPFVNI SNDPEQESFA DGLTEDLITD LSRMPGLFVI ARNSAFAYKG KARDVGEIAE ELGVRYLVEG SARRVAGHVR VNAKLVDAAS GDHLWAERFD RSLDDIFAVQ DEVTGKIVEA LLGRLRAPPS RNRPKNLEAY DLCVRARRLM DDTPQTAREA HLMLTRAIAL DPDYAEAYRW LAMNHWMGEV HSGGPTEPTR GTALELARKA VAIDPNDAGC RWILAYLLAY ERNFAEADAE FAKAIELDPN EADTFAALSD IAVLAGRVGE GLEHIAKAFR LNPFPASWYY LALGQAQYAA GQYAAAVDTL RSDETYRTSS RRFLAASLAQ LGRLDEARAE AELFLVANPH FSTRHWAKTE PFRDARTLKH FIDGYRKAGL PE
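The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any computation. The following is a minimal Python sketch of that cleanup together with a GC-fraction calculation of the kind summarized in the GC content field. The helper names are hypothetical, and how the database itself computes or rounds GC content is not stated in this record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into display blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record, for illustration.
prefix = clean_sequence("ATGCAGGGAT CGCGTTTTGC CTTTGGACCA")
print(len(prefix))  # 30
print(f"{gc_fraction(prefix):.1%}")
```

Passing the full Gene sequence value through `clean_sequence` and then `gc_fraction` gives the figure to compare against the GC content field.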