Gene Information |
Locus tag | Rleg_1631 |
Symbol | |
ID | 8012702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1622650 |
End bp | 1623924 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644824217 |
Product | nitrate ABC transporter, substrate-binding protein |
Protein accession | YP_002975458 |
Protein GI | 241204362 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0841796 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGACAA CGACCTCCAA CTCCGGCGAC GACATGCCGT CGCCCGTGCG GGTGAACAGC GAAGGACCGA AAGTGCTGCG GGCCGGTTTC ATTCCGCTCG TCGATGCATC GGTGCTGATT GCGGCGGCGG AATTCGGCTT CGCGCGGAAG GAAGGCCTGA CGCTCGATCT CGTCAAGGAC GTCTCCTGGG CGAATGTGCG CGACCGCCTG GCATTCCGCC AGTTCGACAT CGCCCACATG CTGTCACCGA TGCCCGTCGC CTCCATGCTC GGTCTCGGCT CCAATCCCTC ACCGACGATC ACGCCATTTT CGCTGGGGCG CGGCGGCAAC GCGATCACGC TATCGACGCG GCTCTTCGAC AGGATGCGGA ATGACGTGGG ATTGCCGGAA ACGGCAAGCG CGCTCGACAA TGCCCGCGCC CTGGCAAAAA CATTGGCGGC AATGAAGGCC CGCGGCGAGC CGCTGCCGAC TTTCGGGGTC ACCTACCCCT TCTCCTCGCA CAATTACGAA TTCCGCTACT GGCTGGCAGC CGGCGGCATC GATCCCGACA AGGATGTCAA GCTCGTCGTC GTGCCGCCGC CCATGACTTC GGATGCGCTG GCGGCCGGTG CGATCGACGG TTTCTGCGTC GGTGCGCCGT GGAACATGGT CGCTTCAGAG CGCGGCGTCG GCCGCATCGT CGCCGCCAAA CAGGATATCT GGCCGTCGGC ACCTGAAAAG GTGATCGGCA TGCGGCCGGA CTGGGCGGAA AGCCATCCGG AAACCGTTTC CCGGCTGATC GTGGCGCTCG ACGCGGCAGC CCAGTGGTGC GATCGGCCGG AAAATCACGA TGCGCTGGCG GCAGCTCTTG CCGACCCGCG CTATATCGCT GCCCCCGTCG AAATCATTCG CCGTGTGCTC GCCGGCGAAT TCAGCCTCGA CGCAAAGGGC AACCGCCGCA TCATCGCCGA TTATTTCATG TTCCATTCCG GCTTCGCCAA TTATCCCCGG CCAAGCCATG CTCTTTGGAT CTACAGCCAG ATGATCCGCT GGGGACAGGC CGAGATCAGC CTTAACATGG CAAGGGCCGC AGCATCCGCT TACCGCCCGG ATCTCTATCG CACGGCTCTC GGTGACGACA AATCACCTGA AGATGCCGAT ATCCGCATCG AAGGCAGTGA CGAGGGCGAC CGTTTCATGG ACGGCCACGT CTTCGACCCG GCGAGGTTGC CGGACTATGT CGCAGGTTTT GCCGTCAAAA GCGCCCTCGC CTTCGTTTGC GACGACGAGG TTTAA
Protein sequence | MMTTTSNSGD DMPSPVRVNS EGPKVLRAGF IPLVDASVLI AAAEFGFARK EGLTLDLVKD VSWANVRDRL AFRQFDIAHM LSPMPVASML GLGSNPSPTI TPFSLGRGGN AITLSTRLFD RMRNDVGLPE TASALDNARA LAKTLAAMKA RGEPLPTFGV TYPFSSHNYE FRYWLAAGGI DPDKDVKLVV VPPPMTSDAL AAGAIDGFCV GAPWNMVASE RGVGRIVAAK QDIWPSAPEK VIGMRPDWAE SHPETVSRLI VALDAAAQWC DRPENHDALA AALADPRYIA APVEIIRRVL AGEFSLDAKG NRRIIADYFM FHSGFANYPR PSHALWIYSQ MIRWGQAEIS LNMARAAASA YRPDLYRTAL GDDKSPEDAD IRIEGSDEGD RFMDGHVFDP ARLPDYVAGF AVKSALAFVC DDEV
|
| |
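The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database: the helper names (gene_length, protein_length, strip_blocks) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the terminal stop codon (the gene sequence above ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split a displayed sequence into 10-character blocks."""
    return "".join(seq.split())


# Values copied from the record above.
start_bp, end_bp = 1622650, 1623924

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1275, matching "Gene Length"
print(protein_length(length_bp))  # 424, matching "Protein Length"

# First two blocks of the gene sequence, joined back into one string.
print(strip_blocks("ATGATGACAA CGACCTCCAA"))  # ATGATGACAACGACCTCCAA
```

The same strip_blocks helper can be applied to the full gene or protein sequence before measuring its length with len().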