Gene Information |
Locus tag | Rleg_4663 |
Symbol | |
ID | 8007141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 26952 |
End bp | 27968 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644821599 |
Product | NMT1/THI5 like domain protein |
Protein accession | YP_002972859 |
Protein GI | 241113024 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
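The coordinate and length fields above are mutually consistent if Start bp and End bp are read as 1-based, inclusive positions and the final codon of the CDS is a stop codon. Both are assumptions that match the numbers in this record, not something the record states. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(26952, 27968)      # Start bp and End bp from the record
print(length, protein_length(length))   # prints: 1017 338
```

The two printed values agree with the Gene Length and Protein Length fields.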
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.679325 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAAAG CACAGACGAA GATGCAATTC CTGGGCTGCC TCGGCCTCGC CTCGATGCTT GCGGCTGCCT CACCGGCGCT TGCCCTCGAC AAGGTGAGCT ACGGGACGAA CTGGCTTGCC CAGGCGGAGC ATGGCGGCTT CTACCAAGCC GTCGCCGACG GCACCTATGC AAAATACGGC CTCGACGTCA CCATTGTCCA GGGCGGCCCG AATGCTGCAA ACAGCGCCCT ATTGATCTCC GGCAAGCTCG ATTTCTACAT GGGCGGCCCT CAGGGAGAGA TATCCGCCGT CGAACAGGGC ATTCCGCTGG TCGATGTCGC CGCGATCTTC CAAAAGGATC CGCAGGTACT GATCGCCCAT CCGGACAACG GCGTCGACAA GTTCGAGGAC CTCGCCAAGC TGAAAACGCT GTTCCTCAGC AAGGACGGCT ATCTCACCTA TTTCGAGTGG ATGAAGGCCA ACTTCAAAGG CTTCAAGGAC GAGCAGTACA AGCCCTATAA CTTCAGTCCC GCCCCCTTCC TCGCAGACAA GGAGTCTGCC CAGCAGGGGT ACCTGACCTC CGAACCCTAC GAGATCCAGA AGCAGGCAGG CTTCGAGCCA AAGGTCTTCC TGCTCGCCGA CAACGGCTAC TCACCCTATT CGACGATGAT CACGACCACG CAGGCGACGA TCGATGGCAA GCCCGACGTC GTGCAGCGCT TCGTCGATGC CTCGATCGAG GGCTGGTACA ATTACCTCTA CGGCGACAAC ACCAAGGCGA ACGCGCTGAT CAAGAAGGAC AATCCTGAAA TAACGGACGG CCAGATCGCC TATTCGGTCA CCAAGATGAA GGAATACGGC ATCATCGAAT CCGGCGACAG CCTGGACAAG GGCATCGGCT GCATCACCGA CGCCCATTAC AAGAAGTTCT TCGACGAGAT GACTGCTATC AAGGTCTTCA AGACCGACAC CGACTATACC AAGGCCTTCA CGACGAAGTT CGTCTGCAAG GGCGCCGGAA TAGCGCTGAA GAAATAA
|
Protein sequence | MLKAQTKMQF LGCLGLASML AAASPALALD KVSYGTNWLA QAEHGGFYQA VADGTYAKYG LDVTIVQGGP NAANSALLIS GKLDFYMGGP QGEISAVEQG IPLVDVAAIF QKDPQVLIAH PDNGVDKFED LAKLKTLFLS KDGYLTYFEW MKANFKGFKD EQYKPYNFSP APFLADKESA QQGYLTSEPY EIQKQAGFEP KVFLLADNGY SPYSTMITTT QATIDGKPDV VQRFVDASIE GWYNYLYGDN TKANALIKKD NPEITDGQIA YSVTKMKEYG IIESGDSLDK GIGCITDAHY KKFFDEMTAI KVFKTDTDYT KAFTTKFVCK GAGIALKK
|
| |
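The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be stripped before computing lengths or base composition. The sketch below shows one way to do that and to compute a GC percentage; the function names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full field.

```python
def clean_sequence(field: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence field, used as a short example.
example = "ATGTTGAAAG CACAGACGAA"
seq = clean_sequence(example)
print(len(seq), gc_percent(seq))  # prints: 20 40.0
```

Applied to the complete gene sequence field, the same two functions could be used to check the Gene Length and GC content values reported in the record.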