Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4540 |
Symbol | |
ID | 6977634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 178985 |
End bp | 179968 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643393718 |
Product | NMT1/THI5 like domain protein |
Protein accession | YP_002278536 |
Protein GI | 209546618 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.184718 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTCC ACCCCCGCAA TCTCCTCCTG CCGGCCGTGA TCGCCCTCGG TCTTGCCACC CCGGCCGCCG CCGCCGCCAC GGTGAAGCTG CGCTACCTCG CCAGCCAAGG CGGTCTTGCC GCCCACGAAC TTGCCGACGA ACTCGGCTAT TTCAAGGACA CCGGCATCAC GTTTGAGAAT GTCGGCTATG CCCAGGGCGG TCCGGCCTCT CTGATCGCGC TCGCATCCGG CGATGTCGAG ATCGGCAGTG CGGCCACCTC CGCGGTGCTG AATTCGATCA TCGGCGGCAA CGACTTCGTA GCCGCCTATC CGTCGAACGG CATCAATGAC GAGGTGCAGT CGACTTTCTA CGTGCTGGAA GACAGCCCGA TCAAAAGCAT CAAGGACATT GTCGGCAAGA GTATCGCGGT CAACACGCTC GGTGCCCATC TCGACTACAC CATCCGCGAA GCCCTGCATT CTGTCGGCTT GCCGAGCGAC TCCGCCAACC AGGTCGTCGT TCCCGGGCCG CAGCTCGAGC AGGTGCTGCG CTCCAAGCAG GTCGATATCG CCGCCTTCGG CTATTGGCAG ACGACCTTCG AGGGCGCGGC GCTCAAGAAC GGCGGCTTGC GTGCGGTCTT CGACGATACC GATGTGCTCG GCGACATTGC CGGCGGCTTC GTGGTCCTGC GCCGAGATTT CATTCAGCAG CATCCGCAAG CCGCCAAGAT CTTCGTCGAG CAGTCGGCCC GCGCCCTCGA TTATGCACGC GAACATCCTG AGGAAACCAA AAAGATCCTC GCCAAGGCGC TCAGTGAGCG TGGCGAGAAC GCGGATATCG CGCAATATTT CCGCGGCTAC GGCGTGCGCG CCGGCGGCCT GCCGGTCGAG CGCGATATCC AGTTCTGGAT CGACGTCCTC GTCCGCGAAG GCAAGCTGAA GCAGGGCCAG CTGGCGGCCA AGAACATTCT CTTTACCGCC GACGCCAAGC CGGCAAGCAA CTGA
|
Protein sequence | MTFHPRNLLL PAVIALGLAT PAAAAATVKL RYLASQGGLA AHELADELGY FKDTGITFEN VGYAQGGPAS LIALASGDVE IGSAATSAVL NSIIGGNDFV AAYPSNGIND EVQSTFYVLE DSPIKSIKDI VGKSIAVNTL GAHLDYTIRE ALHSVGLPSD SANQVVVPGP QLEQVLRSKQ VDIAAFGYWQ TTFEGAALKN GGLRAVFDDT DVLGDIAGGF VVLRRDFIQQ HPQAAKIFVE QSARALDYAR EHPEETKKIL AKALSERGEN ADIAQYFRGY GVRAGGLPVE RDIQFWIDVL VREGKLKQGQ LAAKNILFTA DAKPASN
|
| |