Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3936 |
Symbol | |
ID | 8014752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4009875 |
End bp | 4010885 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644826505 |
Product | NMT1/THI5 like domain protein |
Protein accession | YP_002977716 |
Protein GI | 241206620 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.541401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCTTC TCACCCGTCG CCAGACGATC TTCGCCGCCA TCGCGGCAAG CGTCGCCGGC CGCACTGCCT TTGCCCAATC GGCGCCCGCA AAGGTTCGCA TCGCGCTCGA CTGGACGCCC AACACCAACC ATATCGGCAT CTATGTCGCC AAGGCGAAGG GCTTCTATGC CGATGCCGGG CTCGATGTCG AGATTCTTCC CTTCACCGAT ACCAGCGCCG GAACGCTGGT GTCGAACGGC GTTGCCGATT TCGGCATCAG CAGCGAGATC GAGACGCTGA CGCAACGCGC TGGCGGCGGT GACGTGAAGA TGGTCTACGG CGTCGTCCAG ACGGAAACCG CACGCCTGAT CTTCAAGGGC GGACGCGACG ACATCAAGAG CCCGAAAGAC CTCGACGGCA AGACCTATGG CGGCTTTGGT GGCACCTGGG AGAGCGCGCT GATCTCGGCG ATGATCCGCA ATGACGGCGG CAAAGGCGAC GTCAAGACCG TCACCCTCGG CACCTCCGCT TACGAGGCGC TGGACAATGG CTCGATCGAT TTCACGCTGG AGATCTACAC CTGGGAAGGC ATCGCTGCCG AACTGGAAAA CCGCAAGATC GGCCGCTTCC ACTATTCCGA TTATGGCATT CCCGACGAGC AGACGACGGT CATCGTCTCC AGCGACGCCT ATCTCTCCGC AAGTCGGGAC CACGCCCGCG CCTTCATCCA GGCGACACGA AAGGGTTATG CCTACTCCGT CGACCATCCC GACGAAGCCT GCGACCTGTT GATCTCTGGA AGCAACGGCG CACTGATGAA TACGGAACTG GTAAAGGCTT CTCAGAAGGC ATTGATCGAG GGCCACTTCC TGAAATCCGA GGCCGGTGTG ATCGGTAAGC TCGACCCGGC AAAGGCCGAG GCCCTGGGTG GCTTCCTGAT CGAGAATGGT ATTCTGGTCG ATGCGAATGG CGCCGCACTC AAGGAGAAGC CGGACTTTTC CACCTATTAT ACCAACGAAC TTCTCGACTG A
|
Protein sequence | MLLLTRRQTI FAAIAASVAG RTAFAQSAPA KVRIALDWTP NTNHIGIYVA KAKGFYADAG LDVEILPFTD TSAGTLVSNG VADFGISSEI ETLTQRAGGG DVKMVYGVVQ TETARLIFKG GRDDIKSPKD LDGKTYGGFG GTWESALISA MIRNDGGKGD VKTVTLGTSA YEALDNGSID FTLEIYTWEG IAAELENRKI GRFHYSDYGI PDEQTTVIVS SDAYLSASRD HARAFIQATR KGYAYSVDHP DEACDLLISG SNGALMNTEL VKASQKALIE GHFLKSEAGV IGKLDPAKAE ALGGFLIENG ILVDANGAAL KEKPDFSTYY TNELLD
|
| |