Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2473 |
Symbol | |
ID | 8013450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2472101 |
End bp | 2473285 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644825054 |
Product | protein of unknown function DUF1501 |
Protein accession | YP_002976284 |
Protein GI | 241205188 |
COG category | [S] Function unknown |
COG ID | [COG4102] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTGC CCATGAACCG GATTTCGCTG TCCCGCCGCG GCTTTCTGAC CTCTGCCTGC TGTCTTGCCG CTGCCCCCGC CTTCACGCCG GTCACTTTCG CGGCGATGCC GGGTGACAAG CGTTTCGTCA CCATCGTGCT GCGCGGCGCG ATGGACGGGC TGGATCTGGT GCAGCCCTAT GGCGATGCCG GCTTTGCGGC GCTTAGGCCG ACACTGGCGC TGACGCCCGA TACCGGACTT CTCGATCTAG ATGGCCATTT CGGCCTCAAT CCGGCTGCCG CAGAGCTGAT GCCGCTGTGG AAGAGCCGCG AGCTTGCCTT CGTGCACGCG GTGTCGACGC CCTACCGCGA CCAGCGCAGC CATTTCGACG GGCAGGACAT GCTGGAATCC GGCGGCGAGC ATGTCGCCGA GGAAAAGACC GGCTGGCTGA ACCGGGCGCT CGCCGTCATT CCGCGCTCGG ATGCGCGCAA GGCGATCGAC ATCAACACTT CGACGGAGCT GATCCTCTCC GGACCCAACA ATGTCGATGT CTGGGCGTCG GATTCCAATC TGGCGCCGGC GCGTGACGAG ATGCAGTTCC TGGCGCGGCT CTATGCCGGC GATCCGCCGT TCGCCGAGGC GCTTGCCGAG GCGACGCGGG CCAATAGCGC CTCGATGATC ATCGAGCCGG AGGGCCAGCG CGGCGCAAAG ATCGCCGATG TGGCGGCGCT CGCGGCCAAC ATGCTGAAGG GCGATTACCG CATCGCCAGC TTCTCGATAT CAGGCTGGGA CACGCATATC GGCCAGGCCG GCCAGTTCAA GCGGCCGGTG CAGGACCTTT CGCAGGCGAT CAATACGTTG AAGACCACGC TCGGGCCTGA GATCTGGGCA AAGACGGTGG TGCTTGCCAT GACCGAGTTC GGCCGCACTG TGCGTCAGAA CGGCTCAGCC GGCACCGACC ACGGCACCGG CGGCTGCGCG CTGCTATCAG GCGGCACCAT CAACGGCGGC CGCATCCTCG GCCGTTGGCC GGGCATCGGC GACGGCCAAC TGCTCGACGA CCGCGACCTG ATGCCGACCG CCGACGTGCG CGAGCTCGCC GCGGCAATGC TCTACCGGCA GTTCGATGTA AGCGCCGATG ATTTGACCGG GAAGATCTTT CCGGGGCTGG GGTTCGACAA AGGGTCGCAG TTTCTGCGTG GGTGA
|
Protein sequence | MTLPMNRISL SRRGFLTSAC CLAAAPAFTP VTFAAMPGDK RFVTIVLRGA MDGLDLVQPY GDAGFAALRP TLALTPDTGL LDLDGHFGLN PAAAELMPLW KSRELAFVHA VSTPYRDQRS HFDGQDMLES GGEHVAEEKT GWLNRALAVI PRSDARKAID INTSTELILS GPNNVDVWAS DSNLAPARDE MQFLARLYAG DPPFAEALAE ATRANSASMI IEPEGQRGAK IADVAALAAN MLKGDYRIAS FSISGWDTHI GQAGQFKRPV QDLSQAINTL KTTLGPEIWA KTVVLAMTEF GRTVRQNGSA GTDHGTGGCA LLSGGTINGG RILGRWPGIG DGQLLDDRDL MPTADVRELA AAMLYRQFDV SADDLTGKIF PGLGFDKGSQ FLRG
|
| |