Gene Information |
Locus tag | Rleg_2604 |
Symbol | |
ID | 8013565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2600559 |
End bp | 2601839 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644825180 |
Product | protein of unknown function DUF442 |
Protein accession | YP_002976410 |
Protein GI | 241205314 |
COG category | [S] Function unknown |
COG ID | [COG3453] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01244] conserved hypothetical protein TIGR01244 |
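The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2600559
END_BP = 2601839
GENE_LENGTH_BP = 1281
PROTEIN_LENGTH_AA = 426


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1281
print(expected_protein_length(GENE_LENGTH_BP))  # 426
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows of the record.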
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.148627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCCG TGAAGGTCAA TGAGCTGATA TCGGTGGCGG GCCAGCCCGA CGCCGCAGGT TTTGCCGCCT TCGCGGCTGA TGGCTTTGCT GCCGTCATCA ATGCCCGGCC GGATGGCGAG GAGCCGGGAC AGCCGGGCAA TACGGCGGAA AAGGCTTCCG CCGCTGCCGC CGGGCTCGCC TACAGCTTCG TGCCGGTGAA GGGGACCGAA ATCACCGAGG CCGATATCTG CGCCTTCCAG ACGGCGATGG CCGAGGCCAA GGGACCGGTC GTCGCCCATT GCAAGAGCGG CACGCGGGCG TTGACGCTTT ATGCGCTGGG CGAGGTGCTC GACGGGCGGA TGAAGCCCGG AGATGTCGAG GCCTTCGGTC AAAACCTCGG TTTTGATCTT GCCGGCGCGC GACGCTGGCT GGAAAAGCGG TCAGGGCAGG TGGCTGATGT GAAGGCCTTC TTCGAGCCCC GCACCTGCAG TGTGCAATAT GTCGTTTCCG ACCCGGCAAC GAAACGCTGC GCCATCATCG ACCCGGTGCT CGATTTCGAC GAGATGTCGG GGGCGACGGG AACGGCCAAT GCAGATGCCA TCCTCGCTCA TATCGAAAGC GAAGGGCTGA CGGTCGAGTG GATCCTCGAC ACGCATCCGC ATGCCGATCA TTTCTCCGCC GCGCATTATC TGCATGAGAA GACCGGCGCG CCGACGGCGA TCGGCGCCCA TGTCACCGAC GTGCAGACGC TCTGGAAGGA GATCTACAAC TGGCCGGGGC TCGCGACCGA CGGCTCGCAA TGGGACCGGC TGTTTGCCGA TGGCGACACG TTCGAGATCG GTGCGCTTAA AGCCCGCGTG ATTTTTTCGC CCGGGCACAC ACTCGCCTCG ATCACCTATG TGATCGGTGA CGCCGCCTTT GTGCACGACA CGGTGTTCAC GCCGGATTCC GGCACGGCGC GCACGGATTT CCCGGGCGGC AGCGCTGCCG CCCTCTGGCA CTCGATCCAG GCCATCCTGT CGCTGCCCGA GGAGACCCGT CTCTTTTCCG GCCACGATTA CCAGCCCGGC GGCCGGCACC CGCGCTGGGA AAGCACGGTG GAGGCACAGA AGCGCGCCAA TCCGCATATT GCAGGCATCG ACGAGGCCGG CTTCGTGGCG CTGCGCCAGG CGCGCGATCG CACGCTGCCC AAGCCCAAGC TGATGCTGCA CGCGCTGCAG GTGAATATCC GCGGCGGGCG GCTGCCCGAG CCGGAGGGGA ATGGCAGGCG GTATCTGAAG ATACCGCTGG ATGCATTGTA G
|
Protein sequence | MTSVKVNELI SVAGQPDAAG FAAFAADGFA AVINARPDGE EPGQPGNTAE KASAAAAGLA YSFVPVKGTE ITEADICAFQ TAMAEAKGPV VAHCKSGTRA LTLYALGEVL DGRMKPGDVE AFGQNLGFDL AGARRWLEKR SGQVADVKAF FEPRTCSVQY VVSDPATKRC AIIDPVLDFD EMSGATGTAN ADAILAHIES EGLTVEWILD THPHADHFSA AHYLHEKTGA PTAIGAHVTD VQTLWKEIYN WPGLATDGSQ WDRLFADGDT FEIGALKARV IFSPGHTLAS ITYVIGDAAF VHDTVFTPDS GTARTDFPGG SAAALWHSIQ AILSLPEETR LFSGHDYQPG GRHPRWESTV EAQKRANPHI AGIDEAGFVA LRQARDRTLP KPKLMLHALQ VNIRGGRLPE PEGNGRRYLK IPLDAL
|
| |
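The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to join such blocks, compute a GC fraction, and translate a coding sequence. It assumes the listed gene sequence is already the coding strand (it begins with ATG and ends with TAG) and uses the amino-acid assignments of the standard genetic code, which translation table 11 shares. Alternative start codons are not handled, and all function names are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
cds_start = join_blocks("ATGACATCCG TGAAGGTCAA TGAGCTGATA")
print(translate(cds_start))  # MTSVKVNELI
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content row.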