Gene Information |
Locus tag | Rleg2_5631 |
Symbol | |
ID | 6977022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 17584 |
End bp | 18714 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643393088 |
Product | hypothetical protein |
Protein accession | YP_002277906 |
Protein GI | 209546016 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02308] RNA ligase, T4 RnlA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.280064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGACC GCATACACCC TGCACGCGAA ATCCCGTTTC CCGACCTTAT CGCTGGCTTG AAGCGAGCCC AAGGGCTTGG CCATGTCCAT CGCCGTCAGA ACGCAACCGG TACTTTGCAG CTCTACATCT ATACCCCCCG GTGCGTATAT GAGGATGGTT GGGATCAGTT TTCGCTGATC GCTCGCGGTC TCATTGTGGA CGAGGGCGCT GGTCGGGTCG TTGCCACGCC GTTTCCGAAG TTTTTCAATG TCGGCGAGCG GCATGGCGAA GTGCCCGATC TGCCGTTTGA GGCGTTCGAA AAGCTCGATG GTAGTCTGAT AATCGTGTTC AATGATGCTG GCCGTTGGCA CGCAGCCACC AAAGGCGCGT TCGACTCCGA ACAGGCCCTA TGGGCTCAAG CACGCTTGGA TGCCCACGAT CTCTCCGGTC TGTCGCCGGA TACGACATAT CTGTTCGAGG CGGTATATCC GGAAAACAGA ATTGTCGTGC GATATGCGGA GCCTGCCATG GTGATGTTGG CGGCCTACCA CGCTTCAGGT CTTGAAGTAA CCTACGACGA GGTTCGAACG ACCAGCCAAG CGTTGGGATG GCGTGCGGCC GAACGCCATG AGTTCGGGAA TATGGCGGAC ATGATGCTCC ATACTGCAAC GCTCCCACGC GACAACGAGG GGTTTGTCGT TAGATTCACA AATGGCTTGC GCCTCAAACT CAAAGGCTCC GAGTACCGTC GTATCCATGC GTTGATCTCA CGCTGCACGC CATTGGCAAT GTGGGAAGCA ATGGCCGCTG GGGACGACAT GGCCGCGATT CGTCGTGATT TGCCGGAAGA GTTTTGGAGC GATTTTGACA ACATCGTACG CCTCCTGACG AAGGAATACG CGGCGATGGA AAGGAAGGTC GCTGCACTGG CAGCATCTGT CGCCCATCTT TCCGATAAAG AGTTGGGATT GTCGCTCAAT TCACTGCCTG CTGACGTGGG TCCTTACGTT TTTGGCTTGC GAAAAGCAGG TGCAATCGCA GGTAAGTCCC GAGACGCGTT GATGCGTTCC ATCAGACCCA CTGGCAACGT GTTGCCAGGT TACCAGCCGT CATATGCCAT GGGGCGTGTG ATTGATGAGG CAACATCGTA G
|
Protein sequence | MNDRIHPARE IPFPDLIAGL KRAQGLGHVH RRQNATGTLQ LYIYTPRCVY EDGWDQFSLI ARGLIVDEGA GRVVATPFPK FFNVGERHGE VPDLPFEAFE KLDGSLIIVF NDAGRWHAAT KGAFDSEQAL WAQARLDAHD LSGLSPDTTY LFEAVYPENR IVVRYAEPAM VMLAAYHASG LEVTYDEVRT TSQALGWRAA ERHEFGNMAD MMLHTATLPR DNEGFVVRFT NGLRLKLKGS EYRRIHALIS RCTPLAMWEA MAAGDDMAAI RRDLPEEFWS DFDNIVRLLT KEYAAMERKV AALAASVAHL SDKELGLSLN SLPADVGPYV FGLRKAGAIA GKSRDALMRS IRPTGNVLPG YQPSYAMGRV IDEATS
|
| |
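The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon. The function names (`gene_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 17584
END_BP = 18714
PROTEIN_LENGTH_AA = 376


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of residues for an unspliced CDS with one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                           # 1131, matches Gene Length
    print(expected_protein_length(length_bp))  # 376, matches Protein Length
```

Passing the Gene sequence field to `gc_percent` (the 10-base block spacing is stripped automatically) gives a value to compare against the reported GC content, which the record rounds to a whole percent.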