Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2021 |
Symbol | |
ID | 6980760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2082940 |
End bp | 2083920 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643396743 |
Product | hypothetical protein |
Protein accession | YP_002281531 |
Protein GI | 209549614 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAATT TTGCGCTTGA GGTCACCCGT CTGGGGTTTT CGGCTCTGCA GGCGATTTCG CCCGATTTGG CGGGCAGGGC GGCGTTCGGG CTGTTCTGCC GCACACCGTC GTCGCGGCCG AAGGGAGCAA GGGCGAAGGC GGCGCATGCC GCGGGCGCGG CCCGGCTGGC CGGCGCCGAA CGCTTCACCC TCAAGCTCGC CGGCGGGGCG AAAGCGCATG CCTATCGGCT GAATGGCGGG GCGCGGGGAA AACGCCGGCG TTTGCTGGTG ACGCATGGCT GGGGCTCGAG CGCCGACTAT ATGGCCGAGC TGGTTTCGAT GCTTGCGGCG ACCGGTGCGG AGGTGGTGGC GCTCGATTTT CCCGGCCACG GGCGCGCCGG CGGGCGATTC CTGCACATGG GCCTTGCGGT CGAGGCGATT GCGGCTGCCG AGGCGCGCTT CGGCGCCTTC GATGCGGTTG TCGGCCACTC CTTCGGCGGC GCGGCGCTGA TGGTCTCAGC GGCGGGGCTG CTGCCCGGGG TGGCGCCTTT GGCCTGCGAG CGGCTGGTGC TGATTGGCGC GCCAAGTGAG ATGGCCTGGC TGTTTACCGA TTTCGGCCGG ATGATTGGCC TTCGCCCGGC TGCGCAAGCG GCGCTGGAGA CTGAAGTCCA GCGCGTCACC GGCCGGAGAC TCGAGGAGTT CGACGCGGGC AATGCCGCGA GCGGCATCGG CCGGCCGGTG CTCGTCATCC ATGCCGAGGA CGACAAGGAG GTGCCGCCGG CCCATGCCAG GCGCTATCAG GCTGCCGGGA AGGACGTCCG GCTGCTCTGG GCGAACGGCT TCGGCCATCG GCGCATCGTC GGCGCGGCGC CCGTCCTTGC CGCGGTTGCG GCATTTCTCG ACGGCGAACG GGGCGAAGAG GGTGTTTCCG ACGAAAGCAT CAAAAAAGAT GCGGAGATCA TTCCGTTTTT TGAGCTTCCG GCGCGGCGCG CGGCATTGTA G
|
Protein sequence | MANFALEVTR LGFSALQAIS PDLAGRAAFG LFCRTPSSRP KGARAKAAHA AGAARLAGAE RFTLKLAGGA KAHAYRLNGG ARGKRRRLLV THGWGSSADY MAELVSMLAA TGAEVVALDF PGHGRAGGRF LHMGLAVEAI AAAEARFGAF DAVVGHSFGG AALMVSAAGL LPGVAPLACE RLVLIGAPSE MAWLFTDFGR MIGLRPAAQA ALETEVQRVT GRRLEEFDAG NAASGIGRPV LVIHAEDDKE VPPAHARRYQ AAGKDVRLLW ANGFGHRRIV GAAPVLAAVA AFLDGERGEE GVSDESIKKD AEIIPFFELP ARRAAL
|
| |