Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4047 |
Symbol | |
ID | 6982818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 4221410 |
End bp | 4223218 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643398777 |
Product | hypothetical protein |
Protein accession | YP_002283535 |
Protein GI | 209551618 |
COG category | [S] Function unknown |
COG ID | [COG5616] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.136047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGATC TTGTCGTGGG GGCGATCATG AGAAACGCGA CAAACGTCAA TCTATTCAGC AGTCTTGGGC CGACGGCTGA GGAAATCTGC CGGCAGGCCG AGAGAATTCT CGCAAGTGAA GAATTTCACG CGCCGCAGCG CGGCCGAAAT TTTCTGGAGT TTGTCGTCAA CGAGACGCTC GCCGGCCGAT CTGGCTTCCT GAAGGCGTTC ACCATCGCAA ATGTGGTTTT TGGCAGGGAA GCGTCCTTCG ATCCGCAGAA TGATCCGGTC GTCCGGATCG AGGCCGGCCG GATACGAAAG GCCCTGGAAC GGTATTATCT CGTCGCGGGC CAGGCGGACG AGGTCATTAT TACGATGCCG AAAGGCGGAT ATGTCCCGCA TTTCGAATAT GTCCGCGATG CGGCGATGGC GCCGCCGCTG AACGAGCCGG AAAATGCTCG GATCGCAGAC CTCGCGCATC AAGCTCATCC GCTTCCCGAG GCGCACCCTC CGACGCTGCC GGCGGGTCGC GGCCGCGGGA TCGCTCTACC GGCCATCTTC GTTCTTCTTC TCCTCGCATT GGTGTCTGCA CTTTTTATTG CCCGAAGTGT CCCGGTGCGA GCGCCGCCGC CGGCAGCCGC CGTGCCGACG GTTGCCGTCG AGGTGTTCGC CGAAAGCAGC TCCGTCGATT CCAGGGCGGA TATCGCCCGC GGCCTGAGGG ATGATATTAT CGGCCAGCTT GCTGAGCGCG ATGAGGTCAT CGTCGTCGCC GATCCGTCGA CGGGTGATCG TGCCGTTGCT GCCGACTACG CGCTGCAAGG CAACATCCAG ATGGACGGCA GCAGGTTACG TTCGGTCGCC AGGCTGGTGC GCCAAAGGGA CGGGGTCGTC ATCTGGGCCG ATAATTTCGA TGCCGATTTT CGCGCTCAAA ACAAGCTTGG AATTCAAGCA AACGTCGCCC GGCAAATCGC CGGTGCGATA GCGCAGCCCC ATGGCGCGAT CTTTCAGGCC GAGCCAGCGA TGATCGCGCG GTCGGGCCTA AAGGCAGACC AAAATGCTGA AGCCTGCACC CCGGCCTATA ACAGCTATCT CCAGACGATG ACTGCGCAAA GCCATGGTGT CGTGCGCGAG TGCCTGCGAC AGGCAACCCA GCGTAACCCT GATAGCGCGA CGTCCTGGGC GCTCTTGTCC CTTGTCTATC TTGACGAAGT GCGATTTCGC TACAGGCTCG GCACCCCATC TTCGGCCGAA CCTCTCGAAT TGGCGAATGC TTCAGCGCAA CGGGCGGCAT CGCTCGCGCC GGACAATACC CGCGTGCTCC GCGCCGTTAT GCTGGTGAAT TTCTTCCGGG GAGATATCGA CAAAGCACTG GCGGCTGGAA CGGCGGCCTA CGCTGCAGAT CCCGACGATG TGGAAGTTGC GGGCGAGTAC GGTTTGCGTT TGGCGATGGC GGGGAAATGG CAATCGGGCT GCGAGTTGGT TTCGATCGCA TTCGACAAAA ATGTCGGCCC CAGAGGCTAT TACGCGGGCG GCATGGCGAT GTGCGCCTTT ATGCGGGGCG ATATCGACGC GGCGGAACAA TGGTCGAGAA TATCCGATCT CGACTTCAAT CCCATGCGCC ACCTTGCCCT GCTGTCCATT CTCGGGGCAG CGGGCAAAAC GGCTGAGGCC AAGCTGGAGC AGGATTGGCT CCTTGCCAAC GCGCCGGCAT TGATGACGAA CATCCGCCAG GAAATTTCCC TACGCTTGCA GCGGCCGGAG GATCAGGAAA GGGTCCTTGC GGGACTGCGG GCCGCAGGCG TCGCCATTGA CCTGCCGTCG GGAGGATAA
|
Protein sequence | MRDLVVGAIM RNATNVNLFS SLGPTAEEIC RQAERILASE EFHAPQRGRN FLEFVVNETL AGRSGFLKAF TIANVVFGRE ASFDPQNDPV VRIEAGRIRK ALERYYLVAG QADEVIITMP KGGYVPHFEY VRDAAMAPPL NEPENARIAD LAHQAHPLPE AHPPTLPAGR GRGIALPAIF VLLLLALVSA LFIARSVPVR APPPAAAVPT VAVEVFAESS SVDSRADIAR GLRDDIIGQL AERDEVIVVA DPSTGDRAVA ADYALQGNIQ MDGSRLRSVA RLVRQRDGVV IWADNFDADF RAQNKLGIQA NVARQIAGAI AQPHGAIFQA EPAMIARSGL KADQNAEACT PAYNSYLQTM TAQSHGVVRE CLRQATQRNP DSATSWALLS LVYLDEVRFR YRLGTPSSAE PLELANASAQ RAASLAPDNT RVLRAVMLVN FFRGDIDKAL AAGTAAYAAD PDDVEVAGEY GLRLAMAGKW QSGCELVSIA FDKNVGPRGY YAGGMAMCAF MRGDIDAAEQ WSRISDLDFN PMRHLALLSI LGAAGKTAEA KLEQDWLLAN APALMTNIRQ EISLRLQRPE DQERVLAGLR AAGVAIDLPS GG
|
| |