Gene Information |
Locus tag | Rleg_1659 |
Symbol | |
ID | 8012728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1651205 |
End bp | 1652251 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644824244 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002975485 |
Protein GI | 241204389 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
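
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions agree with the values in this record, but the record does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Rleg_1659.
length_bp = gene_length(1651205, 1652251)
print(length_bp)                  # 1047
print(protein_length(length_bp))  # 348
```

Both results match the Gene Length (1047 bp) and Protein Length (348 aa) fields.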
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.278996 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGAGA TTGACGGGAT CATCTATTAC GTGGAGTATG CATTTTATAA CCTATCGTCC GAGATTCCCA TGGACCCCTT CGATTCCGTC CTCAGCGCCA TGCAGCTCCA AAGCTCGCTC TTCGTCCGCA TGCGTGCTCA TGCACCATGG GCGATGTCGT TCGATAGCGG CGGTCAGGCG CGGCTGATCG TCATCGCTAA GGGCCGGGGC TGGTTCACCC AAGTCGGCCA CCCCCCGGTC GTTGTCGAGG AAGGCGACTG CCTCATCATC AAGCAGGGGG TCATGGGCAT ATTGGGCGAC GCTCCGGACC GGGTCGCAGT GCCCTGCTGG CAGATTGCCG ACCATGTGAC GGGCGAGACG GTGTCCTTCG GTGGAGACGG CGAAGCGTGC GAGTTCTTCT CGACCCTGTT TACGTTCGAC CACGCTGCGG GCGAGCCCTT ATCGGCGCTG TTGCCCGATG TTGTTCATGT CGCCATGGCG AAGTCCGACG CAGGGCGGAT GGTCTCGATC CTCGAACAGA TCGGAAAAGA GGAGGCGCAG GCGTCGCTTG GCGGCTCCTA TGTCGTCGGC AGGCTGCTCG ACGTGCTGTT CATCCAAGCG ATCCGAAGCT GGGCCAGTTC GGAGGGGAAT ATGCCCGAGG GCTGGCTCGC CGGACTGACC CATCGCCAGT TGGCGCAAAC GCTGCACCGG ATTCATGCCG ATCTGGCGCA CCCGTGGACG CTGGAGCAGC TCGCCCGCGA TGTGGGGATG TCGCGCTCCA CCTTTGCAGT GCTGTTCAAG TCGGTCGTCG GAGTGCCGCC GCTGACCTAC ATCACGACTT GGCGCATTTA TCGTGCGAAA CTCATTCTCG CCGCCGGCCA CTCAATCTCA GCGGCGGCCG CGCAGACCGG CTATGGCACC GACATCGCCC TCAGCCGCGC TTTTAAAGCT GCGACCGGCG TGGCGCCGGG GCAGTGGCGG CGCGAGCGAC GTGGCGTCGA CCGTCCCGTT CCCAGTGGAG ATCGATCAAG GGCGCCGGTC AGGCACCCTG TTCCGGCTGA TTTGTAA
|
Protein sequence | MVEIDGIIYY VEYAFYNLSS EIPMDPFDSV LSAMQLQSSL FVRMRAHAPW AMSFDSGGQA RLIVIAKGRG WFTQVGHPPV VVEEGDCLII KQGVMGILGD APDRVAVPCW QIADHVTGET VSFGGDGEAC EFFSTLFTFD HAAGEPLSAL LPDVVHVAMA KSDAGRMVSI LEQIGKEEAQ ASLGGSYVVG RLLDVLFIQA IRSWASSEGN MPEGWLAGLT HRQLAQTLHR IHADLAHPWT LEQLARDVGM SRSTFAVLFK SVVGVPPLTY ITTWRIYRAK LILAAGHSIS AAAAQTGYGT DIALSRAFKA ATGVAPGQWR RERRGVDRPV PSGDRSRAPV RHPVPADL
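
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that formatting and run basic checks on the gene sequence (GC percentage, and whether the length and final codon look like a complete CDS) follows. The helper names are hypothetical. The stop codon set is the one used by translation table 11. The example input is only the first two blocks of the gene sequence, not the whole gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence shown above (20 bases).
fragment = clean_sequence("ATGGTGGAGA TTGACGGGAT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 50.0
```

Applied to the full gene sequence, `gc_percent` gives a value to compare against the GC content field, and `looks_like_complete_cds` checks that the sequence is a whole number of codons ending in a stop codon.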