Gene Information |
Locus tag | Rleg_4847 |
Symbol | |
ID | 8007235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 223593 |
End bp | 225245 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644821777 |
Product | transferase hexapeptide repeat containing protein |
Protein accession | YP_002973037 |
Protein GI | 241113202 |
COG category | [R] General function prediction only |
COG ID | [COG0110] Acetyltransferase (isoleucine patch superfamily) |
TIGRFAM ID | |
|
|
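The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the stated gene length includes the stop codon while the protein length does not. Neither convention is stated in the record; they are inferred because they reproduce the listed numbers. The function names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 223593
END_BP = 225245


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1653, matching "Gene Length"
print(protein_length(length_bp))  # 550, matching "Protein Length"
```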
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0521363 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCACG TCAATAGCCC GCAATCGTCA GTGAGCGAAG AGAAGGAAGC GCGCAGGCTT CAATACCTGA CCTGGGAACA CATTGCGTCT GACCTCCGTC ATCCCACTCA CCTCGCCCGC AAGGCGGAAC TCAGGCGATC ATGCAGTGCG GAACTGGCCG AGACGTCCTA CATTGCCGAG CATGCCGCAA TCTTCACCGA AAGCCTGACG ATGGGCGAGC GGTCCTGGAT CGCCGGGCAC GCGCTCGTTC GCGGCCATGT GATCCTCGGC GACGATTGCA CCATCAATCC CTATGCCTGT GTTTCCGGAA CGGTGACGTG CGGTCATGGC GTTCGGATTG CTTCGCATGC ATCGATCGTC GGCTTCAATC ATGGCTTCGA CGATCCGACC ATTCCTATAC ACCGCCAGGG CGTCGTCAGC ATCGGCATCG CGATCGGCGA CGATGTCTGG ATCGGCGCAA ATTGCGTGAT CCTCGATGGC GCAACAATTG GAAACGGTGC GGTGATCGCC GCCGGCGCTG TGGTCACGGG GGACATTCCC GCCATGGCAA TTGCCGGTGG CGTGCCCGCC CGGGTGCTGC GAAGCCGAGG CTCGGCGCCG ACGAAAACCG GCACCGGCGA CATCGAAGAT CAATTGGTGA GGCTCGGCCA GAAAGCGAAA GACCAGTGGC CGGACATCCT TGCACGCTGG AAAACGCGAG GGTCCTATGA ATCGCTGGAA GCGGACGGCA TCCGCAGACC GGCGATCCGG CACCTCTGCG ATGCAATCGA GATCGCTGCC GGCTTCGGCC ACCTGCCGCC CGATCTCGAT GCGGCGGAGA CCGTCGAGCG TCTCCAAGGT CTTCAGGACC GAGAGACCGG CCTTTTCCCG GAAGGACATT CGCGCATCCT TGGCAAGGCG CTGAGGGATG ATCCAAAGGC GCTCTATAAC GTCCTTGCGG TTGGCTATGC ACTTGAACTG CTTGGTTCAG GTCCGCGCCA ACCCGTCCAC GCAGTCGAGC TCGAGGCCGG GGAACTGGAT GAATGGCTGA GCGCCCTGCC CTGGTCGACC CGGGCATGGC ACGCCGGAAG CGTGGTCGAT GCGATCGGAA CTGCCATGTA CTTCAATGCG AAGTCTTTCG GCATCAGGCA TTCACGGCAG GCGCTCTTCG AATGGCTAAG CCGCAATGCC AACAGCGTTT CGGGGTTGTG GGGTGAACCG ACCGCGGCGG AAGGATGGCT TCAACCGGTG AACGGCTTTT ATCGCCTGAC GCGCGGCACC TACGCCCAGT TCGGCGTGGC ACTTCCCCAC CCGCACGCCT CACTCGAAAC GGTTCATCTC AACTATCGCA ACCACAAGGG CTTCGTTGCT GCAAAATACA ATGCGTGCAA CCTGCTCGAT ACGATTCATC CTCTGCTGCT GATTGCCCGG CAGACCGACT ACAGACGGGC CGACGGCGAG GCGATCGCCC GCAAGGTCAT CTCAAGGGCG CTGGATAGAT GGCGGGATGG CGAAGGATTC CCGTTTGCCG ATGGTGGTGA ACCGAGCTTG CAGGGGACGG AAATGTGGCT TTCCGTCATT CACCTGGCGG CCGATTTTCT CGGCCTGTCA GATCGCTTCG CCTTCGTCCC GAAAGGCGTT CACCGGACGG CAACCGTCGG GCTGGGTTTG TGA
|
Protein sequence | MDHVNSPQSS VSEEKEARRL QYLTWEHIAS DLRHPTHLAR KAELRRSCSA ELAETSYIAE HAAIFTESLT MGERSWIAGH ALVRGHVILG DDCTINPYAC VSGTVTCGHG VRIASHASIV GFNHGFDDPT IPIHRQGVVS IGIAIGDDVW IGANCVILDG ATIGNGAVIA AGAVVTGDIP AMAIAGGVPA RVLRSRGSAP TKTGTGDIED QLVRLGQKAK DQWPDILARW KTRGSYESLE ADGIRRPAIR HLCDAIEIAA GFGHLPPDLD AAETVERLQG LQDRETGLFP EGHSRILGKA LRDDPKALYN VLAVGYALEL LGSGPRQPVH AVELEAGELD EWLSALPWST RAWHAGSVVD AIGTAMYFNA KSFGIRHSRQ ALFEWLSRNA NSVSGLWGEP TAAEGWLQPV NGFYRLTRGT YAQFGVALPH PHASLETVHL NYRNHKGFVA AKYNACNLLD TIHPLLLIAR QTDYRRADGE AIARKVISRA LDRWRDGEGF PFADGGEPSL QGTEMWLSVI HLAADFLGLS DRFAFVPKGV HRTATVGLGL
|
| |
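The relationship between the two sequences above can be illustrated by translating the first and last few codons of the gene sequence. This is a minimal sketch, not the pipeline the database uses: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The gene sequence in the record begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand; the `reverse_complement` helper is included only to show how a minus-strand slice of the replicon would be converted, and all function names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse-complement an uppercase DNA string (for minus-strand slices)."""
    return dna.translate(_COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First 30 nt and last 15 nt of the gene sequence in the record.
print(translate("ATGGATCACG TCAATAGCCC GCAATCGTCA"))  # MDHVNSPQSS
print(translate("GGGCTGGGTTTGTGA"))                   # GLGL*
```

The first call reproduces the opening ten residues of the protein sequence, and the second reproduces its final four residues followed by the stop codon.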