Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4023 |
Symbol | |
ID | 8014829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4099996 |
End bp | 4101630 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644826592 |
Product | HemY domain protein |
Protein accession | YP_002977803 |
Protein GI | 241206707 |
COG category | [S] Function unknown |
COG ID | [COG3898] Uncharacterized membrane-bound protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.412219 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.920435 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCC GCCTTGTCGT CTTCGCCCTC TTCGTGCTGC TTCTTGCCTA TGGCTTCTCC TGGCTCGCCG ATCGTCCCGG CGACCTCTCG CTGATCTGGG AAGGCCGGAT CTACCAGACG AAGCTGATCG TCGCCGCCAG CGCGATCATC GCCCTCGTCG CCGCCGTCAT GATCGCCTGG TGGTTCGTCC GTCTCGTCTG GACCTCGCCG CATTCGGTGA CGCGTTATTT CCGCGCCCGC AAGCGGGACC GCGGTTATCA GGCGCTGTCG ACCGGCCTGA TTGCTGCCGG CGCCGGCAAT GCGCTGCTCG CCCGCAAGAT GGCGGCCCGC TCGCGCGGCC TGATCCGCGC CGATCAGGAA CCGCTGATCA ACCTGCTCGA GGCCCAGGCC GCCCTGATCG AAGGTCGCCA TGACGAGGCG CGCGCCAAGT TCGAGGCCAT GGCCAACGAT CCCGAGACGC GCGAACTCGG TCTGCGCGGC CTCTATCTGG AAGCCCGCCG TCTCGGGGCC AACGAGGCCG CCCGCCAATA TGCCGAAAAG GCGGCCGACA ACGCGCCATA TCTGCCCTGG GCCGCACAGG CGACGCTCGA ATATCGCAGC CAGGCCGGCC GCTGGGACGA TGCGATCCGC CTGCTCGAAC AGCAAAAGGC TGCCCGCGTC GTCGAAAAGG CCGAAGCCAA CCGCCTGCAC GCCGTCCTTC TGACGGCGCG CGCCGGCGAG AAGCTGGAAA GCAACCCGAC GGGTGCCCGC GACGATGCGC TGCAGGCGCT GAAGCTTGCC GCCGATTTCA TTCCGGCGGC CCTCATTGCC GCAAAAGCGC TGTTTCGCGA AGGCGGCGTG CGCAAGGCCG CCTCGATCCT CGAACAGGCA TGGAAATCCG CACCTCATCC TGAGATCGGA CAAGCCTATG TGAGGGCCCG CAGCGGAGAT TCCACGCTCG ACCGGCTGAA GCGCGCTGAG CGGCTGGAAG GGCAGCGCCC GAACAACGTC GAATCTCTTC TCGTCGTCGC CCAGGCAGCC CTCGACGCGC AGGAATTCGC CAAGGCGCGC GCCAAGGCGG AAGCGGCGGC CCGCATGCAG CCGCGTGAAG CCGCCTACCT GCTGCTGGCA GACATCGAAG AAGCCGAAAC CGGAGACCAG GGTCGCGTGC GCCATTGGCT GGCCCAGGCG CTCAAGGCGC CGCGCGATCC GGCCTGGGTT GCAGACGGCT TCGTGTCCGA CAAGTGGCTG CCGGTATCGC CGGTGACCGG CCGTCTCGAT GCCTTCGAGT GGAAGGCGCC CTTCGGCCAG ATCGAGGGTG CGCTCGAAGA CGGTTCGGCG CCGGCCTCGA TCGAAACGGC TTTGAAGACG TTGCCGCCGC TGCGTGACGT CAGGCCGGAA AGCCCGGTTA ACGACCATCG CATCATTGAG CTGGAACGCG CCGCGACGAT TGCCGAGGCT GTGCGCCCCA CACCAGCACC GGCACCAGCA CCAGCACCGA CATCGGCAAA ACCGAAACCC GTCGAACCGG CCGTAAGCGA TAAGGCGCCC GCACCGAGCG AGGCAAAACC TTTCTTTGGC GGACTGCCGG ATGATCCCGG CGTTCGCGAT CCCAGGGTGG AACCGGAACC CAAGACACGG CTCCGCCTTT TTTGA
|
Protein sequence | MLIRLVVFAL FVLLLAYGFS WLADRPGDLS LIWEGRIYQT KLIVAASAII ALVAAVMIAW WFVRLVWTSP HSVTRYFRAR KRDRGYQALS TGLIAAGAGN ALLARKMAAR SRGLIRADQE PLINLLEAQA ALIEGRHDEA RAKFEAMAND PETRELGLRG LYLEARRLGA NEAARQYAEK AADNAPYLPW AAQATLEYRS QAGRWDDAIR LLEQQKAARV VEKAEANRLH AVLLTARAGE KLESNPTGAR DDALQALKLA ADFIPAALIA AKALFREGGV RKAASILEQA WKSAPHPEIG QAYVRARSGD STLDRLKRAE RLEGQRPNNV ESLLVVAQAA LDAQEFAKAR AKAEAAARMQ PREAAYLLLA DIEEAETGDQ GRVRHWLAQA LKAPRDPAWV ADGFVSDKWL PVSPVTGRLD AFEWKAPFGQ IEGALEDGSA PASIETALKT LPPLRDVRPE SPVNDHRIIE LERAATIAEA VRPTPAPAPA PAPTSAKPKP VEPAVSDKAP APSEAKPFFG GLPDDPGVRD PRVEPEPKTR LRLF
|
| |