Gene Rleg_4023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4023 
Symbol 
ID8014829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4099996 
End bp4101630 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content67% 
IMG OID644826592 
ProductHemY domain protein 
Protein accessionYP_002977803 
Protein GI241206707 
COG category[S] Function unknown 
COG ID[COG3898] Uncharacterized membrane-bound protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.412219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.920435 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATCC GCCTTGTCGT CTTCGCCCTC TTCGTGCTGC TTCTTGCCTA TGGCTTCTCC 
TGGCTCGCCG ATCGTCCCGG CGACCTCTCG CTGATCTGGG AAGGCCGGAT CTACCAGACG
AAGCTGATCG TCGCCGCCAG CGCGATCATC GCCCTCGTCG CCGCCGTCAT GATCGCCTGG
TGGTTCGTCC GTCTCGTCTG GACCTCGCCG CATTCGGTGA CGCGTTATTT CCGCGCCCGC
AAGCGGGACC GCGGTTATCA GGCGCTGTCG ACCGGCCTGA TTGCTGCCGG CGCCGGCAAT
GCGCTGCTCG CCCGCAAGAT GGCGGCCCGC TCGCGCGGCC TGATCCGCGC CGATCAGGAA
CCGCTGATCA ACCTGCTCGA GGCCCAGGCC GCCCTGATCG AAGGTCGCCA TGACGAGGCG
CGCGCCAAGT TCGAGGCCAT GGCCAACGAT CCCGAGACGC GCGAACTCGG TCTGCGCGGC
CTCTATCTGG AAGCCCGCCG TCTCGGGGCC AACGAGGCCG CCCGCCAATA TGCCGAAAAG
GCGGCCGACA ACGCGCCATA TCTGCCCTGG GCCGCACAGG CGACGCTCGA ATATCGCAGC
CAGGCCGGCC GCTGGGACGA TGCGATCCGC CTGCTCGAAC AGCAAAAGGC TGCCCGCGTC
GTCGAAAAGG CCGAAGCCAA CCGCCTGCAC GCCGTCCTTC TGACGGCGCG CGCCGGCGAG
AAGCTGGAAA GCAACCCGAC GGGTGCCCGC GACGATGCGC TGCAGGCGCT GAAGCTTGCC
GCCGATTTCA TTCCGGCGGC CCTCATTGCC GCAAAAGCGC TGTTTCGCGA AGGCGGCGTG
CGCAAGGCCG CCTCGATCCT CGAACAGGCA TGGAAATCCG CACCTCATCC TGAGATCGGA
CAAGCCTATG TGAGGGCCCG CAGCGGAGAT TCCACGCTCG ACCGGCTGAA GCGCGCTGAG
CGGCTGGAAG GGCAGCGCCC GAACAACGTC GAATCTCTTC TCGTCGTCGC CCAGGCAGCC
CTCGACGCGC AGGAATTCGC CAAGGCGCGC GCCAAGGCGG AAGCGGCGGC CCGCATGCAG
CCGCGTGAAG CCGCCTACCT GCTGCTGGCA GACATCGAAG AAGCCGAAAC CGGAGACCAG
GGTCGCGTGC GCCATTGGCT GGCCCAGGCG CTCAAGGCGC CGCGCGATCC GGCCTGGGTT
GCAGACGGCT TCGTGTCCGA CAAGTGGCTG CCGGTATCGC CGGTGACCGG CCGTCTCGAT
GCCTTCGAGT GGAAGGCGCC CTTCGGCCAG ATCGAGGGTG CGCTCGAAGA CGGTTCGGCG
CCGGCCTCGA TCGAAACGGC TTTGAAGACG TTGCCGCCGC TGCGTGACGT CAGGCCGGAA
AGCCCGGTTA ACGACCATCG CATCATTGAG CTGGAACGCG CCGCGACGAT TGCCGAGGCT
GTGCGCCCCA CACCAGCACC GGCACCAGCA CCAGCACCGA CATCGGCAAA ACCGAAACCC
GTCGAACCGG CCGTAAGCGA TAAGGCGCCC GCACCGAGCG AGGCAAAACC TTTCTTTGGC
GGACTGCCGG ATGATCCCGG CGTTCGCGAT CCCAGGGTGG AACCGGAACC CAAGACACGG
CTCCGCCTTT TTTGA
 
Protein sequence
MLIRLVVFAL FVLLLAYGFS WLADRPGDLS LIWEGRIYQT KLIVAASAII ALVAAVMIAW 
WFVRLVWTSP HSVTRYFRAR KRDRGYQALS TGLIAAGAGN ALLARKMAAR SRGLIRADQE
PLINLLEAQA ALIEGRHDEA RAKFEAMAND PETRELGLRG LYLEARRLGA NEAARQYAEK
AADNAPYLPW AAQATLEYRS QAGRWDDAIR LLEQQKAARV VEKAEANRLH AVLLTARAGE
KLESNPTGAR DDALQALKLA ADFIPAALIA AKALFREGGV RKAASILEQA WKSAPHPEIG
QAYVRARSGD STLDRLKRAE RLEGQRPNNV ESLLVVAQAA LDAQEFAKAR AKAEAAARMQ
PREAAYLLLA DIEEAETGDQ GRVRHWLAQA LKAPRDPAWV ADGFVSDKWL PVSPVTGRLD
AFEWKAPFGQ IEGALEDGSA PASIETALKT LPPLRDVRPE SPVNDHRIIE LERAATIAEA
VRPTPAPAPA PAPTSAKPKP VEPAVSDKAP APSEAKPFFG GLPDDPGVRD PRVEPEPKTR
LRLF