Gene Rleg2_3018 details


Gene Information

Locus tag: Rleg2_3018
Symbol:
ID: 6981763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3078114
End bp: 3079163
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 61%
IMG OID: 643397728
Product: Haemin-degrading family protein
Protein accession: YP_002282511
Protein GI: 209550594
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3720] Putative heme degradation protein
TIGRFAM ID:
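
The coordinate and length fields in this record agree with each other, which a little arithmetic can confirm. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record itself states neither assumption. All variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3078114
end_bp = 3079163

# Assumption: coordinates are inclusive at both ends, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1050, matching "Gene Length"
print(protein_length)  # 349, matching "Protein Length"
```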


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0282112
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGAAC AGACAAGACC GGCGCCAGCC GAAATCCGGG CGTTTCGCGC CGAAAATCCG 
AAGATGCGCG AGCGCGATAT CGCCGCCCAG TTGAAGATTT CCGAGGCAGC CCTCGTCGCC
GCCGAAACCG GCATCAGCGT GACCCGCATC GATGGCAGCG CGCTGAAGCT TCTCGAACGC
GTGGCGAGCC TCGGCGAAGT GATGGCGCTG TCGCGCAACG AAAGTGCCGT GCACGAAAAG
ATCGGCGTCT TCGAAAACAT CAAAAGCGGC GTACAGGCCG CAATCGTTCT CGGCGAGAAT
ATCGACCTGC GCATCTTCCC GAGCCGATGG GAACATGGCT TCGCCGTATC CAAGAAGGAT
GGCGACCAGC TGCGCCTCAG CCTGCAATAT TTCGACAAGG CGGGCAACGC CGTGCACAAG
GTGCACCTGC GCCCGAATTC GAATGTCGAG GCCTATCACG CGCTGGTTGC CGAGTTGAAG
CTGGAAGACC AGTCGCAGGA CTTCGTCGAG GCCGAGACCG CAGATACCGT CGATGAAACC
GCCGACGTCA GCCGCGACGA GCTGCGCGAC AACTGGAGCA GGCTCACCGA CACGCATGAG
TTCTTCGGCA TGCTGAAGCG CCTGAAGATC GGCCGCCAGG CGGCCGTGCG CAGCGTCGGC
GACGACTATG CCTGGAAGCT CGACAGCAGC GCCACGGCGG AGATGATGCA TGCCTCGGTG
AAATCCGGCC TGCCGATCAT GTGCTTCGTC GCCAGTGACG GTGTCGTTCA GATCCATTCC
GGCCCGATCT TCAACGTCCA GACCATGGGC CCATGGATTA ATATCATGGA CCCAACCTTC
CATCTGCATC TGCGGCAGGA TCACATCGCC GAGACCTGGG CGGTGCGCAA GCCGACCAAA
GACGGCCACG TCACCTCGCT GGAGGCTTAC AATGCGCAAG GCGAGATGAT CATCCAGTTC
TTCGGCAAGC GGAAGGAAGG GTCCGACGAA CGCACCGAGT GGCGCGAGAT CATGGAAAAC
CTGCCGCGGG CAGCCAGTGT CGCCGCATAA
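
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The sketch below shows one way a GC percentage like the one in the GC content field could be computed. The function name `gc_content` is hypothetical, and the example runs only on the first 30 bases of the sequence, so the printed value describes that fragment and not the whole gene.

```python
def gc_content(sequence_text):
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    seq = "".join(sequence_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence in this record.
fragment = "ATGACTGAAC AGACAAGACC GGCGCCAGCC"
print(gc_content(fragment))  # 60.0 for this 30-base fragment
```

Passing the full sequence text, line breaks included, to the same function would give the value for the whole gene.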
 
Protein sequence
MTEQTRPAPA EIRAFRAENP KMRERDIAAQ LKISEAALVA AETGISVTRI DGSALKLLER 
VASLGEVMAL SRNESAVHEK IGVFENIKSG VQAAIVLGEN IDLRIFPSRW EHGFAVSKKD
GDQLRLSLQY FDKAGNAVHK VHLRPNSNVE AYHALVAELK LEDQSQDFVE AETADTVDET
ADVSRDELRD NWSRLTDTHE FFGMLKRLKI GRQAAVRSVG DDYAWKLDSS ATAEMMHASV
KSGLPIMCFV ASDGVVQIHS GPIFNVQTMG PWINIMDPTF HLHLRQDHIA ETWAVRKPTK
DGHVTSLEAY NAQGEMIIQF FGKRKEGSDE RTEWREIMEN LPRAASVAA
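
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It builds the codon table from the amino acid string of NCBI translation table 11, whose amino acid assignments are the same as those of the standard code. The function name `translate` is illustrative. Table 11 also allows alternative start codons, but this gene begins with ATG, so the sketch does not handle them.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna_text):
    """Translate DNA into protein, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored.
    """
    seq = "".join(dna_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence in this record.
print(translate("ATGACTGAAC AGACAAGACC GGCGCCAGCC"))  # MTEQTRPAPA
print(translate("CTGCCGCGGG CAGCCAGTGT CGCCGCATAA"))  # LPRAASVAA
```

Both fragments reproduce the start and the end of the protein sequence listed above.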