Gene Rleg_3079 details

Gene Information

Locus tag: Rleg_3079
Symbol:
ID: 8013989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3076546
End bp: 3078795
Gene Length: 2250 bp
Protein Length: 749 aa
Translation table: 11
GC content: 59%
IMG OID: 644825647
Product: TonB-dependent hemoglobin/transferrin/lactoferrin family receptor
Protein accession: YP_002976875
Protein GI: 241205779
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01785] TonB-dependent heme/hemoglobin receptor family protein
            [TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein
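
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which the reported length implies) and that the CDS length includes the stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3076546, 3078795)
print(length)                  # 2250
print(protein_length(length))  # 749
```

Both results agree with the Gene Length and Protein Length fields of this record.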


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.578184
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.640721
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGTCC GGCATTGGCG CTCGGTTCTA TTGGTCTGCA CGGCAGCCAC AGTTGTTCTT 
CCTGTTTCGC AGTCTTTCGC GCAAAGCGCT CCGGCAACAG CGCCGCAAGC CGCGACGGAG
GAAAGCACCG TCCTGCAGAA GATCGTCGTC AAGGGCAAAC GCGTGGCGCC GGGCAGCGTC
GCCGACACGC CGCTGGCCAC CGAGATCACT GCCAAGCAGC TCGAAGAAAA GCAGGTCACC
AATTTTGACG ATATCGGCCG CAGCGTCGAT GCAGGCGTAA ATTATTCGCG TGGTGACGCA
GGCTTCAACC TGCGTGGCCT TTCGGGCGCT CGCATCCTGA CCACCGTCGA CGGCATTCCA
ATTCCCTATA TTTCGAACAG CTCGCGCCAG GGCGCTTTCG CCCTCAGCAA TGCCAATGGC
GGTGGCGATA CGTTCGATTT CGATTCGCTG TCCTCGCTCG ATATCGTGCG CGGCGCGGAT
TCGAGCAAGG GTGGTTCCGG CATGCTCGGC GGCGCCATCG TTCTCAATAC GCTGGAGCCG
GAGGATCTCA TTCCTGAAGG CCGCGACTGG GGTGCGATCG TCAAGTCGAC CTATGACAGC
GAAGACCGCA GCATTTCCGG CTCAGCCGCT GCCGCCAAGA AGATCGGCGG TACGTCGATC
CTGTTTCAGG GTGGCTACCG CAAGGGGCAT GAACGCGACA ACATGGGCGA CAATGACAGC
TATGGTCGTT TCCGCACCGA GGCGGATCCA GCTGATTTCG ATCAGCATAA TCTGCTCTTC
AAGCTGCGTC AGGAGCTGGA GGGTGGCCAC CGCATCGGTC TGACTGCCGA GCGTTTCAGG
CGTGACCTGC AGACTGATCT GCGTGAACAG CAAGGTACCG GACGCACTTA TATGATCGAC
AATTACGACG GTCGCGAACT GCGTGATCGT GATCGTGTGT CGCTCGATTA TGACTATGAA
GCGCAGTCTT CCGATGCGTT CTTCAGCAGT GCGCGCGCCA CGCTCTATTG GCAGGACTTG
AAGAAGGAAT CCGGCAGCAA GGGGCGTACC GCGGCGAATG TGGCCTATGG CCGTAACAAC
GAGATTGAAA ACGAAACCTG GGGCTTCAGC GGCACTGCAA CGAAGGATTT TGAGTATTCC
GGTCTCAGCC ACTCCGTTCG CATTGGCCTC GACGTCGGCG TTTCCAGCTG GAGCCAGTAT
AGCTGGGCCC TTTGCCCCAC ACCGACGACC TGCCCGTCGC TGAACAATCA GGCGGAAGTG
CCCAATGTCG ACAGCCAGAA TCTTGGGCTT GTTGTCGAAG ACAAGATCGA AATCGGCAAT
ACCGGCTTCG CGCTGACACC CGGCTTCCGC TTTGACTGGT TCAACTACAA TCCCTCGACC
GGCGGGAGTT TCGCAAGCAA TACCGGCCTT GCTCGTTTCG GGGACCTGAG CGACCGAACG
GAAGCCGGCC TGTCACCGAA GATTCTTGCG ACATACGACT TGACGCCCGA CGTGCAGCTC
TACCTGCAGC TGGCAGTCGG CTTCCGCGCA CCCACCGTGG ACGAACTCTA CAGCCGCTTC
TACAACCCGA CGGGCCGCTA CGCCCAGCTC GGCAATCCCG ATCTGCAACC CGAAATCGGC
CGCGGCGTCG AAATCGGCGC TAATTTTGAC ACGGGCGATT TTACCGGTCG GGTTGCGGCC
TTCCACACCC GCTACCAGAA TTTTATCGAG ACGGTGACGA GCGTCGACGC GACCGGCTTT
ACGGCATTCA ACTACACGAA CGTATCTGCG GCAACGATCT CGGGTATTGA AGCCAGCGCC
GCCAAGACAT TCAACAACGG CATCAATCTC CATACGTCAC TTGCCTATGC CTATGGCAGG
AACGAGGAAA CTGCCCAGCG CCTGCGATCG GTGGCACCGT TCAAGGCGAT AGTCGGCGGC
GGCTGGAGCA ACGAGACTTT CGGCTTCGAT CTTTCTTCGA CGCTTTCGGC GGGCATGCTT
ACCGATCATC TCGATACGCC CGTCACCAAC ACCACCGACA CGACCTTCGA TGCGCCGGGC
TACGCGATCG TCGACCTGAC CGGCTGGTGG ACGCCGGACC AGGTGCCGGG CCTGCGCGTG
CAGGCAGGTG TCTACAACAT CTTCGACCAG GAATATTTCA ACGCGCTCGC CGTGCGCGAT
GTCAACCTGA TCTCGACGGC GTCGCAGCCG CGCGATTGGT ATTCCGAGCC CGGACGCACC
TTCAAGATCT CGCTGACCAA GACTTTCTGA
 
Protein sequence
MIVRHWRSVL LVCTAATVVL PVSQSFAQSA PATAPQAATE ESTVLQKIVV KGKRVAPGSV 
ADTPLATEIT AKQLEEKQVT NFDDIGRSVD AGVNYSRGDA GFNLRGLSGA RILTTVDGIP
IPYISNSSRQ GAFALSNANG GGDTFDFDSL SSLDIVRGAD SSKGGSGMLG GAIVLNTLEP
EDLIPEGRDW GAIVKSTYDS EDRSISGSAA AAKKIGGTSI LFQGGYRKGH ERDNMGDNDS
YGRFRTEADP ADFDQHNLLF KLRQELEGGH RIGLTAERFR RDLQTDLREQ QGTGRTYMID
NYDGRELRDR DRVSLDYDYE AQSSDAFFSS ARATLYWQDL KKESGSKGRT AANVAYGRNN
EIENETWGFS GTATKDFEYS GLSHSVRIGL DVGVSSWSQY SWALCPTPTT CPSLNNQAEV
PNVDSQNLGL VVEDKIEIGN TGFALTPGFR FDWFNYNPST GGSFASNTGL ARFGDLSDRT
EAGLSPKILA TYDLTPDVQL YLQLAVGFRA PTVDELYSRF YNPTGRYAQL GNPDLQPEIG
RGVEIGANFD TGDFTGRVAA FHTRYQNFIE TVTSVDATGF TAFNYTNVSA ATISGIEASA
AKTFNNGINL HTSLAYAYGR NEETAQRLRS VAPFKAIVGG GWSNETFGFD LSSTLSAGML
TDHLDTPVTN TTDTTFDAPG YAIVDLTGWW TPDQVPGLRV QAGVYNIFDQ EYFNALAVRD
VNLISTASQP RDWYSEPGRT FKISLTKTF
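
The relationship between the gene sequence and the protein sequence above can be spot-checked by translating codons directly. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs mainly in the set of permitted start codons; this gene starts with ATG, so that difference does not matter here). The `translate` helper is illustrative, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence (frame 0).
print(translate("ATGATCGTCC GGCATTGGCG CTCGGTTCTA"))  # MIVRHWRSVL
# Last 30 bases of the gene sequence (also in frame, since 2220 is a multiple of 3).
print(translate("TTCAAGATCT CGCTGACCAA GACTTTCTGA"))  # FKISLTKTF
```

The two outputs match the first ten and last nine residues of the listed protein sequence, and the final TGA codon is the stop.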