Gene Rleg_3266 details


Gene Information

Locus tag: Rleg_3266
Symbol:
ID: 8014155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3267585
End bp: 3268940
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 64%
IMG OID: 644825825
Product: TonB family protein
Protein accession: YP_002977052
Protein GI: 241205956
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0810] Periplasmic protein TonB, links inner and outer membranes
TIGRFAM ID: [TIGR01352] TonB family C-terminal domain
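
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3267585
end_bp = 3268940
gene_length_bp = 1356
protein_length_aa = 451

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS ends in one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1356 0 451
```

Under these assumptions the span matches the stated gene length, is a whole number of codons, and gives the stated protein length.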


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0913504
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATTT CAGCGAAGAC CAGATCGAGA CAGGTGCTCA TCGGGGAGCC GGACGCTGAC 
GGCGGTCTGA ACGACAACAA CATGCATCCC GGCCACGAGC TTTCCGACCT GCGCAATGTG
CAGCGGCAGC CGGCTGGCGA GGCGGTGGTC CATTATGCGC GTTTCGCGCA GATACCCTCC
TTTCCCGATC ATCCGGAGGC CGAACCGACA GCCTCTGTTC CTGCGCCGCC GATGGATGCA
GCGGTAGAAA AGCAGGGAGA CGAGAAGGCA CCGGTGCGGA GACGGGTGGC GCTGACGTAC
ATCTGCTCGC TGATCTTTCA CGCGACACTG GCGGCTGTGC TGCTGATTGC TTTCCCCAAA
GCGCCGGAGG AGGCAATCGA AGAAGCCGGT CAGGCGATGA GCGTCGTCAT GTATGGCGAT
TCCGATATCG ACCAGGCTGC TGCTGGCGAG ACCGAAACGA CCATCCAGCA GGAAATCATT
CCCGAAGAAG TGCAGCCTGA CACGATCCAG CCGACGCAAA CGGCGGAAGT TCAACCGGAA
ACCGTGCAGC CGACCGAAGT TTCGCCCTTC GAAGCGCAGG ATCCTATTCA GCAGGCGCCG
GCACCGGAAG TGACGCGCGT TTCGCCCGAA ACGGCCGCGG CCGTCGAGCC GGAGATTCTG
GTGTCCGAGG TGCCGGCGGA GGAGTCCGTC GCGCAGCCAA TGTCGACGGT TGTTCCCGAA
CAGCAGCAGG TGCCGCTCGA CGCAGTGCCG CCATCCGAGG TACAGCCGAC TGCGGTCCAG
CCGAGCGAGG TGCAGCCTGC GGAAACCCCG GCGGAAGTCG CGGAGGAGAC TCCACAGGGG
GTGAAGCCCA TAGAAACGGC AGAAATCCAG CCGAAACCGG AACAGCCGCC CGAGGTTGTA
ACGCCGACGC CAAAGCCGAA AGTGGCACAG GAGAAGCCCA AGCCGGTCGA GAAGAAGCGT
CCGCCGCAGA AGGCCGCTGG CGACAAGGGG GAGGGGCAGC AGACTTCAAC GCGTGGCGTT
GCCGAAGGCA ATTCATCGGC GCAATCCGAC AACAGTTCGC AGGCCGCCAA CGGCAATAAC
GGGGTGGGGA CGGCCGCGAC CGCAAACTAT AAAGGCAAGG TCCGTAGCCG TATTCGGCGT
GCGATCAGGA AGCCCCGAGG TGTCGAAGGC AGCGTTGTTG TCACCTTCTC AGTCAACGGC
GGCGGCGGCC TGACCTCCGC TCGTGTCTCG CGTGGGTCCG GCGTTCCGGA GATCGATCAG
CTTGCTCTCG ATGCGGTGCG TCGTGCGGCA CCCTTCAGCC CCCCGCCCGG TGGGCAGGCG
ATGACCATGT CAGCGCCTAT CGAGATCGTG CCATGA
 
Protein sequence
MAISAKTRSR QVLIGEPDAD GGLNDNNMHP GHELSDLRNV QRQPAGEAVV HYARFAQIPS 
FPDHPEAEPT ASVPAPPMDA AVEKQGDEKA PVRRRVALTY ICSLIFHATL AAVLLIAFPK
APEEAIEEAG QAMSVVMYGD SDIDQAAAGE TETTIQQEII PEEVQPDTIQ PTQTAEVQPE
TVQPTEVSPF EAQDPIQQAP APEVTRVSPE TAAAVEPEIL VSEVPAEESV AQPMSTVVPE
QQQVPLDAVP PSEVQPTAVQ PSEVQPAETP AEVAEETPQG VKPIETAEIQ PKPEQPPEVV
TPTPKPKVAQ EKPKPVEKKR PPQKAAGDKG EGQQTSTRGV AEGNSSAQSD NSSQAANGNN
GVGTAATANY KGKVRSRIRR AIRKPRGVEG SVVVTFSVNG GGGLTSARVS RGSGVPEIDQ
LALDAVRRAA PFSPPPGGQA MTMSAPIEIV P
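
The sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first and last lines of the gene sequence are used as a sample.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C characters in a non-empty sequence."""
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence shown above, as a small sample.
first_line = "ATGGCAATTT CAGCGAAGAC CAGATCGAGA CAGGTGCTCA TCGGGGAGCC GGACGCTGAC"
last_line = "ATGACCATGT CAGCGCCTAT CGAGATCGTG CCATGA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first), first[:3])  # prints: 60 ATG
print(len(last), last[-3:])   # prints: 36 TGA
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would allow a comparison with the GC content field in the record.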