Gene Rleg2_0016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0016 
Symbol 
ID6978725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp15148 
End bp16299 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content66% 
IMG OID643394727 
Producttransporter-associated region 
Protein accessionYP_002279545 
Protein GI209547628 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.305218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACT TTACGACGAA GCCGGCGGCA GACGCCAAGG ACTCCGAGCC ATCCTCCTCT 
TCGGACGAGG CGGGCAGTAG TAGTCGGCCA TCCGGCCGAT CTCAATCCTT CTGGTCGCGC
GCCGCGCGCA TCCTTCGCCC GCAGCAGGGC TCGCTGCGTG AGGATCTTGC CGACGCGCTG
ATGACCGATG CGGCCGGCAA CGATGCTTTT TCGCCCGACG AACGGGCAAT GCTGCACAAT
ATCCTGCGCT TTCGCGAGGT GCGCGTCGCC GACGTGATGG TGCCGCGCGC CGATATCGAG
GCGGTCGACC AGAACATCAC CATCGGCGAA CTGATGATCC TGTTCGAGGA ATCCGGCCGC
TCGCGCATGC CCGTCTATGC CGACACGCTC GACGATCCGC GCGGCATGGT GCATATCCGC
GATCTGCTCT CCTATGTCGC CAAGCAGGCG CGCAACAAGC GCCGCGGCCC GACGAAACCG
GCCGCGGCCC TGCCGGCGCT GCCGGCAATC GAGGTTGCGC CCGAAAATAT CCAGAAGACC
ACGCGGCCGG CCAAGCCGAA TTTGGATCTC GCCCGCGTCG ACCTGCAGAA GACGCTGACG
GAAGCCGGCA TCATCCGCAA GATCCTGTTC GTGCCGCCGT CGATGCTGGC CTCCGACCTG
TTGCGCCGCA TGCAGGTGAA CCGCACGCAG ATGGCGCTTG TTATCGACGA ATATGGCGGC
ACCGACGGGC TCGCTTCGCA TGAGGACATC GTCGAAATGG TGGTCGGCGA CATCGACGAC
GAACATGACG ACGAAGAGGT GATGTTCAAG CGGGTCGCCG AAGACGTCTT CATCGCCGAC
GCCCGCGTCG AACTGGAAGA GATCGCCGCA GCGATCGGGC CGGATTTCGA CATTAGCGAG
CAGGTCGACG AGGTCGATAC GCTGGGCGGC CTGATCTTCT CCGCGCTCGG CCGCATCCCG
GTGCGTGGCG AGGTCGTGCA GGCGCTGCCG GGCTTCGAAT TCCACATCCT CGACGCCGAT
CCGCGCCGCA TCAAGCGGCT GCGCATCACC CGCAAGCGCC ATGCGATCCG CCGCCGCGCC
AAGGTGGATG GCGACGTCCC GCCCGGCTCC GATGCCGGCG ACGACCGGCC GGCGGAATCG
ACCGCCAACT GA
 
Protein sequence
MSDFTTKPAA DAKDSEPSSS SDEAGSSSRP SGRSQSFWSR AARILRPQQG SLREDLADAL 
MTDAAGNDAF SPDERAMLHN ILRFREVRVA DVMVPRADIE AVDQNITIGE LMILFEESGR
SRMPVYADTL DDPRGMVHIR DLLSYVAKQA RNKRRGPTKP AAALPALPAI EVAPENIQKT
TRPAKPNLDL ARVDLQKTLT EAGIIRKILF VPPSMLASDL LRRMQVNRTQ MALVIDEYGG
TDGLASHEDI VEMVVGDIDD EHDDEEVMFK RVAEDVFIAD ARVELEEIAA AIGPDFDISE
QVDEVDTLGG LIFSALGRIP VRGEVVQALP GFEFHILDAD PRRIKRLRIT RKRHAIRRRA
KVDGDVPPGS DAGDDRPAES TAN