Gene Rleg_6236 details


Gene Information

Locus tag: Rleg_6236
Symbol:
ID: 8016248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012852
Strand:
Start bp: 295671
End bp: 297878
Gene Length: 2208 bp
Protein Length: 735 aa
Translation table: 11
GC content: 58%
IMG OID: 644827541
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_002978741
Protein GI: 241258857
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
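
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 295671
END_BP = 297878


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = span_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))  # 2208 735
```

Under these assumptions the computed values agree with the listed Gene Length (2208 bp) and Protein Length (735 aa).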


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.573901
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.403128
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATTT CGACTGTGCC GCCAGACCGC AACACCTCGC TCTCATCGAT GCGCTACGAT 
CCCCGGCTTC AGGTAGAGAC CCGTTCGGAC GAAATTGATC TGACGTCAGG CCTGCGCCTT
ATTCGGCGAC GGATGGTCAT GATAATGGCC ATCATCACGG TACTTATGGC GGTTGCCGCA
ACCGTGATTT CGGGATTGAA GCCGACTTAT CATGCCGAGT CCCGGCTGAT CATCCACACG
CCTCTGGCGA CGAAGCTCGG CACCGACGAG TCCGGCCGCA ACGATCCGCT GGACGCCACA
TCTGAGACCG AACGGCTTCT TTCCAGAAGC ATCGCAGAGC GGGTAATCCG CGACCTGCGC
CTCAATGAAT GGCCGGAATT CAATCCGGCG CTGCAAGAAA TCTCGCCTAT CGACAAGATC
CGATCAATGC TTCGCGGTTG GGTCGACAGT GAAAAACCGT CCCTGCCGGT ACGGGACAGC
ATCGAGCCCA TAATCCCGAA GTACTACAAG GCCCTCCGCG TTTGGCGCGA TGGCCAGGGT
GACGTCATTC AGATCGGCTT CGATGCGAGC GACCCCGAAC TGGCCGCCTC GGTCCCGAAC
CGGCTCATCA GCATCTACCT CGAAGAACGC AAGGACAGTC TGCGTGGTCG AGTGGACGCA
GCAGAAGAAT GGATTCTGCT GCGCATCGAC GAACAGATGG GGCGCGCCAA GGCAGCACGC
GATGCCGCCG ACGGGTACCA GAAAATTATG GACGTCGCCT CAAGTGATGA CGATCAGGTT
GAACAGATCA AGTCGATCAT GGAGTTGGGC GAGCGGCGGA CAAAAATCGA ACAAAGCCGC
GTTGAAGCGA GAGCAACGAT ATCTGCACTC GAAGCGGCCG ACGACCCCTC GCTCGCCCTG
CAGAATATGG TCATTCCCGA CAGCATTGGC GCAATGCAAC GCGAGCTTCG TGCGCAGGAG
CAAGATCTCG AGCGCCTTCT CGAGACATAT GGCAACACGG CAGAATCCGT GCTCGACACG
CGGGCCAAGA TCCTTAAATC CCGCACCGAT CTCAGCCTTG CGACTGATCG ATATCTCCAA
TCGATACGCG CCAAGCTTGC GGCGCTCGAT CATGAGGACG ACGCAGTCCG GTCGGCATTG
GCGGCCGCTC ATGAGAAACG CGCTCGCTCC ACACTGGCGC AAACCGAGTT GGCCCGACTC
GAGCGCATGG CTGACAAAGA GCAGACGGCA CTGGACAAGC TCGACGAGCA GCGCCGTGGC
CTGGCTGCGC AAGCCATGTT GCCAGGCGCG GAACTGGAAG TCTTGTCACC GGCCGCGGTG
CCGCTGGCGC CAACGGGACG CGGGCGGCTT TTCTATTTGA TCGGCGCCCT TTTGGCTTCG
GTATCGATCG CGGTGACGGC CGCTTTCGTG GTCGAGATGC TGGACAACTC AGTCCGCAGT
TTCGACCAGA TGGCCGGAAT GTCGCGCATC GTACCCGCCG GATTCATCCC GCACCTCAAA
CGGAAAGACC GGAGAGATCC GTCGATGCTC TTCGGGAGCA TCCAAGACGG GATGTTTGAC
GAAGCAATTC GCTCCGTGAT GACTTCGCTC AAACAATCTA ACGGCGGGAA ACTGCCAAAC
ACGATTGTCG TGACGTCGGC TCACAGCGGA GAGGGCAAGT CGCTTGTCGC CAGATCCCTG
GCAATCGATC TCGCCGCCAA CGGAATTCCG GTGCTGCTTG TCGATGGCGA CCTCAGACTC
GGAAATCTCG ACTCGTTTTT CAAATCAGAG CTAAAGCAGG GGTTGAACGA ATTCCTGTGT
GGGCAGGCGG GATTGCGTGA CATCATCCAT CACCATCCAA GCGGCATCGA CTTCATCCCG
GCTGGCAATG CCAGTCTCCA TCGGCGCGTC CGTTTGACTG ATGCAGCTGA CATCGTGGCG
ATGGCCGCCT CACTGGGCCA AATCGTCATC TTCGACAGCG CGCCTGTGCT CGCCTCGGCT
GATACGATGC ATCTGACAGC GTTAGCAGAA AGAACGCTGG TGGTCGTAAA ATGGGGAAAG
ACGAGCCGTC GGGCGGTAGA GTTTTGCCTA CATCAGCTGA AGACCGCGCG CAACGCAGAG
ATTGCCGTTG CCATAAATAA CGTCAATACG AACAAACACG CCATGTACAA CTTCCGCGAT
TCAGAACTGT TTGCGAGCTC ACTGCGGAAG TACCACGAGT TCACGTGA
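
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs cleaning before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the blocks could be joined and how the GC fraction and the start and stop codons could be checked. The stop codon set is the one used by translation table 11; the exact method the database used to compute its GC content figure is not stated in the record.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one upper-case string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First displayed row of the gene sequence, pasted as shown in the record.
first_row = "ATGACTATTT CGACTGTGCC GCCAGACCGC AACACCTCGC TCTCATCGAT GCGCTACGAT"
row = clean_sequence(first_row)
print(len(row), row.startswith("ATG"), round(gc_fraction(row), 2))
```

Passing the full pasted sequence through `clean_sequence` and then `gc_fraction` and `ends_with_stop` gives values that can be compared with the GC content and Gene Length fields listed under Gene Information.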
 
Protein sequence
MTISTVPPDR NTSLSSMRYD PRLQVETRSD EIDLTSGLRL IRRRMVMIMA IITVLMAVAA 
TVISGLKPTY HAESRLIIHT PLATKLGTDE SGRNDPLDAT SETERLLSRS IAERVIRDLR
LNEWPEFNPA LQEISPIDKI RSMLRGWVDS EKPSLPVRDS IEPIIPKYYK ALRVWRDGQG
DVIQIGFDAS DPELAASVPN RLISIYLEER KDSLRGRVDA AEEWILLRID EQMGRAKAAR
DAADGYQKIM DVASSDDDQV EQIKSIMELG ERRTKIEQSR VEARATISAL EAADDPSLAL
QNMVIPDSIG AMQRELRAQE QDLERLLETY GNTAESVLDT RAKILKSRTD LSLATDRYLQ
SIRAKLAALD HEDDAVRSAL AAAHEKRARS TLAQTELARL ERMADKEQTA LDKLDEQRRG
LAAQAMLPGA ELEVLSPAAV PLAPTGRGRL FYLIGALLAS VSIAVTAAFV VEMLDNSVRS
FDQMAGMSRI VPAGFIPHLK RKDRRDPSML FGSIQDGMFD EAIRSVMTSL KQSNGGKLPN
TIVVTSAHSG EGKSLVARSL AIDLAANGIP VLLVDGDLRL GNLDSFFKSE LKQGLNEFLC
GQAGLRDIIH HHPSGIDFIP AGNASLHRRV RLTDAADIVA MAASLGQIVI FDSAPVLASA
DTMHLTALAE RTLVVVKWGK TSRRAVEFCL HQLKTARNAE IAVAINNVNT NKHAMYNFRD
SELFASSLRK YHEFT