Gene Rleg2_6432 details


Gene Information

Locus tag: Rleg2_6432
Symbol:
ID: 6983503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011371
Strand:
Start bp: 93542
End bp: 95746
Gene Length: 2205 bp
Protein Length: 734 aa
Translation table: 11
GC content: 56%
IMG OID: 643399429
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_002284185
Protein GI: 209552270
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
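
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(93542, 95746)
print(length)                     # 2205
print(protein_length_aa(length))  # 734
```

Both results agree with the Gene Length and Protein Length fields listed in the record.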


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.402351
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATTT CCACTCTTCC GCCGGACCGC AATCTTCCGA TCTCACCGAT GCGCTATGAT 
CCTCGCTTTC AGGTTGAGAC CAAGGCGGAC AACTTCGATC TGGCATCCGT CTTCGGTCTC
ATCAGGCGAA GAATGGTCGT GATCATCACG ATAACCGTGC TGCTGATGGT GCCCGCGGTG
ACGACGATCT CCGGGTTGAA GCCCACATAT CAGGCATGGG CCCGACTGAT TGTCCATCAG
CCTCTCGCGA CATCACTTGG TGGGGACGAT ACCAGCCGCG GTGATGCACT TGACGTGAAA
TCGGAGACTG AGCGGCTCTT ATCCAGAAGC ATCGCAGAGC GAGTCGTTCG CGAAATGCGG
CTCAACATGC AGCCGGAATT CAATCCAGCG CTACGCAAGG TCTCGTTCAT TCAAAGGATC
CGGGCAAAGC TGCGCAGCCT ATTCGACACG AAAAAACCCG CTTTGGCGGT AGGAGACGGC
ATCGACGCCA CAATCCCGGA ATATTATCAG GCGCTCGGTG TGTGGCACGA AGACAAGAGT
GAAGTGATCC AGATCAGCTT CACTGCGAGT GACGCTGAAC TCGCCGCAGC GGTGCCGAAT
CGGCTCGTCA GCATTTACCT GGATGAGCGG AAGGGTAGTG TCAGTGGCCG CCTGGTTTTA
GCGGAAGAAT GGATCCGACA GCGAATCACC GAACAGCAGG TTCGCGCGGA TATTGCTCGC
GACGCTGCCC GCAACTACCA AGAATCGATG GACATCGTCT CGAAAGAGGA CGCTCAGGAT
GAACGTATCA AAACACTCCT GGAACTCACC GGCCGCGAAG GAAAAATCGA GGAAAGCCGG
GTTGAGGTAA AGGCGACGAT ATCCGCGCTT GAGGCAACGG ATAACGCTTC GGTCGCGGTG
CAGAACACCA TTGTCCCGGA CAGCATTACC TCAAAGCAAC GCGATCTTCA CGCACAAGAG
CAAGATCTCG CCCGTCTTCT TGAGACATAT GACGGCAATG CCGAGGCCGT AGTGGATTTG
CGTAACAAGA TCGAACAATC CCGTATCGAT CTCGACCTTG CGACCAAACA ATATCTCCAG
GCAATGCGTG CCAAGCTCGC CGCACTTGAA CATGAAGCCA ACGCCGTGCA GTCCATCCTG
GCTGTCGCTC AGGAAAAACG CACACGCGAT GCCCTGGCGC AGAATGAATT GACGCGACTG
CAGCGTATTG CCGAAAAGGA GCAAGCGGCG CTCGACAAGC TTGAGGACCA GCGCCGTGAC
CTTGCCGCCA AGGCCATGCT GCCTGGGGCG GAATTGGAGG TTTTCTCACC GGCTTCAGTG
CCGCTCGTCT CACAGGGACG TGGACGGCTT TCCTATCTGA TAGGCGCCCT CCTGGCTGCA
ATCTCGGCAG CCGTGACGGC TGCTTTCGTG GTCGAAATGT GGGACCGGTC GGTCCGCAGT
TTCGACCAGC TTGCCGGAAT TGCACGCACC ATACCGGCTG GACTTATCCC GCGTCTGGCG
CGAAGTGAAA ATTCGAAGAC GATGTTCGCA CATATGCCGG TCCCGATGTT TGACGAAGCA
ATCCGTACCG TGGTGCTTTC ACTCAAGCAG TCCAATGGCG GCAAATTGCC GAACAGTATT
GTCGTTACCT CGGCTCATAG CGGAGATGGC AAATCGCTTG TCGCGCGATC GCTGGCCATG
GAACTTGCCG CTGCCGCGAT TCCGGTCCTG CTGGTCGATG GCGACCTTCG ACGGGGAAAA
CTCGATGCGT TTTTCAGGTC GGGGGTCATG TGCGGACTGA ACGAATTCCT GAACGGTCAG
GCCGATTTCC GCGACATCGT CTACCACCAT CCAAGCGGCA TCGATTTCAT TCCATCTGGA
AAATTCAGTC TCCAGCGGCG GGTCCGTCCG AACGGCCTGG CAGAAATCAT CGAGATGGCG
GTCGCGGCCG GCCAGATTGT CATCTTCGAC AGCGCGCCTG TACTCGCCTC GGCAGATACC
GTGCATTTGA CCTCCCTGGC GGAAAGAACA CTGGCAATTG TTAGATGGGG AAAGACGAGC
CGACGCGCGG TCGAGTTCAG CCTGCAGCAA ATGAAAAGCT CGCGAAATTC GGAAATCATC
GTCGCGATCA ACAACGTAAA CCCCGAAAAG CACGCATTGT ATAACTTCAG CGACTCGGAA
TTATTTTCGA AATCTCTGAT GAAATACTAT AAATTCAAAG CATGA
 
Protein sequence
MTISTLPPDR NLPISPMRYD PRFQVETKAD NFDLASVFGL IRRRMVVIIT ITVLLMVPAV 
TTISGLKPTY QAWARLIVHQ PLATSLGGDD TSRGDALDVK SETERLLSRS IAERVVREMR
LNMQPEFNPA LRKVSFIQRI RAKLRSLFDT KKPALAVGDG IDATIPEYYQ ALGVWHEDKS
EVIQISFTAS DAELAAAVPN RLVSIYLDER KGSVSGRLVL AEEWIRQRIT EQQVRADIAR
DAARNYQESM DIVSKEDAQD ERIKTLLELT GREGKIEESR VEVKATISAL EATDNASVAV
QNTIVPDSIT SKQRDLHAQE QDLARLLETY DGNAEAVVDL RNKIEQSRID LDLATKQYLQ
AMRAKLAALE HEANAVQSIL AVAQEKRTRD ALAQNELTRL QRIAEKEQAA LDKLEDQRRD
LAAKAMLPGA ELEVFSPASV PLVSQGRGRL SYLIGALLAA ISAAVTAAFV VEMWDRSVRS
FDQLAGIART IPAGLIPRLA RSENSKTMFA HMPVPMFDEA IRTVVLSLKQ SNGGKLPNSI
VVTSAHSGDG KSLVARSLAM ELAAAAIPVL LVDGDLRRGK LDAFFRSGVM CGLNEFLNGQ
ADFRDIVYHH PSGIDFIPSG KFSLQRRVRP NGLAEIIEMA VAAGQIVIFD SAPVLASADT
VHLTSLAERT LAIVRWGKTS RRAVEFSLQQ MKSSRNSEII VAINNVNPEK HALYNFSDSE
LFSKSLMKYY KFKA
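
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before they can be compared with the length and GC content fields of the record. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the GC calculation assumes only unambiguous bases (A, C, G, T) are present.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGACGATTT CCACTCTTCC GCCGGACCGC AATCTTCCGA TCTCACCGAT GCGCTATGAT"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line)))
```

Passing the full gene sequence to `clean_sequence` and taking its length gives a value to compare against the Gene Length field, and `gc_percent` gives a value to compare against the GC content field.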