Gene Rleg_4119 details


Gene Information

Locus tag: Rleg_4119
Symbol:
ID: 8015890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4198215
End bp: 4201526
Gene Length: 3312 bp
Protein Length: 1103 aa
Translation table: 11
GC content: 64%
IMG OID: 644826689
Product: polysaccharide biosynthesis protein
Protein accession: YP_002977899
Protein GI: 241206803
COG category: [R] General function prediction only
COG ID: [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid
TIGRFAM ID:
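
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon (the record does not state either convention; the variable names are illustrative):

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4198215
end_bp = 4201526
gene_length_bp = 3312
protein_length_aa = 1103

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks hold for the values listed here.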


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAAGG GTATTATCGC AAACTCGGTG ATGAACGGGG CGGCAGGCAT GCTGCTGCTT 
CTGACGGGCT TCGTTTCCTC GATCATAACC GCACGGTTGC TCGGGCCGGA AGCCAACGGC
ATCGTCGCCT TTTCGCTCTG GCTGGTGGTG ACCGGCGCCT CGATCGCCGA GCTCGGCTCC
AGCATCACGC TGCTGAAGAC ATTGCCGCAA CTTTCGGCGG AAGGCTTCGA CGCGCGCCGC
CGGCGAGGTT TTGCCGCCAT CCTCGTCAGC TTCATGATGT TCTCGACCGT CGTGCTGCTG
GCGCTCTACG CGCTGTTCTT CCTGACCTCC GAAAAGATGC ACTGGGCCGA CACGGCGCCA
TCCGTCGCTC TGGTCACGGG CGTGCTGTTC TTCGTCCAAG CGATCGGTTC CTTCGTCAAA
TTCTACCTGA TCGGCGAAAA GAAGCTGGGC GCCTTCTTTA AGCTGACGGT CGCCGTCTCG
ATCGTGCAGC TTGCCGGCGT TGCCGTCGGC GCCGTTTTCT ACGGCGTCGA AGGCGTACTC
GTCGGCTATG CGCTCGGCCA GCTGGTGCTG TTTTTCGCCA CGCTGCCGAT CCTTCTGGCG
CGGCGCGACT GGTGCGGGGT CTCGCTGAAA TATCTCGCCT CCTCCTCCGT CATCCTGTCG
ATCCAGTTCA TCATCGATTC CATTTTCCTC AACCGGCTCG AGCTGCTCTT CCTGCAGCAG
TTCTGGTCGG TGGAGATGGT CGGCTATTAT GCCGTCGGCC TGTCGATCGC CAATATCGCG
CTGCAACTGC CGATCCAGAT GACCGGCAGC CTGCTGCCCT ATTATTCCGA GCGACGGCAC
AGCAGCGACG ATTCGACATT GCCGGTCGAG GTCTTTACCG CCGTTACCCG CAGCATGGCC
TATATCGTGC TGCCGATGAG CCTGGGGCTT GCCGCGATCT CCAGCGAACT GGTGCTCGTG
GTGTTCGGCG AAGCGTTCCG CCGCAGCGGG ACGGTGGTGG CGCTGCTTGC GCTTGTTGCT
CCCGCCTATA CCTTCATGCA GATCCTCAGC CTCTACCTGC TGTCGATGGA CAGGGCCCGC
TCCCGCCTGA ACATCAGTGT GATAGGTGGC CTACTGATGG TAGCGGGTTG TTTACTGATC
GTACCTAGGC TTGCGGCCGA GGGCGCCGCA CTCGTGCGCA TCCTCGTATT CGTAGCGATG
TCGGTGATGA TGATCAGACA GACAGGATTC GGATCCCAGC TTTCGGGTCT CTACGCAAGC
TTGACGAAGG TGACGCTCGC CTCCGTGCTG TGTGCTTGCG GAGCGATTTC CGTGCTGGAA
TTCGTCCATG GTCCGGCCGG ATTGGTCGGC GCGATCATCG CCGGCGCATT CGCCTATTTT
GCAGCACTCC GGGTGCTGCG CGCCGTGCCA GGCGAGGATG TCGAAGTCAT GCGCTCCATT
CTCGAGAAGA TGCCATCCCT GCTGCGGCGA CCGGTCGGCC AGGCGATCAA TTTCATCGCG
CCACGGCTTC CCGGCGATCC CGATCGCGCC AAGGTCGCGC CCGGCGAATT CTCGCTGGAA
CCGGCCGAGG GCGCGGGACG CAGCGCCGCC CTGCCTGTGG TCTTCGACGG TACGATCGGG
CTGTTCATGC CTGAAAATCC TCTCGCCAAG AAACGTTCGG CCGCCGTGCT TTTCGTCAGC
CCCTGGGGCT TCGAGGAGAT GTGCAGCCGC AAATTCTTCC GGGTCGCGGC CGAGCACTTC
TCGGATATCG GCGTGCCGAG CCTGCGCTTC GACTATCGTG GCACGGGCGA TGCGCTTAAT
TTCGACGCAC TGCCGGCGAG GCTGGAAACC TGGGAAGATT CGATCCGCGC GGCCGCCGAC
AAGTTGAAGT CGCTGAGCGG CTGCGACCGT ATCATCCTCA TCGCACAGGG CCTCGGCGCG
ACGCTTGCCC ATCGCGTCGG TTCCTCGATC GAAGGCGTCG ACAGCCTCGT CATGCTGGCG
CCGGTGCTGA GCGGCCGGGC CTATCTGCGC GAACTCAACA TGTGGTCCAA GATCATCGAT
GCCGATCTCG GCCTCGGCAA GGAGCATGTC CAGACCACCA AGGTGCAGAT CGCCGGGCTC
GTCATGCCCG AAGAGATCGC CGCCGAGCTC GGCAAGCTCA ACATCACCTC GCCGCAGGGG
CTCGCAACCT CCCGCTACCT GATCCTCGAA CGTCCTGCCA AGGCCGACGA TACCGGCTTT
GCCGATGCGC TGAAGGCGCT TGGTGCCGAT GTCGAGCAGA AGGCCTTCGA GGGCTATGAC
GAGCTCGCCA CCAATCCGCT GTTTGCCAAG ACGCCGATGG CTGTCGTCGG GCTGCTGACG
ACATGGCTGG AGACAACGAC GACGCAGACA TCCGCCGCCC ATTCGCCGGC GGCGATCGAC
AACCCGCTGC TTGCCGGCGA TGATTTTGCG GAAACGCCGG TCCGTTTCGG AAGCCACAAT
CATCTCGTCG GCGTCGTCAG CCGGCCGCTC GGCGAGATCA AGGGTAACGC CGTGCTCTTC
CTGTCGACAG CCTATGACCG GCATGCCGGC TGGGGACGGA CGACGGTAGA CATGGCGCGC
GAGCTCGCCC GCCAAGGCGT CGTTTCGCTG CGCTTTGATT CCGCCAATGT CGGCGACAGC
CCGCCGCGGC CGGATGCGCC GGAACAGGTG CTTTATTCGG ATACCCAGAC CGGGGACGCG
GTCGCCGCGC TCGACCTGCT CGAAAGCGTC GTCGCCGGCC CCGTCATGGT CGCCGGCCGA
TGCAGCGGCG GCTATGTCGC CTTCCGCGCC GGCGTTGCCG ACGAGCGCTT GAAGGCTGTC
GTGTCGATCA ATCCGTTCGT CTATTACTGG GATCCTGACA TGCCGGTGCG CAGGGAGCAT
GTCGTCTCCG TTCCCCGCAG CCTCGATGAT TACAGCCAGC GCCTGGCGCG GCTCGACACG
CTGAAGCGGC TGCTGCGCGG CCAGGTGGAT GTCGTCTCGG CGTTGCAGAA CATCGTCATC
GCCGCCGGCC GCCGGCTGTC GCCGTGGATC GCGCCGGTGC TCGAGCTGCT TCCCGACCGG
CGCCATATCG CCCGCGAGGT CCGGCACTCC TTCGCGCTGT TCGGCAAGCG CAAGGTGCCG
CTGACGCTGA TCTACAGCGA GGGCGATGTC GGGCTCGACC ATGTCTACTT CCACTTCGGG
CCGCGCGGCG CCAGGCTTTC CCGCTATCCG AACGTGCGCC TGCTGATGCT GCCGGATGCC
GACCACAATC TGACGCCGCC GCAATCGCGC AAATTCGTGC TCGACGAGAT CATTCGTCTC
GCCAGAGCAT GA
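
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field of this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a sample, so the printed value describes that line rather than the whole gene:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here as a short sample.
sample = "ATGTCGAAGG GTATTATCGC AAACTCGGTG ATGAACGGGG CGGCAGGCAT GCTGCTGCTT"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to `clean_sequence` instead of `sample` would give the figure for the complete gene.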
 
Protein sequence
MSKGIIANSV MNGAAGMLLL LTGFVSSIIT ARLLGPEANG IVAFSLWLVV TGASIAELGS 
SITLLKTLPQ LSAEGFDARR RRGFAAILVS FMMFSTVVLL ALYALFFLTS EKMHWADTAP
SVALVTGVLF FVQAIGSFVK FYLIGEKKLG AFFKLTVAVS IVQLAGVAVG AVFYGVEGVL
VGYALGQLVL FFATLPILLA RRDWCGVSLK YLASSSVILS IQFIIDSIFL NRLELLFLQQ
FWSVEMVGYY AVGLSIANIA LQLPIQMTGS LLPYYSERRH SSDDSTLPVE VFTAVTRSMA
YIVLPMSLGL AAISSELVLV VFGEAFRRSG TVVALLALVA PAYTFMQILS LYLLSMDRAR
SRLNISVIGG LLMVAGCLLI VPRLAAEGAA LVRILVFVAM SVMMIRQTGF GSQLSGLYAS
LTKVTLASVL CACGAISVLE FVHGPAGLVG AIIAGAFAYF AALRVLRAVP GEDVEVMRSI
LEKMPSLLRR PVGQAINFIA PRLPGDPDRA KVAPGEFSLE PAEGAGRSAA LPVVFDGTIG
LFMPENPLAK KRSAAVLFVS PWGFEEMCSR KFFRVAAEHF SDIGVPSLRF DYRGTGDALN
FDALPARLET WEDSIRAAAD KLKSLSGCDR IILIAQGLGA TLAHRVGSSI EGVDSLVMLA
PVLSGRAYLR ELNMWSKIID ADLGLGKEHV QTTKVQIAGL VMPEEIAAEL GKLNITSPQG
LATSRYLILE RPAKADDTGF ADALKALGAD VEQKAFEGYD ELATNPLFAK TPMAVVGLLT
TWLETTTTQT SAAHSPAAID NPLLAGDDFA ETPVRFGSHN HLVGVVSRPL GEIKGNAVLF
LSTAYDRHAG WGRTTVDMAR ELARQGVVSL RFDSANVGDS PPRPDAPEQV LYSDTQTGDA
VAALDLLESV VAGPVMVAGR CSGGYVAFRA GVADERLKAV VSINPFVYYW DPDMPVRREH
VVSVPRSLDD YSQRLARLDT LKRLLRGQVD VVSALQNIVI AAGRRLSPWI APVLELLPDR
RHIAREVRHS FALFGKRKVP LTLIYSEGDV GLDHVYFHFG PRGARLSRYP NVRLLMLPDA
DHNLTPPQSR KFVLDEIIRL ARA
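
The protein sequence can be related back to the gene sequence by codon translation. The following is a minimal sketch, not the method used to produce this record. It assumes that translation table 11 assigns amino acids to codons exactly as the standard code does and differs only in which start codons are permitted; because this gene starts with ATG, no special handling of the first codon is included. The function name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
first_block = translate("ATGTCGAAGG GTATTATCGC AAACTCGGTG")
last_codons = translate("GCCAGAGCAT GA")
print(first_block)  # MSKGIIANSV
print(last_codons)  # ARA
```

Both results agree with the start (MSKGIIANSV) and end (ARA) of the protein sequence listed above, and the final TGA codon is read as a stop.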